I whipped together a quick video to somewhat explain this idea.
I forgot to mention GTK+ UVC Viewer, which is what I used to record from the webcam on Ubuntu. It worked excellently.
From watching it once, I see that I should probably talk more slowly. For the most part, the video seemed in sync with the audio. What did you think about the quality and/or the idea?