|
The University of Washington is working on an amazing voice recognition system called VJ which stands for Vocal Joystick. No it doesn’t do text to speech like everyone else is doing, it allows simple vocal sounds control your computer mouse. After watching the videos it looks much easier than the other technologies I have seen in the past. The VJ system consists of three main components: acoustic signal processing, pattern recognition and motion control. First, the signal processing module extracts short-term acoustic features, such as energy, autocorrelation coefficients, linear prediction coeffients and mel frequency cepstral coefficients (MFCC). Signal conditioning and analysis techniques are needed for accurate estimation of these features. Next, these features are piped into the pattern recognition module, where energy smoothing, pitch and formant tracking, vowel classification and discrete sound recognition take place. This stage involves statistical learning techniques such as neural networks and dynamic Bayesian networks. Finally, energy, pitch, vowel quality and discrete sound become acoustic parameters to be transformed into direction, speed and other motion related parameters. The application driver takes the motion control parameters and launches corresponding actions.”
|
November 6th, 2007
Huh.
Interesting.
Though I don’t see a whole lot of real world possibilities…
November 6th, 2007
Muito legal mesmo, outra ideia simples e genial…
November 8th, 2007
GREEEEEEEAT!
November 12th, 2007
[…] You can read the rest of this blog post by going to the original source, here […]
November 12th, 2007
[…] (more…) […]