Monday, September 16, 2013

Automatic Speech Recognition with Google

I recently found out that Google Chrome has a speech recognition component.



(Yes, this video is already 2 years old!) This component utilizes a Web service to do the actual speech recognition. The Web service isn't available officially, but can be accessed without Chrome to perform some quick speech recognition. Here's a script that does the work for you:

The script is mostly based on existing work by Sunil. I had to modify a few things to get it working:

  • Fix the fancy quotation marks
  • Fix the wget line (it wasn't outputting to the correct file)
Other than that, full credit should go to Sunil.  

It only seems to work well for smaller files (no more than a couple of seconds of audio). For larger file, the response from the Web service appears to be empty.

You can find more details about the API in this gist by alotaiba.