简体   繁体   中英

Google Cloud Speech API real time recognition

I am developing a Python application for real-time translation. I need to recognize speech in real time: as user says something it automatically sends this piece of audio to Google Speech API and returns a text. So I want the recognized text appearing immediately while speaking.

I've found Streaming Speech Recognition but it seems that I still need to record the full speech first and then send it to the server. Also, there are no examples of how to use it in Python

Is it possible to do this with Google Speech API?

You can do it with Google Speech API.

But, it has a 1 minute content limit.

Please check the link below.

https://cloud.google.com/speech/quotas

So you have to restart every 1 minute.

and the link below is example code of microphone streaming by python.

https://cloud.google.com/speech/docs/streaming-recognize#speech-streaming-recognize-python

Check this link out:

https://github.com/Uberi/speech_recognition/blob/master/examples/microphone_recognition.py

This is an example for obtaining audio from the microphone. There are several components for the recognition process. In my experience the Sphinx Recognition lacks on accuracy. The Google Speech Recognition works very well.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM