[英]Google speech recognition API not listening
I was trying the below speech recognition code using Google Speech API. 我正在尝试使用Google Speech API进行以下语音识别代码。
#!/usr/bin/env python3
# Requires PyAudio and PySpeech.
import speech_recognition as sr
# Record Audio
r = sr.Recognizer()
with sr.Microphone() as source:
print("Say something!")
audio = r.listen(source)
# Speech recognition using Google Speech Recognition
try:
# for testing purposes, we're just using the default API key
# to use another API key, use `r.recognize_google(audio, key="GOOGLE_SPEECH_RECOGNITION_API_KEY")`
# instead of `r.recognize_google(audio)`
print("You said: " + r.recognize_google(audio))
except sr.UnknownValueError:
print("Google Speech Recognition could not understand audio")
except sr.RequestError as e:
print("Could not request results from Google Speech Recognition service; {0}".format(e))
But I am getting only this. 但是我只得到这个。
jobin@jobin-Satellite-A665:~/scr$ python3 scr.py
Say something!
Even though I say something, nothing happens. 即使我说了什么,也没有任何反应。
I don't have an external microphone. 我没有外接麦克风。 I think this script will work with my laptop's in-built microphone.
我认为此脚本可以与笔记本电脑的内置麦克风配合使用。
I have tested my laptop's microphone here . 我在这里测试了笔记本电脑的麦克风。 It is working fine.
一切正常。
Am I missing anything? 我有什么想念的吗?
You could test that pyAudio is finding your microphone by running the following: 您可以通过运行以下命令来测试pyAudio是否正在找到您的麦克风:
"""PyAudio example: Record a few seconds of audio and save to a WAVE file."""
import pyaudio
import wave
CHUNK = 1024
FORMAT = pyaudio.paInt16
CHANNELS = 2
RATE = 44100
RECORD_SECONDS = 5
WAVE_OUTPUT_FILENAME = "output.wav"
p = pyaudio.PyAudio()
stream = p.open(format=FORMAT,
channels=CHANNELS,
rate=RATE,
input=True,
frames_per_buffer=CHUNK)
print("* recording")
frames = []
for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
data = stream.read(CHUNK)
frames.append(data)
print("* done recording")
stream.stop_stream()
stream.close()
p.terminate()
wf = wave.open(WAVE_OUTPUT_FILENAME, 'wb')
wf.setnchannels(CHANNELS)
wf.setsampwidth(p.get_sample_size(FORMAT))
wf.setframerate(RATE)
wf.writeframes(b''.join(frames))
wf.close()
And playing the resulting output.wav
file. 并播放生成的
output.wav
文件。
Once you are confident that you are getting some audio I would add a couple of print statements to the original code to locate how far you are getting, ie: 一旦您确定自己会获得一些音频,我将在原始代码中添加一些打印语句以定位您得到的声音,即:
print("Audio captured!") # before trying to recognise see if you have something
and 和
print('Recognition Ended') # at the end of the script
This will let you see how far you are getting. 这将使您看到要走多远。
Next you may need to find out which is the default audio device with: 接下来,您可能需要找出具有以下内容的默认音频设备:
import pyaudio
print(pyaudio.pa.get_default_input_device())
Which should tell you the default input device, this was one on my machine so used this: 哪个应该告诉您默认输入设备,这是我机器上的设备,所以使用了以下命令:
with sr.Microphone(1) as source: # Specify which input device to use
r.adjust_for_ambient_noise(source, 1) # Adjust for ambient
print("Say something!")
audio = r.listen(source, 2) # 2 Second time out
print('Done Listening sample size =', len(audio.frame_data))
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.