简体   繁体   English

非常简单的语音/语音识别算法

[英]Algorithm for very simple voice/speech recognition

I'm writing a game for Google Glass, but unfortunately SpeechRecognizer API isn't available on the current builds on Google Glass GDK. 我正在为Google Glass编写游戏,但是很遗憾,SpeechRecognizer API在Google Glass GDK的当前版本中不可用。

So I've been thinking about implementing an algorithm for a very simple voice recognition. 因此,我一直在考虑实现一种非常简单的语音识别算法。 Let's say I want to recognize only: "Yes" and "No". 假设我只想识别:“是”和“否”。
Do you know any example code or any helpful resources to help me in implementing this ? 您是否知道任何示例代码或任何有用的资源来帮助我实现这一目标?
Is it so hard that I should drop the idea and go with big frameworks like CMUSphinx ? 我放弃这个想法并选择像CMUSphinx这样的大型框架难吗?

What about recognizing: up, down, right, left or numbers from 1 to 10 ? 如何识别:上,下,右,左或1到10之间的数字?

As I know, there often used transition to the frequency domain by fast Fourier transform (FFT) and it analyzing. 据我所知,通常通过快速傅立叶变换(FFT)进行分析并转换到频域。 Also need some dictionary of speeched words for frequency correlation. 还需要一些有声单词词典来进行频率相关。

Please see this links: 请查看此链接:

CMU Sphinx have java implementation. CMU Sphinx有Java实现。

David Wagner have a good article and matlab implementation. David Wagner有一篇不错的文章和matlab实现。

PS Ohh, if you speak in russian, why you don't read this article - very simple, with java examples. PS哦,如果您用俄语讲,为什么不读这篇文章-非常简单,带有Java示例。

PPS Honestly, I never use this framework, but if you have only a superficial knowledge about speech recognition, robust and easyest way is to use existing complete solutions like frameworks or libraries, otherwise you need spend time to possess the necessary knowledge threshold. PPS老实说,我从不使用此框架,但是,如果您仅对语音识别有一个肤浅的了解,那么健壮和最简单的方法就是使用现有的完整解决方案,例如框架或库,否则,您需要花费时间来拥有必要的知识门槛。 In this case you can read this article. 在这种情况下,你可以阅读文章。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM