Is there a way to use the raw audio data with technologies such as Alexa or DialogFlow.ai?
I am developing an application that uses voice recognition to help blind people learn music. To avoid reinventing the wheel, I am considering building on something such as DialogFlow.ai or even Amazon Alexa. However, there are times when I want to use the raw audio data in order to check whether instruments are in tune. With these technologies, by default, all audio input is interpreted and consequently converted to text. So, is there a way to access the raw audio data instead of having the user's speech interpreted?
For a number of reasons (mainly security), Amazon Alexa and other similar technologies will not let you access the raw audio input of the user. Using Amazon Alexa to capture the audio of an instrument is therefore not a plausible way to implement a tuner. You should implement your own audio capture and, where useful, use it in conjunction with Alexa/DialogFlow for command interpretation.
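To illustrate the "capture the audio yourself" part: once you record mono PCM samples with any audio library of your choice (e.g. PyAudio or sounddevice), estimating the pitch for a tuner can be done with a simple autocorrelation. This is a minimal sketch, not a production tuner; the `estimate_pitch` helper and the synthesized test tone are my own illustration, not part of any Alexa/DialogFlow API.

```python
import numpy as np

def estimate_pitch(samples, sample_rate, fmin=50.0, fmax=2000.0):
    """Estimate the fundamental frequency (Hz) of a mono signal
    by finding the strongest autocorrelation peak within the
    plausible pitch range [fmin, fmax]."""
    samples = samples - np.mean(samples)          # remove DC offset
    corr = np.correlate(samples, samples, mode="full")
    corr = corr[len(corr) // 2:]                  # keep non-negative lags only
    min_lag = int(sample_rate / fmax)             # smallest period to consider
    max_lag = int(sample_rate / fmin)             # largest period to consider
    lag = min_lag + np.argmax(corr[min_lag:max_lag])
    return sample_rate / lag

# Example: a synthesized A4 (440 Hz) sine stands in for captured audio.
sample_rate = 44100
t = np.arange(0, 0.1, 1 / sample_rate)
tone = np.sin(2 * np.pi * 440.0 * t)
pitch = estimate_pitch(tone, sample_rate)         # close to 440 Hz
```

A tuner would then compare `pitch` against the target note's frequency and tell the user whether to tune up or down, while Alexa/DialogFlow handles only the spoken commands ("tune the A string").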