Azure Text to Speech (Cognitive Services) in a web app - how to stop it from outputting audio?

Question

I'm using Azure Cognitive Services for Text to Speech in a web app.

I return the bytes to the browser and it works great. However, on the server (or local machine) the speechSynthesizer.SpeakTextAsync(inp) line also plays the audio through the speaker.

Is there a way to turn this off? This runs on a web server, and even if I ignore the playback, there is a delay while it outputs the audio before sending back the data.

Here's my code:

            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);

            speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";
            speechConfig.OutputFormat = OutputFormat.Detailed;

            using (var speechSynthesizer = new SpeechSynthesizer(speechConfig))
            {
                // todo - how to disable it saying it here?
                var speechSynthesisResult = await speechSynthesizer.SpeakTextAsync(inp);
                return Convert.ToBase64String(speechSynthesisResult.AudioData);
            }

Answer 1

You can pass an AudioConfig to the SpeechSynthesizer.
In this AudioConfig object you can specify the path of a .wav file on the server.
Whenever you call SpeakTextAsync, the audio is then written to that .wav file instead of being played through the speaker.
You can read this audio file later and apply your own logic to it.
Just add the following line before creating the SpeechSynthesizer object:

            var audioconfig = AudioConfig.FromWavFileOutput(filepath);

Here filepath is the location of the .wav file, as a string.

Complete code:

            string filepath = "<file path>";
            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
            // Send synthesized audio to a .wav file instead of the default speaker.
            var audioconfig = AudioConfig.FromWavFileOutput(filepath);

            speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";
            speechConfig.OutputFormat = OutputFormat.Detailed;

            using (var speechSynthesizer = new SpeechSynthesizer(speechConfig, audioconfig))
            {
                // The audio goes to filepath, so nothing is played on the speaker.
                var speechSynthesisResult = await speechSynthesizer.SpeakTextAsync(inp);
                return Convert.ToBase64String(speechSynthesisResult.AudioData);
            }
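
If you only need the bytes in memory and do not want a file on disk at all, another option that should work is to pass a null AudioConfig to the SpeechSynthesizer constructor. As far as I know, the Speech SDK then skips the default speaker and only fills AudioData on the result. The sketch below assumes the Microsoft.CognitiveServices.Speech NuGet package; the class name SilentSpeech and the method name SynthesizeToBase64Async are made up for illustration, and speechKey, speechRegion and inp mirror the variables in the question.

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.CognitiveServices.Speech;
using Microsoft.CognitiveServices.Speech.Audio;

// Hypothetical helper class, not part of the SDK.
public static class SilentSpeech
{
    public static async Task<string> SynthesizeToBase64Async(
        string speechKey, string speechRegion, string inp)
    {
        var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
        speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";

        // Assumption: a null AudioConfig means "no audio output device".
        // A typed variable is used so the compiler picks the
        // (SpeechConfig, AudioConfig) constructor without ambiguity.
        AudioConfig noAudioOutput = null;

        using (var speechSynthesizer = new SpeechSynthesizer(speechConfig, noAudioOutput))
        {
            var result = await speechSynthesizer.SpeakTextAsync(inp);

            // Fail loudly instead of returning an empty payload to the browser.
            if (result.Reason != ResultReason.SynthesizingAudioCompleted)
            {
                throw new InvalidOperationException(
                    $"Speech synthesis did not complete: {result.Reason}");
            }

            return Convert.ToBase64String(result.AudioData);
        }
    }
}
```

Compared with the .wav file approach, this avoids having concurrent requests write to the same file path on the server, but it is worth confirming the behavior against the SDK version you are using.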

Solution 1 (accepted, 2022-12-15 12:31:04)