Azure Text to Speech (Cognitive Services) in a web app - how do I stop it from outputting audio?

Question

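I am using the Azure Text to Speech Cognitive Service in a web app.

I return the bytes to the browser and that works fine, but on the server (or my local machine) the line speechSynthesizer.SpeakTextAsync(inp) plays the audio through the speakers.

Is there a way to turn this off, since it is running on a web server? (Even if I ignore it, there is a delay while it plays the audio before the data is sent back.)

Here is my code...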

            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);

            speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";
            speechConfig.OutputFormat = OutputFormat.Detailed;

            using (var speechSynthesizer = new SpeechSynthesizer(speechConfig))
            {
                // todo - how to disable it saying it here?
                var speechSynthesisResult = await speechSynthesizer.SpeakTextAsync(inp);
                return Convert.ToBase64String(speechSynthesisResult.AudioData);
            }

Answer 1

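What you can do is add an AudioConfig to the SpeechSynthesizer.
In this AudioConfig object you can specify the path of a .wav file that already exists on the server.
Whenever you call SpeakTextAsync, the audio is then redirected to the .wav file instead of the speakers.
You can read this audio file later and apply your own logic to it.
Just add the following line before creating the speechSynthesizer object, and pass audioconfig to its constructor.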

 var audioconfig = AudioConfig.FromWavFileOutput(filepath);

Here, filepath is the location of the .wav file, as a string.

Full code:

            // Requires: using Microsoft.CognitiveServices.Speech.Audio;
            string filepath = "<file path>";
            var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
            var audioconfig = AudioConfig.FromWavFileOutput(filepath);

            speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";
            speechConfig.OutputFormat = OutputFormat.Detailed;

            // Passing audioconfig sends the audio to the .wav file instead of the speakers.
            using (var speechSynthesizer = new SpeechSynthesizer(speechConfig, audioconfig))
            {
                var speechSynthesisResult = await speechSynthesizer.SpeakTextAsync(inp);
                return Convert.ToBase64String(speechSynthesisResult.AudioData);
            }
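
If writing a file on the server is not desirable (for example, concurrent requests would share the same .wav path), one alternative worth trying is to pass a null AudioConfig to the SpeechSynthesizer constructor. As far as I know, the Speech SDK then opens no output device and the synthesized bytes are still available in AudioData. The following is a minimal sketch under that assumption; the class and method names (SpeechHelper, SynthesizeToBase64Async) are hypothetical, while speechKey, speechRegion and inp mirror the names used in the question.

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.CognitiveServices.Speech;
using Microsoft.CognitiveServices.Speech.Audio;

public static class SpeechHelper
{
    // Hypothetical helper: synthesizes inp and returns the audio as a Base64 string.
    public static async Task<string> SynthesizeToBase64Async(
        string speechKey, string speechRegion, string inp)
    {
        var speechConfig = SpeechConfig.FromSubscription(speechKey, speechRegion);
        speechConfig.SpeechSynthesisVoiceName = "fa-IR-FaridNeural";

        // Assumption: a null AudioConfig means no speaker and no file output;
        // the explicit cast selects the (SpeechConfig, AudioConfig) overload.
        using (var speechSynthesizer = new SpeechSynthesizer(speechConfig, (AudioConfig)null))
        {
            var result = await speechSynthesizer.SpeakTextAsync(inp);

            // Fail loudly instead of returning empty audio on cancellation or error.
            if (result.Reason != ResultReason.SynthesizingAudioCompleted)
            {
                throw new InvalidOperationException(
                    $"Speech synthesis did not complete: {result.Reason}");
            }

            return Convert.ToBase64String(result.AudioData);
        }
    }
}
```

A controller action could then call `await SpeechHelper.SynthesizeToBase64Async(speechKey, speechRegion, inp)` and return the string to the browser as before.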

Solution 1
1 accepted 2022-12-15 12:31:04
