如何在 Google Text to Speech 中调整发音音高 API

Question

I used the Google Text2Speech API, it works well but I'd like to adjust the pitch.我使用了 Google Text2Speech API，它运行良好，但我想调整音调。 I used the gTTS.我用的是 gTTS。

tts = gTTS("ご返信ありがとうございます。", lang = 'ja')

How should I go ahead?我应该怎么提前go？ Thanks in advance!提前致谢！

Answer 1

Looking through the official documentation the text2speech API has an AudioConfig function where you can pass in the pitch.查看官方文档，text2speech API 有一个AudioConfig function，您可以在其中传递音高。 The pitch can be changed in the range [-20.0, 20.0] .间距可以在[-20.0, 20.0]范围内更改。 Here is a workinng example.这是一个工作示例。

from google.cloud import texttospeech

# Instantiates a client
client = texttospeech.TextToSpeechClient()

# Set the text input to be synthesized
synthesis_input = texttospeech.SynthesisInput(text="Hello, World!")

# Build the voice request, select the language code ("en-US") and the ssml
# voice gender ("neutral")
voice = texttospeech.VoiceSelectionParams(
    language_code="en-US", ssml_gender=texttospeech.SsmlVoiceGender.NEUTRAL
)

# Select the type of audio file you want returned
audio_config = texttospeech.AudioConfig(
    pitch=-1.20,
    audio_encoding=texttospeech.AudioEncoding.MP3
)

# Perform the text-to-speech request on the text input with the selected
# voice parameters and audio file type
response = client.synthesize_speech(
    input=synthesis_input, voice=voice, audio_config=audio_config
)

# The response's audio_content is binary.
with open("output.mp3", "wb") as out:
    # Write the response to the output file.
    out.write(response.audio_content)
    print('Audio content written to file "output.mp3"')

如何在 Google Text to Speech 中调整发音音高 API

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-10-01 11:27:09

如何在 Google Text to Speech 中调整发音音高 API

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-10-01 11:27:09

解决方案1
1 已采纳 2021-10-01 11:27:09