How to adjust Pronunciation Pitch in Google Text to Speech API
I am using the Google Text-to-Speech API through gTTS. It works well, but I would like to adjust the pitch.
tts = gTTS("ご返信ありがとうございます。", lang = 'ja')
How should I go about this? Thanks in advance!
Looking through the official documentation, the Text-to-Speech API has an AudioConfig class where you can pass in the pitch. The pitch can be changed in the range [-20.0, 20.0]. Note that the example uses the google-cloud-texttospeech client library rather than gTTS. Here is a working example.
from google.cloud import texttospeech
# Instantiates a client
client = texttospeech.TextToSpeechClient()
# Set the text input to be synthesized
synthesis_input = texttospeech.SynthesisInput(text="Hello, World!")
# Build the voice request, select the language code ("en-US") and the ssml
# voice gender ("neutral")
voice = texttospeech.VoiceSelectionParams(
    language_code="en-US", ssml_gender=texttospeech.SsmlVoiceGender.NEUTRAL
)
# Select the type of audio file you want returned and set the pitch
audio_config = texttospeech.AudioConfig(
    pitch=-1.20,
    audio_encoding=texttospeech.AudioEncoding.MP3
)
# Perform the text-to-speech request on the text input with the selected
# voice parameters and audio file type
response = client.synthesize_speech(
    input=synthesis_input, voice=voice, audio_config=audio_config
)
# The response's audio_content is binary.
with open("output.mp3", "wb") as out:
    # Write the response to the output file.
    out.write(response.audio_content)
print('Audio content written to file "output.mp3"')
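
Since the pitch has to stay within [-20.0, 20.0], it may help to clamp a user-supplied value before building the AudioConfig. The sketch below is only an illustration: `clamp_pitch` and `audio_config_kwargs` are hypothetical helper names, not part of the google-cloud-texttospeech library, and the code deliberately avoids importing the client so it runs on its own.

```python
# Hypothetical helpers (not part of google-cloud-texttospeech).
# They keep a requested pitch inside the [-20.0, 20.0] range quoted above.

PITCH_MIN = -20.0
PITCH_MAX = 20.0


def clamp_pitch(pitch: float) -> float:
    """Return pitch limited to the range [PITCH_MIN, PITCH_MAX]."""
    return max(PITCH_MIN, min(PITCH_MAX, float(pitch)))


def audio_config_kwargs(pitch: float) -> dict:
    """Build the pitch keyword argument intended for texttospeech.AudioConfig.

    The caller is assumed to add audio_encoding and any other fields.
    """
    return {"pitch": clamp_pitch(pitch)}


if __name__ == "__main__":
    # Show how a few requested values are adjusted.
    for requested in (-35, -1.2, 0, 4.5, 50):
        print(requested, "->", clamp_pitch(requested))
```

The result could then be passed along as `texttospeech.AudioConfig(audio_encoding=texttospeech.AudioEncoding.MP3, **audio_config_kwargs(value))`. For the Japanese sentence from the question, the same request should work with the text set to "ご返信ありがとうございます。" and `language_code="ja-JP"`, assuming a Japanese voice is available for your project.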