简体   繁体   中英

IBM watson Speech to text integration

Hi I want to transcript running video using speech to text of IBM Watson library. My code is:

new RecognizeOptions.Builder()
                    .audio(audio)
                    .contentType("audio/mp3")
                    .model("en-US_BroadbandModel")
                    .interimResults(true)
                    .inactivityTimeout(2000)
                    //TODO: Uncomment this to enable Speaker Diarization
                    .speakerLabels(true)
                    .build();

I am getting following error: java.lang.RuntimeException: unable to transcode data stream audio/mpeg -> audio/x-float-array

Please help.

您将需要通过ffmpeg之类的视频来提取音频轨道,并将音频轨道传递给“语音转文本”服务。

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM