I've just started thinking about my new project, which is about TTS and STT (text-to-speech and speech-to-text), and I ran into some tricky problems that need to be solved.
I'm very new to this topic so be patient with me.
Regards, Michael
How do I record audio? Are there any other approaches for recording meaningful audio? Maybe with a threshold, so it only records when a certain noise level is reached?
You record audio in small chunks of 0.1 seconds and process them one by one, accumulating the results. Once a keyword is detected, you perform the action. There is no need to store the result in a WAV file; you can keep everything in memory. For an example, you can look at existing software:
https://github.com/castorini/honk
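To make the chunk idea and the threshold from the question concrete, here is a minimal sketch in plain Python. It is not how honk does it; the 16 kHz, 16-bit mono format, the `threshold` of 500 and the names `rms` and `voiced_segments` are all assumptions picked for illustration. Each 0.1 second chunk is kept in memory only while its RMS level is above the threshold, plus a short tail of silence.

```python
import array
import math
from typing import Iterable, Iterator, List

SAMPLE_RATE = 16000                  # assumed sample rate in Hz
CHUNK_SAMPLES = SAMPLE_RATE // 10    # 0.1 s of audio per chunk


def rms(chunk: bytes) -> float:
    """Root-mean-square level of a chunk of 16-bit native-endian PCM."""
    samples = array.array("h")
    samples.frombytes(chunk)
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def voiced_segments(
    chunks: Iterable[bytes],
    threshold: float = 500.0,        # assumed level, tune for your microphone
    max_silent_chunks: int = 5,      # 0.5 s of silence closes a segment
) -> Iterator[bytes]:
    """Yield in-memory audio segments that start with a loud chunk."""
    buffer: List[bytes] = []
    silent = 0
    for chunk in chunks:
        if rms(chunk) >= threshold:
            buffer.append(chunk)     # loud chunk: keep it, reset silence count
            silent = 0
        elif buffer:
            buffer.append(chunk)     # trailing silence inside a segment
            silent += 1
            if silent >= max_silent_chunks:
                yield b"".join(buffer)
                buffer = []
                silent = 0
    if buffer:                       # flush whatever is left at end of input
        yield b"".join(buffer)
```

Each segment that comes out could then be handed to the keyword detector; nothing is ever written to disk.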
Which language should I use? The whole system should run as a background process on Linux. TensorFlow also supports a wide range of languages. The ones I care about most are C++ and Java.
Most TensorFlow development is done in Python.
Is threading an option, or is it necessary? The recording software runs on Linux as a background process.
Threading is not necessary. The Linux kernel buffers audio internally while your software processes it, so a single loop works as long as it keeps up with the incoming audio.
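A rough sketch of such a single-threaded loop, under some assumptions: the audio arrives as raw 16 kHz, 16-bit mono PCM on a pipe, and `read_chunks` and `open_microphone` are hypothetical helper names. The `arecord` command line is just one way to get such a pipe from ALSA; any other capture source that gives you a readable byte stream would work the same way.

```python
import subprocess
from typing import BinaryIO, Iterator

CHUNK_BYTES = 3200  # 0.1 s of 16 kHz, 16-bit mono audio (assumed format)


def read_chunks(stream: BinaryIO, chunk_bytes: int = CHUNK_BYTES) -> Iterator[bytes]:
    """Yield fixed-size chunks from a blocking stream until it ends.

    While the caller is busy processing one chunk, the data that keeps
    arriving waits in the OS buffers, so no reader thread is needed.
    """
    while True:
        data = stream.read(chunk_bytes)   # blocks until a full chunk or EOF
        if len(data) < chunk_bytes:
            return                        # end of stream, drop the partial chunk
        yield data


def open_microphone() -> subprocess.Popen:
    """Start arecord writing raw PCM to a pipe (assumes ALSA's arecord is installed)."""
    return subprocess.Popen(
        ["arecord", "-q", "-t", "raw", "-f", "S16_LE", "-r", "16000", "-c", "1"],
        stdout=subprocess.PIPE,
    )
```

You would then write something like `for chunk in read_chunks(open_microphone().stdout): ...` and do the processing inside the loop body.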