繁体 English 中英

使用其音频源和开源工具有效地生成预转录语音的时间索引

[英]Efficiently generating time index of pre-transcribed speech using it's audio source and open source tools

原文 2012-07-03 22:21:44 7 1 linux/ speech-recognition/ cmusphinx/ transcription

在TED.com上，他们有转录，单击转录的一部分时，它们会转到视频的相应部分。

我想在具有OSS的Linux上使用80个小时的音频和转录来进行此操作。

这是我在想的方法：

这似乎是一种有效的方法吗？ 有人真的这样做过吗？

是否有其他值得尝试的替代方法，例如愚蠢的字数统计可能足够准确？

您只需将所有音频和文本输入一个较长的音频对齐器中，它就会为您提供单词的时间戳。 使用此时间戳，您可以跳至文件中的特定单词。

我不确定为什么要分割音频或做其他事情。

寻求开源Linux工具进行分布式任务管理

[英]Seeking open source linux tools for distributed task management

[英]ipsec open source for linux

[英]Open Source Development

[英]Distributing source files with an open source app

[英]Batch Job Dependencies Using Open Source/Free Software

[英]Is there any open source for Ip Tunnel?

[英]Compiling program with Open Source libFTDI

[英]How to open a source file in GDB

[英]Maven build of Saiku open source project fails second time after no changes

[英]Why I have to “source vitrualenvwrapper.sh” every time I open a new Terminal?

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 寻求开源Linux工具进行分布式任务管理 ipsec Linux开源开源开发使用开源应用程序分发源文件使用开源/免费软件批处理作业依赖性 Ip Tunnel是否有任何开源？使用开源libFTDI编译程序如何在 GDB 中打开源文件没有更改后，Saiku开源项目的Maven构建第二次失败为什么每次打开新终端时都必须“获得vitrualenvwrapper.sh资源”？

相关标签