Build a new acoustic model, dictionary, and language model for speech recognition in an uncommon language

Question

I want to build a new acoustic model, a new dictionary, and a new language model for Sinhala speech recognition. Sinhala characters are Unicode based, for example A=අ, I=ඉ, U=උ, KA=ක, BA=බ. I went through the CMUSphinx Tutorial For Developers, but it did not help me because it is written for English.

The language model should be an ARPA model. How can I map Sinhala Unicode characters to English phonemes, and how do I train the language model with different voices? Is there any tool available for generating a Unicode-based language model?

Recommended answer

Overall, it is not really complex. First, split the task into parts: build a phonetic dictionary, build a language model, and build an acoustic model. Start with the phonetic dictionary.

You need to write a Python script that maps Unicode input to its transliteration:

රට  r a tt a
එකඟයි   e k a ng a yi
අවසර දිම    a v a s a r a d i m a

Basically, for every Sinhala character you write a corresponding transliteration. That is all you need to do; later you can feed a list of words into your script and get a dictionary in CMUSphinx format. This part is covered in the tutorial:

http://cmusphinx.sourceforge.net/wiki/tutorialdict
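
A minimal sketch of such a transliteration script might look like the following. The mapping tables are hypothetical and deliberately incomplete, and the phone symbols are assumptions chosen to resemble the examples above; a real script would need the full Sinhala inventory. It assumes that a bare consonant carries an inherent "a", that a following vowel sign replaces it, and that the hal kirima (U+0DCA) removes it.

```python
# Hypothetical, incomplete mapping tables; extend them to cover all of Sinhala.
INDEPENDENT_VOWELS = {"අ": "a", "ඉ": "i", "උ": "u", "එ": "e"}
CONSONANTS = {
    "ක": "k", "බ": "b", "ර": "r", "ට": "tt", "ඟ": "ng",
    "ය": "y", "ව": "v", "ස": "s", "ද": "d", "ම": "m",
}
# Dependent vowel signs, written as escapes because they are combining marks.
VOWEL_SIGNS = {"\u0DCF": "aa", "\u0DD2": "i", "\u0DD4": "u", "\u0DD9": "e"}
HAL_KIRIMA = "\u0DCA"  # suppresses the inherent vowel of a consonant


def transliterate(word):
    """Return the list of phones for one Sinhala word."""
    phones = []
    i = 0
    while i < len(word):
        ch = word[i]
        if ch in INDEPENDENT_VOWELS:
            phones.append(INDEPENDENT_VOWELS[ch])
            i += 1
        elif ch in CONSONANTS:
            phones.append(CONSONANTS[ch])
            nxt = word[i + 1] if i + 1 < len(word) else ""
            if nxt == HAL_KIRIMA:
                i += 2  # pure consonant, no vowel follows
            elif nxt in VOWEL_SIGNS:
                phones.append(VOWEL_SIGNS[nxt])
                i += 2
            else:
                phones.append("a")  # assumed inherent vowel
                i += 1
        else:
            raise ValueError(f"no mapping for U+{ord(ch):04X} in {word!r}")
    return phones


def dictionary_lines(words):
    """Format one 'word phone phone ...' line per word."""
    return [f"{w} {' '.join(transliterate(w))}" for w in words]


if __name__ == "__main__":
    for line in dictionary_lines(["රට", "එකඟයි"]):
        print(line)
```

Raising on an unmapped character, rather than skipping it, makes gaps in the tables visible as soon as a larger word list is fed in.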

Once you have a transliteration tool, you can proceed to the language model. You need a lot of text to build one; you can download text from Wikipedia or from local newspapers. Then you can use any language model toolkit to create an ARPA model. SRILM, MITLM, and IRSTLM all support Unicode, so you can use any of them. This part is covered in the tutorial:

http://cmusphinx.sourceforge.net/wiki/tutoriallm
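
Before handing the downloaded text to one of those toolkits, it usually has to be cleaned into one sentence per line. The sketch below is one possible way to do that, under the assumption that anything outside the Sinhala Unicode block (U+0D80 to U+0DFF, plus the zero-width joiner) can be dropped; the function names are hypothetical. It also produces a word list that can be fed to the dictionary script.

```python
import re
from collections import Counter

# Anything that is not Sinhala (U+0D80-U+0DFF), a zero-width joiner, or whitespace.
_NON_SINHALA = re.compile(r"[^\u0D80-\u0DFF\u200D\s]+")
# Assumed sentence boundaries: Latin punctuation and line breaks.
_SENTENCE_END = re.compile(r"[.?!\n]+")


def corpus_sentences(raw_text):
    """Split raw text into cleaned sentences, one string per sentence."""
    sentences = []
    for chunk in _SENTENCE_END.split(raw_text):
        words = _NON_SINHALA.sub(" ", chunk).split()
        if words:
            sentences.append(" ".join(words))
    return sentences


def vocabulary(sentences):
    """Return the distinct words, most frequent first."""
    counts = Counter(w for s in sentences for w in s.split())
    return [w for w, _ in counts.most_common()]


if __name__ == "__main__":
    sample = "රට 2024. abc රට එක!"
    cleaned = corpus_sentences(sample)
    print("\n".join(cleaned))   # text for the language model toolkit
    print(vocabulary(cleaned))  # word list for the dictionary script
```

Dropping digits and Latin text is a simplification; a fuller pipeline might instead spell numbers out in Sinhala so that they remain in the model.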

The third step is to create an acoustic model. You need to record audio, or segment existing recordings, and then start training. This part is also covered in the tutorial:

http://cmusphinx.sourceforge.net/wiki/tutorialam
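
For training, the recordings have to be listed together with their transcripts. As far as I understand the training tutorial, this takes the form of a file-ID list (paths without extension) and a transcription file with lines like `<s> text </s> (file_1)`; treat that layout as an assumption and check it against the tutorial. The helper name below is hypothetical.

```python
from pathlib import PurePosixPath


def training_lists(utterances):
    """Build file-ID and transcription lines from (relative_wav_path, text) pairs."""
    fileids, transcripts = [], []
    for rel_path, text in utterances:
        stem = PurePosixPath(rel_path).with_suffix("")  # drop the .wav extension
        fileids.append(str(stem))
        # Collapse whitespace and tag the line with the utterance name.
        transcripts.append(f"<s> {' '.join(text.split())} </s> ({stem.name})")
    return fileids, transcripts


if __name__ == "__main__":
    ids, trans = training_lists([("speaker_1/file_1.wav", "රට")])
    print("\n".join(ids))
    print("\n".join(trans))
```

Every word that appears in the transcripts also has to be present in the phonetic dictionary, so the same word list can be run through the transliteration script first.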
