sphinx-4 aligner会跳过诸如"you"，"in"和带破折号的单词之类的普通单词-为什么? [英] sphinx-4 aligner skips plain words like `you`, `in` and words with dashes - why?

查看：136 发布时间：2020/7/8 19:38:23 speech-recognition sphinx4

本文介绍了sphinx-4 aligner会跳过诸如"you"，"in"和带破折号的单词之类的普通单词-为什么?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试对齐简单文本.以下是文本和音频文件的链接:
http://s000.tinyupload.com/?file_id=48044768133759453374
http://s000.tinyupload.com/?file_id=99891199139563396901

I'm trying to align simple text. Here are the links to text and audio files:
http://s000.tinyupload.com/?file_id=48044768133759453374
http://s000.tinyupload.com/?file_id=99891199139563396901

以下是配置设置:

private static final String ACOUSTIC_MODEL_PATH =
        "resource:/edu/cmu/sphinx/models/en-us/en-us";
private static final String DICTIONARY_PATH =
        "resource:/edu/cmu/sphinx/models/en-us/cmudict-en-us.dict";

我得到的输出如下(省略号由我添加):

The output I get is the following (ellipsis are added by me):

- ï
- ¿in
  a                         [11250:11330]
  standard                  [11330:11920]
  shopping                  [11920:12440]
  centre                    [12440:13020]
- you
  can                       [13380:13730]
  ...
  shops                     [15170:15790]
- you
  can                       [16620:16890]
  buy                       [16890:17140]
  ...
  and                       [26920:27230]
  suits                     [27190:27220]
- thereâ€™s
  a                         [29160:29210]
  sportswear                [29210:29980]
  ...
  clothes                   [33330:33360]
- t-shirts
  shorts                    [35560:36320]
  jumpers                   [36630:37410]
  ...
  for                       [41860:42010]

由于某种原因，您可以看到它:

As you can see for some reason it:

在第一个a

in

you
无法识别there's，而是将其标识为thereâ€™s
没有时间对带有破折号的单词(例如t-shirts

didn't recognize in before the first a
no timing for multiple instances of you
didn't recognize there's, instead it identified it as thereâ€™s
no timing for words with dashes, like t-shirts

有什么方法可以配置狮身人面像以提供出现的时间?

Is there any way I can configure sphinx to provide timings for there occurrences?

sphinx-4 aligner会跳过诸如"you"，"in"和带破折号的单词之类的普通单词-为什么? [英] sphinx-4 aligner skips plain words like `you`, `in` and words with dashes - why?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

sphinx-4 aligner会跳过诸如"you"，"in"和带破折号的单词之类的普通单词-为什么? [英] sphinx-4 aligner skips plain words like `you`, `in` and words with dashes - why?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭