Finding pronunciation correctness

Views: 322 Published: 2016/9/23 21:21:54 Tags: c# windows voice voice-recognition phonetics

Problem description

I need to assess the "quality" of a user's pronunciation with the help of the Microsoft speech SDK (System.Speech.Recognition). I am using the MS Speech Engine for US English, so what I actually need is to find out how close the speaker's pronunciation is to a "North American" accent.

One way of doing this is to check how close the user's speech is to the US English phonetic pronunciation. According to MSDN, this comparison seems to be done inside the speech SDK itself, so I need to get that information out. Since we can also set the phonetic pronunciation on the engine ourselves, I am sure this is possible.

However, I have no clear idea of what I have to do. So, how can I find out the quality of the user's pronunciation, that is, how close it is to the North American (US) English phonetic pronunciation? The user will only have to speak predefined sentences such as "Hello World. I am here".

Please help.

Update

I got some kind of "phonemes" (as mentioned in MSDN) by using the following code:

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Speech.Recognition;
using System.Speech.Synthesis;
using System.Windows.Forms;
using System.IO;

namespace US_Speech_Recognizer
{
    public class RecognizeSpeech
    {
        private SpeechRecognitionEngine sEngine; //Speech recognition engine
        private SpeechSynthesizer sSpeak; //Speech synthesizer

        public RecognizeSpeech()
        {
            //Make the recognizer ready
            sEngine = new SpeechRecognitionEngine(new System.Globalization.CultureInfo("en-US"));


            //Load grammar
            Choices sentences = new Choices();
            sentences.Add(new string[] { "I am hungry" });

            GrammarBuilder gBuilder = new GrammarBuilder(sentences);

            Grammar g = new Grammar(gBuilder);

            sEngine.LoadGrammar(g);

            //Add a handler
            sEngine.SpeechRecognized += new EventHandler<SpeechRecognizedEventArgs>(sEngine_SpeechRecognized);


            sSpeak = new SpeechSynthesizer();
            sSpeak.Rate = -2;



            //Computer speaks the words to get the phones
            Stream stream = new MemoryStream();
            sSpeak.SetOutputToWaveStream(stream);


            sSpeak.Speak("I was hungry");
            stream.Position = 0;
            sSpeak.SetOutputToNull();


            //Configure the recognizer to stream
            sEngine.SetInputToWaveStream(stream);

            sEngine.RecognizeAsync(RecognizeMode.Single);


        }


        //Handle a recognition result: collect the pronunciation of each recognized word
        private void sEngine_SpeechRecognized(object sender, SpeechRecognizedEventArgs e)
        {
            string text = "";

            if (e.Result.Text == "I am hungry")
            {
                foreach (RecognizedWordUnit wordUnit in e.Result.Words)
                {
                    text = text + wordUnit.Pronunciation + "\n";
                }

                MessageBox.Show(e.Result.Text + "\n" + text);
            }


        }
    }
}

This is the code snippet directly related to the phonemes (extracted from the code above):

    //Handle a recognition result: collect the pronunciation of each recognized word
    private void sEngine_SpeechRecognized(object sender, SpeechRecognizedEventArgs e)
    {
        string text = "";

        if (e.Result.Text == "I am hungry")
        {
            foreach (RecognizedWordUnit wordUnit in e.Result.Words)
            {
                text = text + wordUnit.Pronunciation + "\n";
            }

            MessageBox.Show(e.Result.Text + "\n" + text);
        }


    }

The following is my output. The first line simply shows the recognized sentence; the phonemes I got are displayed starting from the second line.

According to MSDN, these are "phonemes". So, are these actually phonemes? I ask because I have never seen them written like this before.

The above code was written according to this link: http://msdn.microsoft.com/en-us/library/microsoft.speech.recognition.srgsgrammar.srgstoken.pronunciation(v=office.14).aspx
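
The idea of checking "how close" a spoken pronunciation is to a reference one could be sketched roughly as follows. This is only an illustration under stated assumptions, not something the speech SDK provides: the class `PronunciationDistance` and its methods are hypothetical names, each pronunciation is assumed to be already split into an array of phonetic symbols, and plain edit distance is swapped in as a simple stand-in for a real pronunciation score. Whether the strings returned by `RecognizedWordUnit.Pronunciation` reflect the speaker's actual sounds is itself an open question here.

```csharp
using System;

// Hypothetical helper (not part of System.Speech): compares two sequences
// of phonetic symbols using the classic Levenshtein edit distance.
public static class PronunciationDistance
{
    // Number of insertions, deletions and substitutions needed to turn
    // the reference symbol sequence into the observed one.
    public static int EditDistance(string[] reference, string[] observed)
    {
        int[,] d = new int[reference.Length + 1, observed.Length + 1];

        for (int i = 0; i <= reference.Length; i++) d[i, 0] = i;
        for (int j = 0; j <= observed.Length; j++) d[0, j] = j;

        for (int i = 1; i <= reference.Length; i++)
        {
            for (int j = 1; j <= observed.Length; j++)
            {
                int cost = reference[i - 1] == observed[j - 1] ? 0 : 1;
                int deleteOrInsert = Math.Min(d[i - 1, j] + 1, d[i, j - 1] + 1);
                d[i, j] = Math.Min(deleteOrInsert, d[i - 1, j - 1] + cost);
            }
        }

        return d[reference.Length, observed.Length];
    }

    // Similarity in the range [0, 1], where 1 means identical sequences.
    // Assumption: normalizing by the longer sequence is an acceptable scale.
    public static double Similarity(string[] reference, string[] observed)
    {
        int longest = Math.Max(reference.Length, observed.Length);
        if (longest == 0) return 1.0;
        return 1.0 - (double)EditDistance(reference, observed) / longest;
    }
}
```

A caller would pass the reference symbols for a predefined sentence and the symbols obtained for the user's attempt, then treat a `Similarity` value near 1 as "close to the reference". How to split a pronunciation string into symbols depends on the phonetic alphabet the engine uses, which is left open here.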
