SFSpeechRecognizer - 检测话语结束 [英] SFSpeechRecognizer - detect end of utterance

查看：1132 发布时间：2018/9/17 9:11:20 ios sfspeechrecognizer

本文介绍了SFSpeechRecognizer - 检测话语结束的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用iOS 10内置语音识别来攻击一个小项目。我有使用设备麦克风的工作结果，我的语音被非常准确地识别。

I am hacking a little project using iOS 10 built-in speech recognition. I have working results using device's microphone, my speech is recognized very accurately.

我的问题是每个可用的部分转录都会调用识别任务回调，我想要它检测到人们停止说话并使用 isFinal 属性设置为true来调用回调。它没有发生 - 应用程序正在无限期地收听。

My problem is that recognition task callback is called for every available partial transcription, and I want it to detect person stopped talking and call the callback with isFinal property set to true. It is not happening - app is listening indefinitely.

SFSpeechRecognizer 是否能够检测到句尾？

Is SFSpeechRecognizer ever capable of detecting end of sentence?

这是我的代码 - 它基于在互联网上找到的示例，它主要是从麦克风源识别所需的样板。
我通过添加识别 taskHint 来修改它。我还将 shouldReportPartialResults 设置为false，但它似乎已被忽略。

Here's my code - it is based on example found on the Internets, it is mostly a boilerplate needed to recognize from microphone source. I modified it by adding recognition taskHint. I also set shouldReportPartialResults to false, but it seems it has been ignored.

    func startRecording() {

    if recognitionTask != nil {
        recognitionTask?.cancel()
        recognitionTask = nil
    }

    let audioSession = AVAudioSession.sharedInstance()
    do {
        try audioSession.setCategory(AVAudioSessionCategoryRecord)
        try audioSession.setMode(AVAudioSessionModeMeasurement)
        try audioSession.setActive(true, with: .notifyOthersOnDeactivation)
    } catch {
        print("audioSession properties weren't set because of an error.")
    }

    recognitionRequest = SFSpeechAudioBufferRecognitionRequest()
    recognitionRequest?.shouldReportPartialResults = false
    recognitionRequest?.taskHint = .search

    guard let inputNode = audioEngine.inputNode else {
        fatalError("Audio engine has no input node")
    }

    guard let recognitionRequest = recognitionRequest else {
        fatalError("Unable to create an SFSpeechAudioBufferRecognitionRequest object")
    }

    recognitionRequest.shouldReportPartialResults = true

    recognitionTask = speechRecognizer?.recognitionTask(with: recognitionRequest, resultHandler: { (result, error) in

        var isFinal = false

        if result != nil {
            print("RECOGNIZED \(result?.bestTranscription.formattedString)")
            self.transcriptLabel.text = result?.bestTranscription.formattedString
            isFinal = (result?.isFinal)!
        }

        if error != nil || isFinal {
            self.state = .Idle

            self.audioEngine.stop()
            inputNode.removeTap(onBus: 0)

            self.recognitionRequest = nil
            self.recognitionTask = nil

            self.micButton.isEnabled = true

            self.say(text: "OK. Let me see.")
        }
    })

    let recordingFormat = inputNode.outputFormat(forBus: 0)
    inputNode.installTap(onBus: 0, bufferSize: 1024, format: recordingFormat) { (buffer, when) in
        self.recognitionRequest?.append(buffer)
    }

    audioEngine.prepare()

    do {
        try audioEngine.start()
    } catch {
        print("audioEngine couldn't start because of an error.")
    }

    transcriptLabel.text = "Say something, I'm listening!"

    state = .Listening
}

SFSpeechRecognizer - 检测话语结束 [英] SFSpeechRecognizer - detect end of utterance

问题描述

推荐答案

相关文章

移动开发最新文章

热门教程

热门工具

登录关闭

SFSpeechRecognizer - 检测话语结束 [英] SFSpeechRecognizer - detect end of utterance

问题描述

推荐答案

相关文章

移动开发最新文章

热门教程

热门工具

登录 关闭

登录关闭