语音识别和语调检测 [英] Speech recogition and intonation detection
问题描述
我想制作一个iOS应用来计算疑问句。我将寻找WH问题,也将寻找我会吗?格式化问题。
I want to make an iOS app to count interrogative sentences. I will look for WH questions and also "will I, am I?" format questions.
我在语音或音频技术领域并不十分了解,但是我使用Google时发现语音识别SDK很少。但是仍然不知道如何检测和绘制语调。是否有任何支持语调或情感语音识别的SDK?
I am not very get in the speech or audio technology world, but I did Google and found that there are few speech recognition SDKs. But still no idea how can I detect and graph intonation. Are there any SDKs that support intonation or emotional speech recognition?
推荐答案
AFAIK没有基于云的语音识别SDK,它也可以提供你的语调。您可以搜索音高跟踪解决方案,并从音高轮廓中得出语调。在Python的 librosa
包中提供了一个开源程序:
AFAIK there is no cloud-based Speech Recognition SDK which also gives you intonation. You could search for pitch-tracking solutions and derive intonation from the pitch contour. An opensource one is available in the librosa
package in Python:
https://librosa.org/librosa/generated/librosa.core.piptrack.html
如果您无法在应用程序中嵌入Python,则始终可以选择使用 Flask
或 fastapi 在REST API中提供它。 code>。
If you can't embed Python in your application, there is always the option of serving it in a REST API with Flask
or fastapi
.
这篇关于语音识别和语调检测的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!