Google-speech-api 错误地转录语音数字 [英] Google-speech-api transcribing spoken numbers incorrectly
问题描述
我开始使用谷歌语音 api 转录音频.
I started using google speech api to transcribe audio.
正在转录的音频包含许多一个接一个说出的数字.
The audio being transcribed contains many numbers spoken one after the other.
例如273 298
但转录返回 270-3298
But the transcription comes back 270-3298
我的猜测是它会将其解释为某种电话号码.
My guess is that it is interpreting it as some sort of phone number.
我想要的是未解析的输出,例如2 73 2 98",我可以自己处理和解析.
What i want is unparsed output e.g. "two seventy three two ninety eight' which i can deal with and parse on my own.
有没有设置或者支持这种东西?
Is there a setting or support for this kind of thing?
谢谢
推荐答案
所以我遇到了完全相同的问题,我认为我们找到了解决方案.如果您使用英语作为输入,请在处理数字时切换到 en-PH.然后,Google 不会将结果格式化为美国电话号码,也不会尝试在其中添加额外的数字.
So I had this exact same problem and I think we found a solution. If you're using English as input, switch to en-PH just when working with numbers. Google will then not format the result as a U.S. phone number or try to stick an extra digit in there.
这篇关于Google-speech-api 错误地转录语音数字的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!