Google-Speech-api错误地转录语音号码 [英] Google-speech-api transcribing spoken numbers incorrectly
问题描述
我开始使用Google语音API转录音频.
I started using google speech api to transcribe audio.
正在转录的音频包含许多个接一个的语音.
The audio being transcribed contains many numbers spoken one after the other.
例如273298
但是转录又回到了270-3298
But the transcription comes back 270-3298
我的猜测是它将其解释为某种电话号码.
My guess is that it is interpreting it as some sort of phone number.
我想要的是未解析的输出,例如我可以自己处理和解析的二七三二二九十八".
What i want is unparsed output e.g. "two seventy three two ninety eight' which i can deal with and parse on my own.
对这种事情有设置或支持吗?
Is there a setting or support for this kind of thing?
谢谢
推荐答案
所以我遇到了同样的问题,我想我们找到了解决方案.如果您使用英语作为输入,请仅在处理数字时切换到en-PH.然后,Google不会将结果格式设置为美国电话号码,也不会尝试在其中添加多余的数字.
So I had this exact same problem and I think we found a solution. If you're using English as input, switch to en-PH just when working with numbers. Google will then not format the result as a U.S. phone number or try to stick an extra digit in there.
这篇关于Google-Speech-api错误地转录语音号码的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!