Long audio speech recognition on Android


Question


I want to develop a module that provides speech-to-text support on Android. I found a lot of documentation and demos for RecognizerIntent and similar APIs, but all of those demos capture only about 10 seconds of speech. I want my demo to run for more than 5-10 minutes. It does not need to work offline, since my app is always online.

I have also looked into Pocketsphinx on Android, but that didn't work out well. Also, it only supported Android Studio, not Eclipse.

I have seen many apps that convert speech to text continuously for 5-10 minutes, such as Speech To Text Notepad.

Can anyone suggest another library or demo code with which this can be achieved? TIA.

Solution

I accomplished this with the Google Cloud Speech API. They have also added a demo here.

Google Cloud Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or pre-recorded audio using Google’s machine learning technology.

You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases. Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.
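
The answer does not include code. As a rough sketch only: if the service caps how much audio a single request or stream may carry (an assumption here; check the current Cloud Speech quotas), one way to cover a 5-10 minute recording is to cut the raw audio into fixed-length segments and submit them one after another. The class `PcmChunker`, the method `split`, the 16 kHz 16-bit mono format, and the 60-second segment length are all illustrative choices, not part of the Cloud Speech API, and the step that actually sends each segment to the service is deliberately left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper: splits raw 16-bit mono PCM audio into fixed-duration segments. */
public class PcmChunker {
    // Assumption: 16-bit samples, single channel.
    private static final int BYTES_PER_SAMPLE = 2;

    /**
     * Splits pcm into segments of at most chunkMillis milliseconds each.
     * The last segment may be shorter than the others.
     */
    public static List<byte[]> split(byte[] pcm, int sampleRateHz, int chunkMillis) {
        if (sampleRateHz <= 0 || chunkMillis <= 0) {
            throw new IllegalArgumentException("sampleRateHz and chunkMillis must be positive");
        }
        long bytes = (long) sampleRateHz * BYTES_PER_SAMPLE * chunkMillis / 1000;
        // Round down to a whole number of samples so no sample is cut in half.
        bytes -= bytes % BYTES_PER_SAMPLE;
        if (bytes <= 0 || bytes > Integer.MAX_VALUE) {
            throw new IllegalArgumentException("chunk size out of range");
        }
        int chunkBytes = (int) bytes;

        List<byte[]> chunks = new ArrayList<>();
        for (int start = 0; start < pcm.length; start += chunkBytes) {
            int end = Math.min(pcm.length, start + chunkBytes);
            chunks.add(Arrays.copyOfRange(pcm, start, end));
        }
        return chunks;
    }

    public static void main(String[] args) {
        // Ten minutes of silence at 16 kHz, 16-bit mono, cut into 60-second segments.
        byte[] tenMinutes = new byte[16000 * BYTES_PER_SAMPLE * 600];
        List<byte[]> chunks = split(tenMinutes, 16000, 60_000);
        System.out.println(chunks.size() + " segments");
    }
}
```

Each returned segment would then be handed to whatever client code talks to the recognition service. Cutting on a fixed time boundary can split a word in two, so a real app would probably prefer to cut during silence or overlap neighboring segments slightly.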
