如何使用从浏览器发送到Nodejs服务器的Blob进行Google语音文本转换 [英] How to Google Speech-to-Text using Blob sent from Browser to Nodejs Server

查看：130 发布时间：2021/4/12 19:47:11 node.js socket.io audio-streaming getusermedia google-speech-api

本文介绍了如何使用从浏览器发送到Nodejs服务器的Blob进行Google语音文本转换的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试将服务器设置为使用 SocketIO 从客户端浏览器接收音频，然后通过Google Speech-to-Text处理它，最后用文本回复给客户端.

I am trying to set up a server to receive audio from a client browser using SocketIO, then process it through Google Speech-to-Text, and finally reply back to the client with the text.

最初，理想情况下，我想设置为类似于此页面上的工具的功能:https://cloud.google.com/speech-to-text/

Originally and ideally, I wanted to set up to function somewhat like the tool on this page: https://cloud.google.com/speech-to-text/

我尝试使用 getUserMedia 并将其通过 SocketIO-Stream 进行流传输，但是我不知道如何管道" MediaStream .

I tried using getUserMedia and streaming it through SocketIO-Stream, but I couldn't figure out how to 'pipe' MediaStream.

相反，现在我决定在客户端使用 MediaRecorder ，然后将数据作为Blob一起发送(在此

Instead, now I've decided to use MediaRecorder on the client side, and then send the data altogether as a blob(seen in this example).

然后我将 toString('base64')应用于Blob，并在Blob上调用google-cloud/speech的 client.recognize().

I then apply toString('base64') to the blob and call google-cloud/speech's client.recognize() on the blob.

客户端(我正在使用VueJS)

Client Side(i'm using VueJS):

        new Vue({
            el: '#app',
            data: function () {
                return ({
                    msgs: [],
                    socket: null,
                    recorder: null,
                    : []
                })
            },
            mounted: function () {
                this.socket = io.connect('localhost:3000/user');
                console.log('Connected!')
                this.socket.on('text', function (text) {
                    this.msgs.push(text)
                })
            },
            methods: {
                startRecording: function () {
                    if (this.recorder && this.recorder.state == 'recording') {
                        console.log("Stopping!")
                        this.recorder.stop()
                    } else {
                        console.log("Starting!")
                        navigator.mediaDevices.getUserMedia({ audio: true, video: false })
                            .then(this.handleSuccess);
                    }
                },
                handleSuccess: function (stream) {
                    this.recorder = new MediaRecorder(stream)
                    this.recorder.start(10000)
                    this.recorder.ondataavailable = (e) => {
                        this.chunks.push(e.data)
                        console.log(e.data)
                    }
                    this.recorder.onstop = (e) => {
                        const blob = new Blob(this.chunks, { 'type': 'audio/webm; codecs=opus' })
                        this.socket.emit('audio', blob)
                    }
                }
            }
        })

服务器端:

const speech = require('@google-cloud/speech');
const client = new speech.SpeechClient();

const io = require('socket.io').listen(3000)
const ss = require('socket.io-stream')

const encoding = 'LINEAR16';
const sampleRateHertz = 16000;
const languageCode = 'en-US';

const audio = {
    content: null
}

const config = {
    encoding: encoding,
    sampleRateHertz: sampleRateHertz,
    languageCode: languageCode,
}

async function main() {
    const [response] = await client.recognize({
        audio: audio,
        config: config
    })
    const transcription = response.results
        .map(result => result.alternatives[0].transcript)
        .join('\n');
    console.log(`Transcription: ${transcription}`);
}

io.of('/user').on('connection', function (socket) {
    console.log('Connection made!')
    socket.on('audio', function (data) {
        audio.content = data.toString('base64')
        main().catch(console.error)
    });
});

服务器端 main()函数中的日志始终为:

The log from the main() function in the Server side is always:

转录:"

-这是空的！

它应该包含发送的音频中的文本.预先谢谢你！

It should contain the text from the audio sent. Thank you in advance!

如何使用从浏览器发送到Nodejs服务器的Blob进行Google语音文本转换 [英] How to Google Speech-to-Text using Blob sent from Browser to Nodejs Server

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何使用从浏览器发送到Nodejs服务器的Blob进行Google语音文本转换 [英] How to Google Speech-to-Text using Blob sent from Browser to Nodejs Server

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭