DialogFlow vs Lex用于流式意图检测 [英] DialogFlow vs Lex for streaming intent detection

查看:155
本文介绍了DialogFlow vs Lex用于流式意图检测的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

对于我当前的项目,我正在寻找具有以下功能的机器人构建平台。

For my current project, I am looking for a bot building platform with the following features.


  1. 应支持创建/通过API更新新的/现有的漫游器。

  1. Should support the creation/updating of new/existing bot via API.

支持从流音频中检测意图。

Support for intent detection from streaming audio.

启用托管多个漫游器

API来修改现有的漫游器/代理并将其保存为新的。

APIs to modify an existing bot/agent and save it as a new one.

因此,我已经深入研究并使用DialogFlow开发了一个简单的工作原型。 DialogFlow支持从流音频检测意图。但是不支持第1点和第3点。 DialogFlow目前不支持通过API创建代理。而且,最大的限制是,对于每个GCP项目,只能创建一个DialogFlow代理。假设我创建了多个agent-gcp项目。我还运行着一个Web应用程序,该应用程序通过WebSocket从另一个外部资源接收音频流以进行意图检测。为了将音频流定向到内置的特定DialogFlow代理,我需要在运行时解析该特定GCP项目的凭据。我已经读过我可以创建一个服务帐户并使用一个凭证来管理多个GCP项目,但这尚未尝试。

So I have thoroughly researched and also developed a simple working prototype using DialogFlow. DialogFlow supports intent detection from streaming audio. But point 1 and 3 are not supported. DialogFlow at the moment does not support the creation of agent via API. And moreover, the biggest limitation is that for each GCP project, only one DialogFlow agent can be created. So let's say I have multiple agents-gcp projects created. I also have a web app running which receives audio streams via WebSocket from another external resource for intent detection. In order to direct the audio stream to the specific DialogFlow agent built, I need to resolve the credentials at runtime for that specific GCP project. I have read that I can create a service account and manage multiple GCP projects with one credential, but this is yet to try.

另外,正如第4点所述,我也在寻找一种在运行时使用API​​创建现有代理的新版本的方法。当前,DialogFlow支持版本和环境,但是其实际目的并不像我提到的那样。此外,这是一个beta功能,不确定是否将其包含在生产中。

Also as point 4 dictates, I am also looking for a way to create new versions of an existing agent at run-time using APIs. Currently, DialogFlow supports versions and environments, but the actual purpose of this is not as I mentioned. Furthermore, this is a beta feature and not sure if it will ever be included in production.

另一方面,我正在研究AWS Lex。 Lex支持我的要求之一。目前,AWS Lex不支持从流中检测意图(我无法为此找到任何API)。

On the other hand, I was looking into AWS Lex. Lex supports all but one of my requirement. AWS Lex at the moment does not support intent detection from the stream (I could not find any APIs for this).

所以我的问题是如何利用现有的DialogFlow功能满足我的要求?还是有一种方法可以从AWS Lex中的流音频输入中检测意图?

So my question is how can I utilize existing DialogFlow features to fulfill my requirements? Or is there a way to detect intent from streaming audio input in AWS Lex?

推荐答案

没有机制可以通过创建新的Dialogflow代理API,因此无法使用 1。 4。的部分。

There's no mechanism for creating new Dialogflow agents via API, so parts of 1. and 4. are not possible.

对于3。,正如您提到的,您可以在运行时管理凭据,也可以配置一个可以访问所有GCP项目的服务帐户。

For 3., as you mention, you can either manage credentials at runtime or configure a single service account with access to all of the GCP projects.

这篇关于DialogFlow vs Lex用于流式意图检测的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆