如何将wit.ai与现有数据行一起使用? [英] How do I use wit.ai with existing rows of data?

查看:107
本文介绍了如何将wit.ai与现有数据行一起使用?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有很多现有数据想用作wit.ai聊天机器人的训练数据.数据存储在一个csv文件中,其中每行都有一个语句/问题以及对该语句/问题的响应.

I have a lot of existing data that I would like to use as training data for a wit.ai chatbot. The data is stored in a csv file where each row has a statement/question and a response to that statement/question.

我知道wit.ai要求您将意图分配给所发表的评论,因此我想知道是否有一种方法可以简单地发送我拥有的数据并使聊天机器人自己开始学习意图.

I know that wit.ai requires you to assign intents to comments made and so I'm wondering if there is a way to simply send over the data I have and have the chatbot start learning intents on its own.

谢谢!

推荐答案

教学" Wit.Ai并不完全符合某些人的想法.

"Teaching" Wit.Ai is not exactly what some might think it is.

您将必须为User says列创建故事.说实话,这些答复是无关紧要的.您无法教" wit.ai进行回复.答复是在故事或您的代码中定义的.

You will have to create stories for your User says column. The replies are irrelevant to be honest. You can't "teach" wit.ai to reply. Replies are defined in the story or in your code.

wit.ai可能需要从您的数据中获得的关键字和关键短语使wit.ai更好地识别实体.

What wit.ai might need from your data are keywords and key-phrases which make the entity recognition better for wit.ai.

这是最简单的示例:

实体color是根据列出的关键字来识别的.因此,如果您有大量数据作为用户输入的示例-您可以尝试将其首先分解为用户输入应产生哪些实体",然后将这些输入中的关键字分类.

Entity color is recognized based on keywords listed. So if you have a lot of data as an example of user input - you can try to break it down first into "which entities which user input should produce" and then keywords from those input.

使用数据进行教学"会有些困难,因为这将需要您在wit.ai中创建很多Stories来涵盖可能的用户输入和实体标识.但是您仍然可以这样:

Using your data for "teaching" - would be a little difficult since it will require you to create a lot of Stories in wit.ai to cover possible user input and entity identification. But you can still do it like this:

(粗略的例子)

  1. 制作一个有关用户询问时间的故事
  2. 在用户输入中标记应从该输入派生哪些实体:
  3. 对列表进行排序,您必须获得所有可能的询问时间的方式:
    • 现在几点了?
    • 你能告诉我时间吗?
    • 我想知道现在几点了?
  1. Make one story about user asking the time for example
  2. Mark in the user input which entities should be derived from that input:
  3. Sort your list you have to get all possible way of asking for the time:
    • How late is it?
    • Can you tell me the time?
    • I wonder what's the time now?

如果实体是基于关键字或其他某种算法的,则此过程将教"实体.

This process will "teach" entities if they are keywords based or some other algorithm.

关于如何使用现有数据,这是我能想到的最好的方法. Wit.Ai与其他语言处理工具集不同,使用现有数据教"它有点令人费解":)

That's the best I can think of about how to use your existing data. Wit.Ai is different from other language processing tool-sets and "teaching" it with existing data is somewhat "puzzling" :)

这篇关于如何将wit.ai与现有数据行一起使用?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆