Questions about creating Stanford CoreNLP training models

Views: 157 Published: 2020/8/6 3:02:10 Tags: stanford-nlp sentiment-analysis training-data scoring


Problem description

I've been working with Stanford's CoreNLP to perform sentiment analysis on some data I have, and I'm working on creating a training model. I know we can create a training model with the following command:

java -mx8g edu.stanford.nlp.sentiment.SentimentTraining -numHid 25 -trainPath train.txt -devPath dev.txt -train -model model.ser.gz

I know what goes in the train.txt file. You score sentences and put them in train.txt, one bracketed tree per sentence, something like this: (0 (2 Today) (0 (0 (2 is) (0 (2 a) (0 (0 bad) (2 day)))) (..)))

But I don't understand what goes in the dev.txt file. I read through this question several times to try to understand what goes in dev.txt, but it's still unclear to me. Also, scoring these sentences manually has become a pain. Is there a tool available that makes it easier? I'm worried that I've been using the wrong number of parentheses or making some other silly mistake like that.
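The parenthesis worry, at least, can be checked mechanically. The following is a minimal sketch in Python, not a CoreNLP tool: `check_tree`, `check_file`, and `LABELS` are hypothetical names, and the label set 0 to 4 is an assumption based on the five-class scheme, so adjust it if your scores differ. It only verifies that brackets balance and that every opening bracket is followed by a label; it does not check that the tree structure is otherwise what SentimentTraining expects.

```python
import re

# Assumption: scores are the five classes 0-4; change this set if yours differ.
LABELS = {"0", "1", "2", "3", "4"}

# Split a line into "(", ")", and whitespace-free tokens (labels and words).
TOKEN = re.compile(r"[()]|[^\s()]+")


def check_tree(line):
    """Return a list of problems found in one bracketed tree (empty if none)."""
    tokens = TOKEN.findall(line)
    problems = []
    depth = 0
    for i, tok in enumerate(tokens):
        if tok == "(":
            depth += 1
            # Every opening bracket should be followed directly by a score.
            label = tokens[i + 1] if i + 1 < len(tokens) else None
            if label not in LABELS:
                problems.append(f"token {i}: '(' followed by {label!r}, expected a label")
        elif tok == ")":
            depth -= 1
            if depth < 0:
                problems.append(f"token {i}: ')' without a matching '('")
                depth = 0  # reset so later brackets are still checked
    if depth > 0:
        problems.append(f"{depth} unclosed '('")
    return problems


def check_file(path):
    """Print problems per line of a tree file and return how many lines failed."""
    bad_lines = 0
    with open(path, encoding="utf-8") as handle:
        for number, line in enumerate(handle, start=1):
            if not line.strip():
                continue  # skip blank lines
            problems = check_tree(line)
            if problems:
                bad_lines += 1
                print(f"line {number}: " + "; ".join(problems))
    return bad_lines
```

Calling `check_file("train.txt")` before training would list the line numbers that need fixing. Note that a placeholder such as `(..)` is reported as a missing label, which is the intended behavior for an unfinished tree.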

Also, any suggestions on how long my train.txt file should be? I'm thinking of scoring 1,000 sentences. Is that number too small or too large?

Thanks for all your help :)
