分类中如何处理低频实例? [英] How to deal with low frequency examples in classification?

查看：68 发布时间：2020/5/4 9:50:57 machine-learning classification

本文介绍了分类中如何处理低频实例?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正面临文本分类问题，我需要将示例分类为34个组.

I'm facing a text classification problem, and I need to classify examples to 34 groups.

问题是，34组训练数据的大小不平衡.对于某些小组，我有2000多个示例，而对于某些小组，我只有100多个示例.

The problem is, the size of training data of 34 groups are not balanced. For some groups I have 2000+ examples, while for some I only have 100+ examples.

对于某些小组，分类准确度很高.我想这些小组可能有特定的关键词来识别和分类.对于某些人来说，准确度很低，而且预测总是针对大群人.

For some small groups, the classification accuracy is quite high. I guess those groups may have specific key words to recognize and classify. While for some, the accuracy is low, and the prediction always goes to large groups.

我想知道如何处理低频示例问题".是否会简单地复制和复制小组数据工作?还是我需要选择训练数据并扩展和平衡数据大小?有什么建议吗?

I want to know how to deal with the "low frequency example problem". Would simply copy and duplicate the small group data work? Or I need to choose the training data and expand and balance the data size? Any suggestions?

分类中如何处理低频实例? [英] How to deal with low frequency examples in classification?

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

分类中如何处理低频实例? [英] How to deal with low frequency examples in classification?

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭