面向对象的贝叶斯垃圾邮件过滤? [英] Object Oriented Bayesian Spam Filtering?
问题描述
我想知道是否有用于垃圾邮件和文本分类的贝叶斯过滤的良好且干净的面向对象编程(OOP)实现?这只是出于学习目的.
I was wondering if there is any good and clean object-oriented programming (OOP) implementation of Bayesian filtering for spam and text classification? This is just for learning purposes.
推荐答案
我绝对推荐 Weka 是用Java编写的开源数据挖掘软件:
I definitely recommend Weka which is an Open Source Data Mining Software written in Java:
Weka是用于数据挖掘任务的机器学习算法的集合.这些算法既可以直接应用于数据集,也可以从您自己的Java代码中调用. Weka包含用于数据预处理,分类,回归,聚类,关联规则和可视化的工具.它也非常适合开发新的机器学习方案.
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.
如上所述,它附带了许多不同的分类器,例如 SVM , Winnow , API文档). 请注意,在垃圾邮件检测或文本分类领域,许多分类器的性能要比朴素贝叶斯的性能强得多.
As mentioned above, it ships with a bunch of different classifiers like SVM, Winnow, C4.5, Naive Bayes (of course) and many more (see the API doc). Note that a lot of classifiers are known to have much better perfomance than Naive Bayes in the field of spam detection or text classification.
此外,Weka为您带来了非常功能强大的GUI …
Furthermore Weka brings you a very powerful GUI…
这篇关于面向对象的贝叶斯垃圾邮件过滤?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!