Spam protection algorithm for SMS


Question

I am developing an Android messaging application. Is there a good spam filtering algorithm that works well for SMS? Please suggest something to get me started.

Rahim.

Recommended answer

I don't think there is a set algorithm that can tell you definitively whether a user considers an SMS to be spam (an ad in an SMS can be important to some users and spam to others). What you can do, however, is what Google does to identify spam mail.

You could allow the user to mark an SMS as spam or not spam. Then, based on the content the user has already marked as spam, you can decide whether the user would consider a new message spam.
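One way to learn from those marks is a small naive Bayes classifier trained only on the user's own decisions. The sketch below is an illustration, not something taken from the question or the PDF: the class name `UserSpamFilter`, its methods, and the tokenizer are all made up for this example, and a real app would need far more labelled messages than shown here.

```python
import math
import re
from collections import Counter


class UserSpamFilter:
    """Tiny multinomial naive Bayes filter trained on one user's own labels.

    Hypothetical helper for illustration; not part of any Android API.
    """

    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.message_counts = {"spam": 0, "ham": 0}

    @staticmethod
    def tokenize(text):
        # Lowercase and keep runs of letters, digits and apostrophes
        return re.findall(r"[a-z0-9']+", text.lower())

    def mark(self, text, is_spam):
        """Record the user's decision for one message."""
        label = "spam" if is_spam else "ham"
        self.message_counts[label] += 1
        self.word_counts[label].update(self.tokenize(text))

    def spam_probability(self, text):
        """Return P(spam | text) under the naive Bayes model."""
        total = sum(self.message_counts.values())
        if total == 0:
            return 0.5  # nothing learned yet, so stay neutral
        vocabulary = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        log_scores = {}
        for label in ("spam", "ham"):
            # Smoothed prior, so a class with no examples is not impossible
            log_score = math.log((self.message_counts[label] + 1) / (total + 2))
            denominator = sum(self.word_counts[label].values()) + len(vocabulary)
            for word in self.tokenize(text):
                if word not in vocabulary:
                    continue  # words never seen in training carry no evidence
                # Laplace-smoothed word likelihood
                count = self.word_counts[label][word]
                log_score += math.log((count + 1) / denominator)
            log_scores[label] = log_score
        # Turn the two log scores into a probability without overflow
        highest = max(log_scores.values())
        spam = math.exp(log_scores["spam"] - highest)
        ham = math.exp(log_scores["ham"] - highest)
        return spam / (spam + ham)
```

Each time the user taps "spam" or "not spam", call `mark()`; for an incoming message, compare `spam_probability()` against a threshold of your choosing (0.5 is only a starting point).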

Edit: The closest thing to what you are looking for that I found is this PDF on Content Based SMS Spam Filtering.

It's not an algorithm, but rather a list of things you should keep in mind.

Quote from the PDF:

The most popular techniques used to reduce spam nowadays include the following ones.

White and black listing. The senders occurring in a black list (e.g. RBL) are considered spammers, and their messages blocked. The messages from senders in a white list (e.g. the address book, or the provider itself – Hotmail) are considered legitimate, and thus delivered.

Collaborative filtering. When a user tags a message as spam, this is considered spam for users similar to him/her. Alternatively, the service provider considers that massive messages are spam.

Digital signatures. Messages without a digital signature are considered spam. Digital signatures can be provided by the sender or the service provider.

Content-based filtering. The most used method. Each message is searched for spam features, like indicative words (e.g. "free", "viagra", etc.), unusual distribution of punctuation marks and capital letters (like e.g. in "BUY!!!!!!"), etc.
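As a rough illustration of how two of the quoted techniques could be combined, the sketch below checks sender lists first and falls back to simple content features. Everything in it is an assumption made for the example: the function names, the word list (seeded only with the two words the quote mentions), and the thresholds are not from the PDF and would need tuning on real messages.

```python
import re

# Seeded with the examples from the quoted text; extend for real use
INDICATIVE_WORDS = {"free", "viagra"}


def content_features(text):
    """Extract the content-based signals named in the quoted passage."""
    words = re.findall(r"[a-z']+", text.lower())
    letters = [c for c in text if c.isalpha()]
    return {
        "indicative_words": sum(1 for w in words if w in INDICATIVE_WORDS),
        "exclamation_ratio": text.count("!") / max(len(text), 1),
        "capital_ratio": sum(1 for c in letters if c.isupper()) / max(len(letters), 1),
    }


def classify(sender, text, whitelist, blacklist,
             max_exclamation_ratio=0.2, max_capital_ratio=0.6):
    """Return 'spam' or 'ham'. Sender lists are checked before content.

    The thresholds are illustrative guesses, not values from the PDF.
    """
    if sender in blacklist:
        return "spam"
    if sender in whitelist:
        return "ham"
    features = content_features(text)
    if features["indicative_words"] > 0:
        return "spam"
    if features["exclamation_ratio"] > max_exclamation_ratio:
        return "spam"
    # Only judge capitalisation on messages long enough to be meaningful
    if len(text) >= 8 and features["capital_ratio"] > max_capital_ratio:
        return "spam"
    return "ham"
```

The white list could simply be the numbers in the user's address book, as the quote suggests. Hard keyword rules like these are crude; in practice the extracted features are more useful as inputs to a trained classifier than as yes/no rules.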

There is a lot of good info in there. Check it out.
