是否有看起来与英文字母相似的字符列表? [英] Is there a list of characters that look similar to English letters?

查看:557
本文介绍了是否有看起来与英文字母相似的字符列表?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

对于使用Python编写的网络论坛的亵渎过滤,我深表歉意.

I’m having a crack at profanity filtering for a web forum written in Python.

为此,我试图编写一个使用单词的函数,并返回该单词的所有可能的模拟拼写,这些拼写使用视觉上相似的字符代替特定字母(例如s†å©køv€rƒ| øw).

As part of that, I’m attempting to write a function that takes a word, and returns all possible mock spellings of that word that use visually similar characters in place of specific letters (e.g. s†å©køv€rƒ|øw).

我希望我必须随着时间的推移来扩展此列表,以涵盖人们的创造力,但是在互联网上的任何地方都可以找到一个可以用作起点的列表吗?

I expect I’ll have to expand this list over time to cover people’s creativity, but is there a list floating around anywhere on the internet that I could use as a starting point?

推荐答案

这可能比您需要的要深得多,但还不足以涵盖您的用例,但是Unicode联盟不得不应对针对国际化的攻击域名,并提出以下同形异义字(具有相同或相似渲染的字符)列表:

This is probably both vastly more deep than you need, yet not wide enough to cover your use case, but the Unicode consortium have had to deal with attacks against internationalised domain names and came up with this list of homographs (characters with the same or similar rendering):

http://www.unicode.org/Public/security/latest/confusables.txt

至少可以作为起点.

这篇关于是否有看起来与英文字母相似的字符列表?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆