c# - 如何确定文件是二进制文件还是文本文件? [英] How can I determine if a file is binary or text in c#?
问题描述
我需要确定 80% 的文件是二进制文件还是文本文件,有什么方法可以在 C# 中快速、肮脏/丑陋地做到这一点?
I need to determine in 80% if a file is binary or text, is there any way to do it even quick and dirty/ugly in c#?
推荐答案
我可能会寻找大量的控制字符,这些字符通常出现在二进制文件中,但很少出现在文本文件中.二进制文件往往使用 0 足够多,以至于仅测试许多 0 字节可能就足以捕获大多数文件.如果您关心本地化,则还需要测试多字节模式.
I would probably look for an abundance of control characters which would typically be present in a binary file but rarely in an text file. Binary files tend to use 0 enough that just testing for many 0 bytes would probably be sufficient to catch most files. If you care about localization you'd need to test multi-byte patterns as well.
尽管如此,你总是会不走运,得到一个看起来像文本的二进制文件,反之亦然.
As stated though, you can always be unlucky and get a binary file that looks like text or vice versa.
这篇关于c# - 如何确定文件是二进制文件还是文本文件?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!