C#消毒文件名 [英] C# Sanitize File Name
问题描述
我最近一直在移动从不同位置一堆MP3音乐到存储库。我一直在使用ID3标签(!谢谢,标签库,夏普)构建新的文件名,而我注意到,我得到一个 System.NotSupportedException
:
I recently have been moving a bunch of MP3s from various locations into a repository. I had been constructing the new file names using the ID3 tags (thanks, TagLib-Sharp!), and I noticed that I was getting a System.NotSupportedException
:
的指定路径的格式不被支持。的
这是由生成两种 File.Copy()
或 Directory.CreateDirectory()
。
这没多久就意识到,我需要的文件名进行消毒。所以我做了明显的事情:
It didn't take long to realize that my file names needed to be sanitized. So I did the obvious thing:
public static string SanitizePath_(string path, char replaceChar)
{
string dir = Path.GetDirectoryName(path);
foreach (char c in Path.GetInvalidPathChars())
dir = dir.Replace(c, replaceChar);
string name = Path.GetFileName(path);
foreach (char c in Path.GetInvalidFileNameChars())
name = name.Replace(c, replaceChar);
return dir + name;
}
要我的意料,我继续得到例外。原来,:是不是在集合 Path.GetInvalidPathChars的()
,因为它是在一个根路径有效。我想这是有道理的 - 但这必须是pretty通病。有没有人有一些短期code,它进行消毒的路径?最彻底的,我想出了这一点,但感觉它可能是矫枉过正。
To my surprise, I continued to get exceptions. It turned out that ':' is not in the set of Path.GetInvalidPathChars()
, because it is valid in a path root. I suppose that makes sense - but this has to be a pretty common problem. Does anyone have some short code that sanitizes a path? The most thorough I've come up with this, but it feels like it is probably overkill.
// replaces invalid characters with replaceChar
public static string SanitizePath(string path, char replaceChar)
{
// construct a list of characters that can't show up in filenames.
// need to do this because ":" is not in InvalidPathChars
if (_BadChars == null)
{
_BadChars = new List<char>(Path.GetInvalidFileNameChars());
_BadChars.AddRange(Path.GetInvalidPathChars());
_BadChars = Utility.GetUnique<char>(_BadChars);
}
// remove root
string root = Path.GetPathRoot(path);
path = path.Remove(0, root.Length);
// split on the directory separator character. Need to do this
// because the separator is not valid in a filename.
List<string> parts = new List<string>(path.Split(new char[]{Path.DirectorySeparatorChar}));
// check each part to make sure it is valid.
for (int i = 0; i < parts.Count; i++)
{
string part = parts[i];
foreach (char c in _BadChars)
{
part = part.Replace(c, replaceChar);
}
parts[i] = part;
}
return root + Utility.Join(parts, Path.DirectorySeparatorChar.ToString());
}
任何改进,使这一功能更快,更巴洛克风格会更AP preciated。
Any improvements to make this function faster and less baroque would be much appreciated.
推荐答案
要清理,你可以做到这一点的文件名
To clean up a file name you could do this
private static string MakeValidFileName( string name )
{
string invalidChars = System.Text.RegularExpressions.Regex.Escape( new string( System.IO.Path.GetInvalidFileNameChars() ) );
string invalidRegStr = string.Format( @"([{0}]*\.+$)|([{0}]+)", invalidChars );
return System.Text.RegularExpressions.Regex.Replace( name, invalidRegStr, "_" );
}
这篇关于C#消毒文件名的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!