检测重复文件 [英] Detecting duplicate files

查看：139 发布时间：2020/6/3 20:32:58 algorithm duplicates hash

本文介绍了检测重复文件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想检测目录树中的重复文件。当找到两个相同的文件时，将仅保留一个重复项，其余的重复项将被删除以节省磁盘空间。

I'd like to detect duplicate files in a directory tree. When two identical files are found only one of the duplicates will be preserved and the remaining duplicates will be deleted to save the disk space.

重复项意味着文件内容相同

The duplicate means files having the same content which may differ in file names and path.

我当时正在考虑使用散列算法，但是有可能不同文件具有相同的哈希值，所以我需要一些额外的机制告诉我即使哈希相同，文件也不一样，因为我不想删除两个不同的文件。

I was thinking about using hash algorithms for this purpose but there is a chance that different files have the same hashes, so I need some additional mechanism to tell me that the files aren't the same even though the hashes are the same because I don't want to delete two different files.

还有哪些您将使用快速可靠的机制吗？

Which additional fast and reliable mechanism would you use?

检测重复文件 [英] Detecting duplicate files

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

检测重复文件 [英] Detecting duplicate files

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭