Bash Directory Sorting Issue - Removing Duplicate Lines?


Question

I'm using this command to merge multiple identical directories and to remove duplicate lines from each of the corresponding files:

for f in app1/*; do 
   bn="$(basename "$f")"
   sort -u "$f" "app2/$bn" > "app/$bn"
done

Is there a way to edit this so that it checks the lines of all the files and removes all the duplicates as well? I do need to keep the existing file structure with individual files.

The end result creates a directory with 300 text files that's no larger than 30mb.

Example:

**Directory app1**
*1.txt*       
a
b
c

*2.txt*
d
e
f

**Directory app2**
*1.txt*
a
b
c
g

*2.txt*
a
b
c
d
e
f

**Results in Directory app**
*1.txt*
a
b
c
g

*2.txt*
a
b
c
d
e
f

Desired Result in Directory app Should Be:
*1.txt*
a
b
c
g

*2.txt*
d
e
f

As you can see, it's not removing the duplicate "a b c" lines from 2.txt when they are also found in 1.txt. All lines in each file should remain unique, and all duplicates should be removed.

Answer



> As you can see, it's not removing the duplicate "a b c" lines from 2.txt when they are also found in 1.txt. All lines in each file should remain unique, and all duplicates should be removed.

You can accomplish this goal by applying 7171u's answer to your other question "Unix Bash Remove Duplicate Lines From Directory Files?" to the result of your command above (after having changed the tmp/* in his script to app/*, which should be trivial).
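The referenced answer isn't reproduced here, so the following is only a sketch of one way to do the cross-file pass: run the original merge first, then walk the output files in name order and drop any line that already appeared in an earlier file. The temporary "seen" file and the `awk` filter are my additions, not part of the linked answer; the directory names match the question.

```shell
#!/bin/sh
# Recreate the example from the question (assumed layout).
mkdir -p app1 app2 app
printf 'a\nb\nc\n'          > app1/1.txt
printf 'd\ne\nf\n'          > app1/2.txt
printf 'a\nb\nc\ng\n'       > app2/1.txt
printf 'a\nb\nc\nd\ne\nf\n' > app2/2.txt

# Step 1: the original merge, deduplicating within each file pair.
for f in app1/*; do
    bn="$(basename "$f")"
    sort -u "$f" "app2/$bn" > "app/$bn"
done

# Step 2: cross-file dedup, keeping the per-file structure.
# Files are processed in glob (name) order; earlier files win.
seen=$(mktemp)
for f in app/*; do
    # keep only lines of "$f" not yet recorded in the seen-list
    awk -v sf="$seen" '
        BEGIN { while ((getline l < sf) > 0) seen[l] }
        !($0 in seen)
    ' "$f" > "$f.tmp"
    cat "$f.tmp" >> "$seen"   # remember these lines for later files
    mv "$f.tmp" "$f"
done
rm -f "$seen"
```

With the example data this leaves `app/1.txt` containing `a b c g` and `app/2.txt` containing only `d e f`, matching the desired result. Note that "earlier" is decided by filename order, so which copy of a duplicated line survives depends on how the files sort.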
