返回两个文件比较后添加或删除的记录 [英] Returning records that have been added or removed after two file comparison
问题描述
我正在比较两个文件,其中一个文件可能添加或删除了项目.我正在检查这两个文件之间是否存在差异,如果存在差异,那么无论是否添加或删除了记录,这些差异是什么.我想返回那些记录(添加或删除)
I am comparing two files and one of the files might have added or removed items. I am checking to see if there are differences between the two files and if so then what are those differences whether records have been added or removed. I would like to return those records (added or removed)
它告诉我文件是否删除或添加了项目,但它不会返回已添加或删除的项目.对我所缺少的任何帮助将不胜感激.
With what i have it tells me whether the file had items removed or added but it is not returning the the items that have been added or removed. Any help to what i am missing would be appreciated.
foreach (ExcelRow rowA in fileA.excelRows)
{
if (!fileB.ContainsHash(rowA.rowHash))
{
MessageBox.Show("Files are NOT the same. Data was REMOVED.\n" + rowA.ToString());
}
}
foreach (ExcelRow rowB in fileB.excelRows)
{
if (!fileA.ContainsHash(rowB.rowHash))
{
MessageBox.Show("Row added" + rowB.ToString());
}
}
public List<ExcelRow> excelRows = new List<ExcelRow>();
public bool ContainsHash(byte[] hashToLook)
{
bool found;
found = false;
foreach (ExcelRow eRow in excelRows)
{
found = EqualHash(eRow.rowHash, hashToLook);
if (found)
{
break;
}
}
return found;
}
public static bool EqualHash(byte[] hashA, byte[] hashB)
{
bool bEqual ;
int i ;
bEqual = false;
if (hashA.Length == hashB.Length)
{
i = 0;
while ((i < hashA.Length) && (hashA[i] == hashB[i]))
{
i++ ;
}
if (i == hashA.Length)
{
bEqual = true;
}
}
return bEqual ;
}
读取文件:
public ExcelInfo ReadExcel(OpenFileDialog openFileDialog)
{
var _excelFile = new ExcelQueryFactory(openFileDialog.FileName);
var _info = from c in _excelFile.WorksheetNoHeader() select c;
ExcelRow excelRow;
ExcelInfo resp;
resp = new ExcelInfo();
foreach (var item in _info)
{
excelRow = new ExcelRow();
excelRow.lstCells.Add(item.ElementAt(0));
excelRow.lstCells.Add(item.ElementAt(1));
excelRow.lstCells.Add(item.ElementAt(2));
excelRow.lstCells.Add(item.ElementAt(3));
excelRow.lstCells.Add(item.ElementAt(4));
excelRow.lstCells.Add(item.ElementAt(5));
excelRow.lstCells.Add(item.ElementAt(6));
excelRow.lstCells.Add(item.ElementAt(7));
excelRow.lstCells.Add(item.ElementAt(8));
excelRow.lstCells.Add(item.ElementAt(9));
excelRow.lstCells.Add(item.ElementAt(10));
excelRow.lstCells.Add(item.ElementAt(11));
excelRow.lstCells.Add(item.ElementAt(12));
excelRow.CalculateHash();
resp.excelRows.Add(excelRow);
}
return resp;
}
计算哈希:
public void CalculateHash()
{
byte[] rowBytes;
byte[] cellBytes;
int pos;
int numRowBytes;
numRowBytes = 0;
foreach (string cellText in lstCells)
{
numRowBytes += NumBytes(cellText);
}
//Allocate space to calculate the HASH of a single row
rowBytes = new byte[numRowBytes];
pos = 0;
//Concatenate the cellText of each row into a single byte array
foreach (string cellText in lstCells)
{
cellBytes = GetBytes(cellText);
System.Buffer.BlockCopy(cellBytes, 0, rowBytes, pos, cellBytes.Length);
pos = cellBytes.Length;
}
rowHash = new MD5CryptoServiceProvider().ComputeHash(rowBytes);
}
调试时:
if (!fileB.ContainsHash(rowA.rowHash))
fileB 包含 3 行,fileA 包含 4 行.
fileB contains three rows and fileA contains 4 rows.
文件B = 3,rowA = fileA 中的第一行并且 (.rowHash) 是 byte[16]
fileB = 3, rowA = the first row in fileA and (.rowHash) is byte[16]
当我继续使用 ContainHash 方法时,byte[] hashToLook = 16 - 这不应该是 rowA 吗?
as i continue to ContainHash method, byte[] hashToLook = 16 - shouldn't this be rowA?
excelRows = 3 (fileB)
excelRows = 3 (fileB)
然后EqualHash(eRow.rowHash, hashToLook) 是(文件A中的第一行,字节[16])
then EqualHash(eRow.rowHash, hashToLook) is (first row in fileA, byte[16])
我传入 rowA 错了吗?
am i passing in rowA wrong?
推荐答案
大功告成,只需添加两个列表来跟踪在 A 中但不在 B 中的项目和在 B 中但不在 A 中的项目:
You are almost there, just add two lists to keep track of items that are in A but not in B and items that are in B but not in A:
var notInA = new List<ExcelRow>();
var notInB = new List<ExcelRow>();
现在在您的代码中,将它们添加到适当的列表中:
Now in your code, add them to the appropriate list:
foreach (ExcelRow rowA in fileA.excelRows)
{
if (!fileB.ContainsHash(rowA.rowHash))
{
MessageBox.Show("Files are NOT the same. Data was REMOVED.\n" + rowA.ToString());
notInB.Add(rowA);
}
}
foreach (ExcelRow rowB in fileB.excelRows)
{
if (!fileA.ContainsHash(rowB.rowHash))
{
MessageBox.Show("Row added" + rowB.ToString());
notInA.Add(rowB);
}
}
这篇关于返回两个文件比较后添加或删除的记录的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!