如何有效地在大的二进制文件中搜索模式 [英] How to search pattern in big binary files efficiently

查看：158 发布时间：2020/8/22 19:17:06 python algorithm search binaryfiles

本文介绍了如何有效地在大的二进制文件中搜索模式的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有几个二进制文件，它们大多大于 10GB . 在此文件中，我想查找带有Python的模式，即模式0x01 0x02 0x03和0xF1 0xF2 0xF3之间的数据.

I have several binary files, which are mostly bigger than 10GB. In this files, I want to find patterns with Python, i.e. data between the pattern 0x01 0x02 0x03 and 0xF1 0xF2 0xF3.

我的问题:我知道如何处理二进制数据或如何使用搜索算法，但是由于文件的大小，首先完全读取文件效率很低.这就是为什么我认为明智的做法是按块读取文件并在块内搜索模式.

My problem: I know how to handle binary data or how I use search algorithms, but due to the size of the files it is very inefficient to read the file completely first. That's why I thought it would be smart to read the file blockwise and search for the pattern inside a block.

我的目标:我想让Python确定找到的图案的位置(开始和停止).是否可以使用一种特殊的算法甚至Python library来解决问题?

My goal: I would like to have Python determine the positions (start and stop) of a found pattern. Is there a special algorithm or maybe even a Python library that I could use to solve the problem?

如何有效地在大的二进制文件中搜索模式 [英] How to search pattern in big binary files efficiently

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

如何有效地在大的二进制文件中搜索模式 [英] How to search pattern in big binary files efficiently

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭