Search for a string from 100 million rows of strings


Problem description


I have a text file containing md5 hashes, 100 million rows of them. I have another, smaller file with a few thousand md5 hashes. I want to find the indices in the old, bigger file that correspond to the md5 hashes in this new, smaller file.


What is the most efficient way to do it? Is it possible to do it in 15 minutes or so?


I have tried lots of things, but they do not work. First I tried to import the bigger data into a database file and create an index on the md5 hash column. Creating this index takes forever, and I am not even sure it would speed up the queries much. Suggestions?

Recommended answer

Don't do this in a DB - use a simple program.

  1. Read the md5 hashes from the small file into a hash map in memory, which allows for fast look-ups.
  2. Then read through the md5s in the big file one row at a time, and check whether each row is in the hash map.
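The two steps above can be sketched as follows. This is a minimal sketch, not the answerer's actual code; the function name and the assumption of one lowercase hex md5 per line are hypothetical:

```python
def find_indices(small_path, big_path):
    """Return {hash: [row indices in the big file]} for hashes
    that appear in the small file. Assumes one md5 hash per line."""
    # Step 1: load the small file's hashes into a set for O(1) membership tests.
    with open(small_path) as f:
        wanted = {line.strip() for line in f if line.strip()}

    # Step 2: stream the big file one row at a time, never holding it
    # all in memory, and record the index of every matching row.
    indices = {}
    with open(big_path) as f:
        for i, line in enumerate(f):
            h = line.strip()
            if h in wanted:
                indices.setdefault(h, []).append(i)
    return indices
```

A Python `set` (or `dict`) is itself a hash table, so membership tests average O(1); the streaming read keeps memory proportional to the small file only.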


Average look-up time in the hash map should be close to O(1), so the processing time is basically bounded by how fast you can read through the big file.


The 15-minute target is easily met on today's hardware with this approach.
