Extracting Relevant Data from a Block of Text Containing Some Junk


Question

Hello.

For my job I receive txt files that look a lot like gibberish if you don't know what you are looking for (and honestly a bit like gibberish even if you do). My problem is that the text files contain long repetitions of junk with the relevant data buried in them. For example:


Recommended answer

It looks like you might be able to import the data from your text file and use the space as the field delimiter for those last lines, but as you said, it all looks like gibberish, so I can't be sure. What exactly do you want it to look like once it has been processed?
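The space-delimited import suggested above could be sketched roughly like this in Python. This is only a minimal sketch: the sample lines are made up (the actual file contents are not shown in this thread), and `split_fields` and `min_fields` are hypothetical names chosen for illustration.

```python
def split_fields(lines, min_fields=2):
    """Split each line on runs of whitespace and keep only the lines
    that yield at least `min_fields` fields (a crude junk filter)."""
    rows = []
    for line in lines:
        fields = line.split()  # no argument: splits on any whitespace run
        if len(fields) >= min_fields:
            rows.append(fields)
    return rows


# Hypothetical sample; the real data from the thread is not available.
sample = [
    "--------------------",
    "",
    "A100  42  2024-01-05",
    "B200  17  2024-01-06",
]

for row in split_fields(sample, min_fields=3):
    print(row)
```

Filtering on field count is only one way to drop junk lines; whether it is enough depends on what the junk in the real files looks like.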


Okay, so this is an image of the block of text, with the fields I want to save for the Access table highlighted. You are right about the spaces for the most part, but the second line is where that doesn't really work. Any ideas?
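For a line where splitting on spaces breaks down, one possible approach is a regular expression that anchors on the fields at either end and lets the middle field absorb any embedded spaces. The layout assumed here (an ID, a free-text name, then a number) is purely a guess for illustration, since the real second line is not shown in this thread; `LINE_PATTERN` and `parse_line` are hypothetical names.

```python
import re

# Assumed layout: <id> <name that may contain spaces> <numeric amount>
LINE_PATTERN = re.compile(
    r"^(?P<id>\S+)\s+(?P<name>.+?)\s+(?P<amount>\d+(?:\.\d+)?)$"
)


def parse_line(line):
    """Return a dict of fields, or None if the line does not match."""
    match = LINE_PATTERN.match(line.strip())
    return match.groupdict() if match else None


# Hypothetical example line, not taken from the original data.
print(parse_line("A100 Acme Tool Co 42.50"))
print(parse_line("-------- junk --------"))
```

Lines that return `None` can be treated as junk and skipped before the rows are loaded into the table.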


1) You're going to have to explain exactly what the data is and where it is coming from.

2) In this mess: