Effective way to read a file and parse each line

Views: 98 Posted: 2020/5/18 0:19:19 Tags: java nio java-io


Problem description

I have a text file in the following format: each line starts with a string, which is followed by a sequence of numbers. The length of each line is unknown (it contains anywhere from 0 to 1000 numbers).

string_1 3 90 12 0 3
string_2 49 0 12 94 13 8 38 1 95 3
.......
string_n 9 43

Afterwards I must handle each line with the handleLine method, which accepts two arguments: the string name and the set of numbers (see the code below).

How can I read the file and handle each line with handleLine efficiently?

My approach:

1. Read the file line by line with the Java 8 stream Files.lines. Is it blocking?
2. Split each line with a regexp.
3. Convert each line into a header string and a set of numbers.

I think this is rather inefficient because of the 2nd and 3rd steps. The 1st step means that Java first converts the file bytes to a String, and then in the 2nd and 3rd steps I convert that String again into a String and a Set<Integer>. Does that affect performance much? If so, how can I do better?

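import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedList;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public void handleFile(String filePath) {
    // try-with-resources closes the underlying file when the stream is done
    try (Stream<String> stream = Files.lines(Paths.get(filePath))) {
        stream.forEach(this::handleLine);
    } catch (IOException e) {
        e.printStackTrace();
    }
}

private void handleLine(String line) {
    List<String> resultList = this.parse(line);
    String string_i = resultList.remove(0); // first token is the name
    Set<Integer> numbers = resultList.stream().map(Integer::valueOf).collect(Collectors.toSet());
    handleLine(string_i, numbers); // Here is the final computation, which must be done only with the string_i and numbers arguments
}

private List<String> parse(String str) {
    List<String> output = new LinkedList<String>();
    // Match whitespace-delimited tokens so that a name such as "string_1" stays in one piece
    Matcher match = Pattern.compile("\\S+").matcher(str);
    while (match.find()) {
        output.add(match.group());
    }
    return output;
}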
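
For comparison, one way to reduce the intermediate work in steps 2 and 3 is to scan each line once, character by character, without a regular expression or an intermediate token list. The following is only a minimal sketch under stated assumptions, not a measured improvement: the class name LineParser, the BiConsumer callback standing in for handleLine(String, Set<Integer>), whitespace as the separator, UTF-8 as the file encoding, and non-negative integers that fit in an int are all assumptions that the description above does not confirm.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.Set;
import java.util.function.BiConsumer;

// Hypothetical helper: all names here are illustrative, not from the original code.
public class LineParser {

    // Parses "name n1 n2 ..." in a single pass and hands the result to the handler.
    // Assumes whitespace-separated tokens and non-negative decimal integers that fit in an int.
    static void parseLine(String line, BiConsumer<String, Set<Integer>> handler) {
        int len = line.length();
        int i = 0;
        while (i < len && Character.isWhitespace(line.charAt(i))) i++;
        if (i == len) return; // skip blank lines

        // The first token is the name.
        int start = i;
        while (i < len && !Character.isWhitespace(line.charAt(i))) i++;
        String name = line.substring(start, i);

        // Every remaining token is accumulated digit by digit into an int.
        Set<Integer> numbers = new HashSet<>();
        while (i < len) {
            while (i < len && Character.isWhitespace(line.charAt(i))) i++;
            if (i == len) break;
            int value = 0;
            while (i < len && !Character.isWhitespace(line.charAt(i))) {
                char c = line.charAt(i);
                if (c < '0' || c > '9') {
                    throw new NumberFormatException("Unexpected character '" + c + "' in: " + line);
                }
                value = value * 10 + (c - '0');
                i++;
            }
            numbers.add(value);
        }
        handler.accept(name, numbers);
    }

    // Reads the file line by line with a buffered reader (assumed to be UTF-8).
    static void handleFile(String filePath, BiConsumer<String, Set<Integer>> handler) throws IOException {
        try (BufferedReader reader = Files.newBufferedReader(Paths.get(filePath), StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                parseLine(line, handler);
            }
        }
    }
}
```

Whether this is noticeably faster than the regexp version depends on the file size and on how much work the final handleLine computation does, so it should be measured on real data before replacing the simpler code.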
