使用jq，如何根据对象属性的值将对象的JSON流拆分为单独的文件? [英] Using jq, how can I split a JSON stream of objects into separate files based on the values of an object property?

查看：92 发布时间：2021/2/12 20:45:22 json bash stream jq partitioning

本文介绍了使用jq，如何根据对象属性的值将对象的JSON流拆分为单独的文件?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个名为input.json的超大文件(压缩了20GB以上)，其中包含JSON对象流，如下所示:

I have a very large file (20GB+ compressed) called input.json containing a stream of JSON objects as follows:

{
    "timestamp": "12345",
    "name": "Some name",
    "type": "typea"
}
{
    "timestamp": "12345",
    "name": "Some name",
    "type": "typea"
}
{
    "timestamp": "12345",
    "name": "Some name",
    "type": "typeb"
}

我想根据该文件的type属性将其拆分为文件:typea.json，typeb.json等，每个文件都包含它们自己的json对象流，而这些对象仅具有匹配的type属性.

I want to split this file into files dependent on their type property: typea.json, typeb.json etc., each containing their own stream of json objects that only have the matching type property.

我已经设法解决了较小文件的问题，但是对于如此大的文件，我的AWS实例上的内存不足.我希望降低内存使用量，所以我知道我需要使用--stream，但是我正在努力寻找如何实现这一目标的方法.

I've managed to solve this problem for smaller files, however with such a large file I run out of memory on my AWS instance. As I wish to keep memory usage down, I understand I need to use --stream but I'm struggling to see how I can achieve this.

cat input.json | jq -c --stream 'select(.[0][0]=="type") | .[1]'将为我返回每个类型属性的值，但是如何使用它来过滤对象?

cat input.json | jq -c --stream 'select(.[0][0]=="type") | .[1]' will return me the values of each of the type properties, but how do I use this to then filter the objects?

任何帮助将不胜感激！

使用jq，如何根据对象属性的值将对象的JSON流拆分为单独的文件? [英] Using jq, how can I split a JSON stream of objects into separate files based on the values of an object property?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

使用jq，如何根据对象属性的值将对象的JSON流拆分为单独的文件? [英] Using jq, how can I split a JSON stream of objects into separate files based on the values of an object property?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭