使用python和pandas传输和编写Parquet出现时间戳错误 [英] Transfer and write Parquet with python and pandas got timestamp error

查看：294 发布时间：2020/5/24 2:09:40 python pandas parquet

本文介绍了使用python和pandas传输和编写Parquet出现时间戳错误的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我试图用python在熊猫中连接两个拼花文件.
它可以工作，但是当我尝试将数据框架写入并保存到镶木地板文件中时，它会显示错误:

I tried to concat() two parquet file with pandas in python .
It can work , but when I try to write and save the Data frame to a parquet file ,it display the error :

 ArrowInvalid: Casting from timestamp[ns] to timestamp[ms] would lose data:

我检查了文档. of pandas，它在写入镶木地板文件时默认以ms为单位的时间戳语法.
如何在concat之后使用已使用的模式对镶木地板文件进行白色处理?
这是我的代码:

I checked the doc. of pandas, it default the timestamp syntax in ms when write the parquet file.
How can I white the parquet file with used schema after concat?
Here is my code:

import pandas as pd

table1 = pd.read_parquet(path= ('path.parquet'),engine='pyarrow')
table2 = pd.read_parquet(path= ('path.parquet'),engine='pyarrow')

table = pd.concat([table1, table2], ignore_index=True) 
table.to_parquet('./file.gzip', compression='gzip')

使用python和pandas传输和编写Parquet出现时间戳错误 [英] Transfer and write Parquet with python and pandas got timestamp error

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

使用python和pandas传输和编写Parquet出现时间戳错误 [英] Transfer and write Parquet with python and pandas got timestamp error

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭