在一个图中绘制来自多个 Pandas 数据框的数据 [英] Plotting data from multiple pandas data frames in one plot

查看：55 发布时间：2021/6/13 20:50:20 python pandas plot

本文介绍了在一个图中绘制来自多个 Pandas 数据框的数据的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有兴趣使用来自几个不同 Pandas 数据框的数据绘制时间序列.我知道如何为单个时间序列绘制数据，我知道如何绘制子图，但是我如何设法从单个图中的多个不同数据框中进行绘制?我在下面有我的代码.基本上我正在做的是我正在扫描一个包含 json 文件的文件夹并将该 json 文件解析为一个熊猫，以便我可以绘图.当我运行这段代码时，它只是从其中一只熊猫而不是创建的十只熊猫进行绘图.我知道创建了 10 个熊猫，因为我有一个打印语句来确保它们都是正确的.

I am interested in plotting a time series with data from several different pandas data frames. I know how to plot a data for a single time series and I know how to do subplots, but how would I manage to plot from several different data frames in a single plot? I have my code below. Basically what I am doing is I am scanning through a folder of json files and parsing that json file into a panda so that I can plot. When I run this code it is only plotting from one of the pandas instead of the ten pandas created. I know that 10 pandas are created because I have a print statement to ensure they are all correct.

import sys, re
import numpy as np
import smtplib
import matplotlib.pyplot as plt
from random import randint
import csv
import pylab as pl
import math
import pandas as pd
from pandas.tools.plotting import scatter_matrix
import argparse
import matplotlib.patches as mpatches
import os
import json



parser = argparse.ArgumentParser()
parser.add_argument('-file', '--f', help = 'folder where JSON files are stored')
if len(sys.argv) == 1:
    parser.print_help()
    sys.exit(1)
args = parser.parse_args()


dat = {}
i = 0

direc = args.f
directory = os.fsencode(direc)

fig1 = plt.figure()
ax1 = fig1.add_subplot(111)

for files in os.listdir(direc):
    filename = os.fsdecode(files)
    if filename.endswith(".json"):
        path = '/Users/Katie/Desktop/Work/' + args.f + "/" +filename
        with open(path, 'r') as data_file:
            data = json.load(data_file)
            for r in data["commits"]:
                dat[i] = (r["author_name"], r["num_deletions"], r["num_insertions"], r["num_lines_changed"],
                          r["num_files_changed"], r["author_date"])
                name = "df" + str(i).zfill(2)
                i = i + 1
                name = pd.DataFrame.from_dict(dat, orient='index').reset_index()
                name.columns = ["index", "author_name", "num_deletions",
                                          "num_insertions", "num_lines_changed",
                                          "num_files_changed",  "author_date"]
                del name['index']
                name['author_date'] = name['author_date'].astype(int)
                name['author_date'] =  pd.to_datetime(name['author_date'], unit='s')
                ax1.plot(name['author_date'], name['num_lines_changed'], '*',c=np.random.rand(3,))
                print(name)
                continue

    else:
        continue
plt.xticks(rotation='35')
plt.title('Number of Lines Changed vs. Author Date')
plt.show()

在一个图中绘制来自多个 Pandas 数据框的数据 [英] Plotting data from multiple pandas data frames in one plot

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

在一个图中绘制来自多个 Pandas 数据框的数据 [英] Plotting data from multiple pandas data frames in one plot

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭