从不同子文件夹的文件中读取数据帧

Question

I have 12 different directories, looking something like that: 我有12个不同的目录，看起来像这样：

directories = ['FOLDER/subfolder01/*.csv', 'FOLDER/subfolder02/*csv', ... ]

12 subfolders within one major FOLDER, each one containing a set of csv files with the same data format. 一个主要文件夹中的12个子文件夹，每个子文件夹包含一组具有相同数据格式的csv文件。

I'd like to loop over it and somehow read the dataframes within each subfolder and then continue with the plots. 我想遍历它，以某种方式读取每个子文件夹中的数据框，然后继续进行绘制。

Is there a way to set the subfolders as indexes over which I can manipulate with the files? 有没有一种方法可以将子文件夹设置为可以使用文件操作的索引？

Answer 1

from glob import glob
directory_dfs = {directory_name: load_csvs_from_directory(directory_name)
                 for directory_name in glob('FOLDER/subfolder*/')}

Then just encapsulate your logic for how to read all the CSV's from a directory into the load_csvs_from_directory function, and voila, you have a labelled collection of the DataFrames built from the data in each directory. 然后，只需封装有关如何从目录中读取所有CSV的逻辑到load_csvs_from_directory函数中，瞧，您就有从每个目录中的数据构建的带标签的DataFrame集合。

从不同子文件夹的文件中读取数据帧

问题描述

1 个解决方案

解决方案1
0 2018-06-06 18:11:09

从不同子文件夹的文件中读取数据帧

问题描述

1 个解决方案

解决方案1 0 2018-06-06 18:11:09

解决方案1
0 2018-06-06 18:11:09