如何访问 python 中文件名中具有相同字符串的文件夹中的文件？

Question

我正在尝试使用 python 来查看目录文件夹并匹配文件名中具有相同字符串的文件。 此文件夹中的每个感兴趣的文件都是一个“.csv”文件，其中包含一个值列， Value_Blue用于蓝色文件， Value_Red用于红色文件。 The files in this folder go: Blue_111.csv, Blue_124.csv, Blue_145.csv, Blue_165.csv, Blue_176.csv... and then: Red_111.csv, Red_124.csv, Red_145.csv, Red_165.csv, Red_176. csv...等等。 如图所示，与这些文件中的每一个相关联的数字不是等间隔顺序的 go，但这与此处无关。 对于大多数蓝色文件，有一个匹配的红色文件，文件名附加了相同的编号扩展名。 因此，有些蓝色文件没有对应的红色文件。

我要做的是遍历目录文件夹中的所有蓝色文件，将它们作为数据帧打开，然后找到匹配的红色文件，将该文件作为 Z6A8064B5DF4794555500553C47C55057DZ 打开，然后将这两个数据帧中的Value列相乘，然后将新的 dataframe 发送到新的 csv 文件名包含相同的扩展名。

例如，如果在循环中它以 Blue_111.csv 开头，那么我希望它找到 Red_111.csv。 我希望将这两个.csv 文件作为数据框打开，并且Value列成倍增加。 I then want to send this newly calculated dataframe to a new.csv called `Green_111.csv, and then keep going in the loop onto Blue_124.csv, etc.

这是示例我的目标的伪代码：

folder = Path/to/Directory/Folder

for f in folder that is a .csv with "Blue" in filename:
     blue_df = pd.read_csv(f)  
     red = matching Red file
     red_df = pd.read_csv(red)
     green_df = blue_df.join(red_df) 
     green_df = green_df['Value_Blue'] * green_df['Value_Red']
     green_df.to_csv(Path/to/Directory/Folder/Green_*matching_number*.csv)

如何匹配文件，然后在文件名中创建具有相同匹配扩展名的计算 output 文件？

Answer 1

使用glob.glob()匹配所有匹配通配符模式的文件名。 然后您可以使用.replace()将Blue替换为Red和Green以创建其他文件名。

import glob, os

folder = 'Path/to/Directory/Folder'

for blue in glob.glob(os.path.join(folder, "Blue_*.csv")):
    blue_df = pd.read_csv(blue)
    red = blue.replace("Blue_", "Red_")
    green = blue.replace("Blue_", "Green_")
    red_df = pd.read_csv(red)
    green_df = blue_df.join(red_df) 
    green_df = green_df['Value_Blue'] * green_df['Value_Red']
    green_df.to_csv(green)

如何访问 python 中文件名中具有相同字符串的文件夹中的文件？

问题描述

1 个解决方案

解决方案1
2 已采纳 2022-01-27 00:40:18

如何访问 python 中文件名中具有相同字符串的文件夹中的文件？

问题描述

1 个解决方案

解决方案1 2 已采纳 2022-01-27 00:40:18

解决方案1
2 已采纳 2022-01-27 00:40:18