[英]How can I find files and then loop over them in Python?
我有一个包含以下代码的Bash脚本:
merge_ROOT_files(){
output_file=""$(date -u "+%Y-%m-%dT%H%M%S")"Z_merged.root"
list_of_files="$(find . -name \*.root -type f)"
command="hadd "${output_file}""
for current_file in ${list_of_files}; do
echo "found ROOT file "${current_file}""
command=""${command}" "${current_file}""
done
echo "merge ROOT files to output ROOT file "${output_file}""
echo "merge command: ${command}"
eval "${command}"
}
merge_ROOT_files
您可以看到它以递归方式搜索以.root
结尾的文件,然后循环遍历这些文件。 如何在Python中做类似的事情? 我可以想象生成各种找到的文件及其完整或相对路径的列表,然后遍历该列表,但是我不确定如何生成这样的列表。
这里有一些代码应该可以帮助您。
import os
import re
searchdir = '.'
ext_rx = '\.root$'
filelist = []
for root, dir, files in os.walk(searchdir):
for file in files:
if re.search(ext_rx, file):
filelist.append(os.path.join(root, file))
for file in filelist:
print(file)
glob使执行此类操作变得非常容易。
import os
import glob
# Get full path for .root files
root_files = glob.glob('/your_path/*.root')
# Only get root file names
root_file_names = [os.path.basename(f) for f in glob.iglob('/your_path/*.root')]
我已提取的代码从我的常用utils
import os
import os.path
def flatten( seq ) :
res = []
for item in seq :
if ( isinstance( item, ( tuple, list ) ) ) :
res.extend( flatten( item ) )
else:
res.append( item )
return res
def get_roots( path ) :
"""Get a list of .root files in a given directory.
"""
rootfiles2 = []
os.listdir( path )
for root, dirs, files in os.walk( path, topdown=True ) :
#print ( 'root =', root, ', dirs =', dirs, ', files =', files )
print ( 'root =', root )
print ( 'dirs =', dirs )
print ( 'files =', files )
# Get only .root files
rootfiles2.append( [ root + '/' + file for file in files if ( file.split('.')[-1] == 'root' ) ] )
rootfiles = list( flatten( rootfiles2 ) )
return rootfiles
这是我的处理方式:
import os
import fnmatch
import subprocess
import datetime
def merge_ROOT_files():
output_file = datetime.datetime.utcnow().strftime("%Y-%m-%dT%H%M%SZ_merged.root")
root_files = [
os.path.join(root, filename)
for root, dirs, files in os.walk('.')
for filename in files
if fnmatch.fnmatch(filename, '*.root')
]
# Remove 'echo' when you want to go live.
subprocess.check_call(['echo', 'hadd', output_file]+root_files)
if __name__ == "__main__":
merge_ROOT_files()
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.