简体   繁体   English

将所有具有相同名称的文件放在文件夹中

[英]Put all files with same name in a folder

I'm new to Python and I need a program that copies files from the same day into a new folder. 我是Python新手,我需要一个程序将同一天的文件复制到一个新文件夹中。

Example files: 示例文件:

20120807_first_day_pic.jpg
20120807_first_day_sheet.jpg
20120807_first_day_sheet2.jpg
20120907_second_day_pic.jpg
20120907_second_day_sheet.jpg
20120907_second_day_sheet2.jpg

This is what I have so far, but every file gets a folder and not the whole day. 这是我到目前为止,但每个文件都有一个文件夹,而不是一整天。

import os, re, shutil

tfolder = 'D:/Testing/src/'

os.chdir(tfolder)

re_year19xxxxxx = re.compile('(19[0-9][0-9][0-9][0-9])')
re_year20xxxxxx = re.compile('(20[0-9][0-9][0-9][0-9])')

re_ed = re.compile('(ED[0-9])')
destPath = 'D:/Testing/Dest/'

def analyse_file_name(fname):
    filePath, coords = os.path.split(fname) #the new folders will be named according to the first 4 characters of the original file name
    coordsFolder = coords[:53]
    coordsFname = coords[:53]
    coordsExt = os.path.splitext(fname)
    year = 'year' #create variable year
    ed = 'ed' #create variable ed to store the edition number if necessary
    bname = fname #the original file name
    for re_year in (re_year19xxxxxx, re_year20xxxxxx):
        rx = re_year.search(fname) #search for regex in the file name and store it in rx
        if rx:
            year = rx.group(1) #if the regex is found, store the year
            bname.replace(year, ' ')
            res = re_ed.search(fname)
            if res:
                ed = res.group(1)
                bname.replace(ed, ' ')
        os.chdir(destPath)  
        if year is 'year':
            fname2 = os.path.join(destPath, coordsFolder) + '\\' + coordsFname + coordsExt[1]
        else:
            fname2 = os.path.join(destPath, coordsFolder,year,ed) + '\\' + coordsFname + coordsExt[1]
        print('%s -> %s' % (fname, fname2)) #debug print
        dirn, _ = os.path.split(fname2)
        if not os.path.exists(dirn):
            os.makedirs(dirn)
        shutil.copy(fname, fname2)

for root, dirs, files in os.walk(tfolder):
    for name in files:
        fn = os.path.join(root, name)
        analyse_file_name(fn)

If you just want to copy files that start with a known date string format, how about something like this? 如果您只想复制以已知日期字符串格式开头的文件,那么这样的事情怎么样?

def copyfile(filepath, target_dir):
    p, filename = os.path.split(filepath)

    # get date component of name
    date_component = filename.split("_", 1)[0]

    # try to parse out the date
    try:
        d = datetime.datetime.strptime(date_component, "%Y%m%d")
    except ValueError:  
        print "Could not place: ", filename
        return
    target_date_dir = os.path.join(target_dir, str(d.year), str(d.month), str(d.day))
    os.makedirs(target_date_dir)
    shutil.copy(filepath, target_date_dir)

First, create a dict (a defaultdict was even more convenient here) that will gather the files for a date (it's good to use re , but given the names of your files using split was easier): 首先,创建一个dict(这里的defaultdict更方便),它将收集日期的文件(使用re很好,但是使用split更容易使用文件的名称):

>>> import os
>>> import re
>>> pat = r'(\d+)(?:_\d+)?_(\w+?)[\._].*'
>>> from collections import defaultdict
>>> dict_date = defaultdict(lambda : defaultdict(list))
>>> for fil in os.listdir(path):
    if os.path.isfile(os.path.join(path, fil)):
        date, animal = re.match(pat, fil).groups()
        dict_date[date][animal].append(fil)


>>> dict_date['20120807']
defaultdict(<type 'list'>, {'first': ['20120807_first_day_pic.jpg', '20120807_first_day_sheet.jpg', '20120807_first_day_sheet2.jpg']})

Then for each date, create a subfolder and copy the corresponding files there: 然后为每个日期创建一个子文件夹并在那里复制相应的文件:

>>> from shutil import copyfile
>>> for date in dict_date:
        for animal in dict_date[date]:
        try:
            os.makedirs(os.path.join(path, date, animal))
        except os.error:
            pass
        for fil in dict_date[date][animal]:
            copyfile(os.path.join(path, fil), os.path.join(path, date, animal, fil))

EDIT : took into account OP's new requirements, and Khalid's remark. 编辑 :考虑到OP的新要求,以及Khalid的评论。

import os, shutil

src_path = "D:\\Testing\\Src\\"
dest_path = "D:\\Testing\\Dest\\"

for file in os.listdir(src_path):
    if not os.path.isdir(dest_path + file.split("-")[0]):
       os.mkdir(dest_path + file.split("-")[0])
shutil.copy(src_path + file, dest_path + file.split("-")[0])

Regex day :) What about trying to match the filename with 正则表达式:)如何尝试匹配文件名

pattern=r'(?P<filedate>(?P<year>\d{4})(?P<month>\d{2})(?P<day>\d{2}))\_(?P<bloodyrestofname>.*)'

Complete date, year, etc. may be retrieved from the respective named groups in the match. 可以从匹配中的相应命名组中检索完成日期,年份等。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Zip 同一文件夹下一个文件夹的所有文件 - Zip all the files of a folder in the same folder 重命名子目录中与文件夹名称相同的文件 - Rename files in subdirectory with the same name as the name of the folder 获取同一文件夹下的所有PDF文件名,并根据PDF文件名保存在excel中 - Get all PDF files name under same folder and save in excel according to PDF file name Python3列出与特定文件夹名称相同级别的所有文件/目录 - Python3 listing all files/directories on same level as certain folder name 如何将一堆同名文件复制到一个文件夹? - How to copy a bunch of files with same name to a folder? 如何在python的文件夹中保存同名文件? - How to save files with same name in folder in python? 使用Python 2.7.5将文件夹中的所有压缩文件解压缩到同一文件夹 - Unzip all zipped files in a folder to that same folder using Python 2.7.5 将 function 应用于文件夹中的所有文件并将结果保存在同一文件夹中 - Apply function to all files in a folder and save the results in the same folder 循环遍历具有相同根名称且位于同一文件夹中的文件 - Looping over files with same root name and in the same folder 在目录python中搜索所有具有相同名称的文件 - Search all files with same name in a directory python
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM