簡體   English   中英

將特定行從多個文本文件復制到Excel文件

[英]Copy specific lines from multiple text files to an excel file

我有多達1500個文本文件,我想從每個文本文件中復制5行,例如第4、5、9、14和32行。我想將這些文件的列在excel表格中一個放在另一個的下面1500個文本文件。 我想出了一個僅接收一個txt文件但將所有數據復制到行中的代碼。 任何幫助將不勝感激。 這是我的代碼:

import csv
import xlwt

import os
import sys

# Look for input file in same location as script file:
inputfilename = os.path.join(os.path.dirname(sys.argv[0]), 
'C:/path/filename.txt')
# Strip off the path
basefilename = os.path.basename(inputfilename)
# Strip off the extension
basefilename_noext = os.path.splitext(basefilename)[0]
# Get the path of the input file as the target output path
targetoutputpath = os.path.dirname(inputfilename)
# Generate the output filename
outputfilename = os.path.join(targetoutputpath, basefilename_noext + '.xls')

# Create a workbook object
workbook = xlwt.Workbook()
# Add a sheet object
worksheet = workbook.add_sheet(basefilename_noext, cell_overwrite_ok=True)

# Get a CSV reader object set up for reading the input file with tab 
delimiters
datareader = csv.reader(open(inputfilename, 'rb'),
                    delimiter='\t', quotechar='"')

# Process the file and output to Excel sheet

for rowno, row in enumerate(datareader):
  for colno, colitem in enumerate(row):

     worksheet.write(rowno, colno, colitem)

 # Write the output file.
 workbook.save(outputfilename)

# Open it via the operating system (will only work on Windows)
# On Linux/Unix you would use subprocess.Popen(['xdg-open', filename])
os.startfile(outputfilename)

您首先需要將所有需要的文本文件放在當前文件夾中,然后可以使用glob.glob('*.txt')獲取這些文件名的列表。 對於每個文本文件,使用readlines()讀取文件,並使用itemgetter()提取所需的行。 對於每個文件,在輸出工作表中創建一個新行,並將每一行寫為不同的列條目。

import xlwt
import glob
import operator

# Create a workbook object
wb = xlwt.Workbook()
# # Add a sheet object
ws = wb.add_sheet('Sheet1', cell_overwrite_ok=True)
rowy = 0

for text_filename in glob.glob('*.txt'):
    with open(text_filename) as f_input:
        try:
            lines = [line.strip() for line in operator.itemgetter(4, 5, 9, 14, 32)(f_input.readlines())]
        except IndexError as e:
            print "'{}' is too short".format(text_filename)
            lines = []

    # Output to Excel sheet
    for colno, colitem in enumerate(lines):
        ws.write(rowy, colno, colitem)

    rowy += 1

# Write the output file.
wb.save('output.xls')

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM