在python 2.7中-如何從csv讀取數據，重新格式化數據並寫入新的csv

Question

我是一個正在編程的菜鳥，我被困住了...我的目標是打開一個csv文件（從程序A導出），重新格式化數據，並將其寫入新的csv文件中（以供程序B導入）。 我知道我的代碼不是很漂亮，盡管它可以工作到寫入新的csv文件。 它僅寫入舊csv中的最后一行數據。

import csv

print 'Enter file location or press enter for default location.'
aPath = raw_input('> ')  # user path to file
if aPath == '':
    aPath = '/default/path/to/file'  # default path
aFile = raw_input('Enter file name: ')  # user file name
if aFile == '':
    aFile = 'orginal.csv'  # default file name
myCSV = csv.DictReader(open(aPath + aFile, 'r'), delimiter=',')
print myCSV.fieldnames # verify fieldnames

for p in myCSV:
    w = csv.writer(open("output.csv", "w"))
    try:
        p = dict((k, v) for k, v in p.iteritems()
            if v.lower() != 'null')
    except AttributeError, e:
        print e
        print p
        raise Exception()
# reformats the columns of data for the output.csv
    pID = p.get('Last Name')[-4:]
    pName = p.get('Last Name')[:-4].strip() + ', ' + p.get('First Name')
    pDate = p.get('Start Time')[:-5]
    pBlank = p.get('')
    pCourse = p.get('Assigned Through')
    pScore = p.get('Score')[:-1]

# verifies the new columns
    print pID, pName, pDate, pBlank, pCourse, pBlank, pBlank, pScore

# NOT working...Only writes the last row of the orginal.csv to output.csv
    w.writerow([pID, pName, pDate, pBlank, pCourse, pBlank, pBlank, pScore])

Answer 1

您將重新打開文件，並for p in myCSV循環中的for p in myCSV每次傳遞時將其覆蓋。 進入for循環之前，您需要創建一次w 。

Answer 2

以下修改后的代碼應該可以工作：

import csv

print 'Enter file location or press enter for default location.'
aPath = raw_input('> ')  # user path to file
if aPath == '':
    aPath = '/default/path/to/file'  # default path
aFile = raw_input('Enter file name: ')  # user file name
if aFile == '':
    aFile = 'orginal.csv'  # default file name
myCSV = csv.DictReader(open(aPath + aFile, 'r'), delimiter=',')
print myCSV.fieldnames # verify fieldnames

with open("output.csv", "wb") as ofile:
    w = csv.writer(ofile)
    for p in myCSV:
        try:
            p = dict((k, v) for k, v in p.iteritems()
                if v.lower() != 'null')
        except AttributeError, e:
            print e
            print p
            raise Exception()
    # reformats the columns of data for the output.csv
        pID = p.get('Last Name')[-4:]
        pName = p.get('Last Name')[:-4].strip() + ', ' + p.get('First Name')
        pDate = p.get('Start Time')[:-5]
        pBlank = p.get('')
        pCourse = p.get('Assigned Through')
        pScore = p.get('Score')[:-1]

    # verifies the new columns
        print pID, pName, pDate, pBlank, pCourse, pBlank, pBlank, pScore

    # NOT working...Only writes the last row of the orginal.csv to output.csv
        w.writerow([pID, pName, pDate, pBlank, pCourse, pBlank, pBlank, pScore])

我們改變的事情：

我們使用了with運算符。 當使用打開和關閉文件進行操作時，遵循with是一種普遍接受的協議，除非另外不適用，因為它可以正確處理文件的關閉等操作。
使用wb而不是僅將w作為open的第二個參數。 這將允許您以寫二進制模式打開文件。 請參閱此作為參考。

希望這可以幫助。

Answer 3

恐怕您會在每次迭代時重新編寫輸出文件...您應該簡單地選擇這一行

w = csv.writer(open("output.csv", "w"))

跳出循環為：

w = csv.writer(open("output.csv", "w"))
try:
    for p in myCSV:
        try:
            ...
            w.writerow([pID, pName, pDate, pBlank, pCourse, pBlank, pBlank, pScore])
finally:
    w.close()

您還可以查看“帶有”結構，該結構可自動處理潛在的io錯誤，並在塊末尾關閉文件，而無需進行其他嘗試。

在python 2.7中-如何從csv讀取數據，重新格式化數據並寫入新的csv

問題描述

3 個解決方案

解決方案1
3 2014-04-28 21:35:16

解決方案2
1 已采納 2014-04-28 22:15:19

解決方案3
0 2014-04-28 21:44:42

在python 2.7中-如何從csv讀取數據，重新格式化數據並寫入新的csv

問題描述

3 個解決方案

解決方案1 3 2014-04-28 21:35:16

解決方案2 1 已采納 2014-04-28 22:15:19

解決方案3 0 2014-04-28 21:44:42

解決方案1
3 2014-04-28 21:35:16

解決方案2
1 已采納 2014-04-28 22:15:19

解決方案3
0 2014-04-28 21:44:42