如何删除csv文件中的烦人数据

Question

i want to delete some string("Description" "This is a simulation") in my csv file, and also i want to delete some"=" in the data and ", " at the end of the data. 我想在我的csv文件中删除一些字符串（“ Description”“ This is a Simulation”），并且我也想删除数据中的一些“ =”和数据末尾的“，”。 the file looks like the following 该文件如下所示

"time","student","items"

="09:00:00","Tim","apple",

="09:00:10","Jason","orange",

"09:10:10","Emily","grape",

"09:22:10","Ivy","kiwi",

"Description"

"This is a simulation"

i have tried .pop(). 我尝试过.pop（）。 it didn't work 它没有用

ff= []

import csv

with open('file.csv') as f:

    for row in csv.DictReader(f):

        row.replace(',','')

        ff.append(row)

i want to get like this: 我想得到这样的：

"time","student","items"

"09:00:00","Tim","apple"

"09:00:10","Jason","orange"

"09:10:10","Emily","grape"

"09:22:10","Ivy","kiwi"

Answer 1

You probably want to read the file as raw text file rather than csv so that it will be easier for you to perform string manipulation with it. 您可能希望将文件读取为原始文本文件而不是csv，以便您更轻松地执行字符串操作。

Edit: I assume that tmp is the path to the CSV file and the <list data> is a list of dictionary generated by csv.DictReader . 编辑：我假设tmp是CSV文件的路径，而<list data>是由csv.DictReader生成的字典列表。 Then you can write the convert(tmp) by performing 2 main steps. 然后，您可以通过执行2个主要步骤来编写convert(tmp) 。 One is to reformatted the file and it to a temporary file and the other is to read the temporary file into a list of dictionary data using csv.DictReader . 一种是将文件重新格式化为临时文件，另一种是使用csv.DictReader将临时文件读入字典数据列表。 After you're done reading the data, the temporary file will be deleted using the os module: 读取完数据后，将使用os模块删除临时文件：

import csv
import os

def convert(tmp):
    new_lines = []
    temp_file = tmp + '.tmp'
    with open(tmp) as fd:
        for line in fd:
            # remove new line characters
            line = line.replace('\n', '').replace('\r', '')

            # delete string
            line = line.replace('=', '').replace('"Description"', '').replace('"This is a simulation"', '')

            # don't add empty string
            if line.strip() == '':
                continue

            # remove last line commas
            if line[-1] == ',':
                line = line[:-1]

            new_lines.append(line)

    # write formatted data to temporary csv file
    with open(temp_file, 'w') as fd:
        fd.write('\n'.join(new_lines))

    # get list data
    ff = None
    with open(temp_file) as f:
        ff = list(csv.DictReader(f))

    # delete temporary file
    os.remove(temp_file)

    return ff

print convert('./file.csv')

Answer 2

Mostly leveraging built-in str methods, with an assumption that the first row is always a valid header row. 大多数情况下利用内置的str方法，并假设第一行始终是有效的标题行。

ff = []

with open('file.csv') as f:

    for row in f:
        # strip empty lines, and head/tail = ,
        line = row.strip().strip('=').strip(',')

        # skip empty lines
        if not line:
            continue

        # assume first row is always a valid header row
        # split by comma to see if it matches header row
        if not len(ff) or (len(line.split(',')) == len(ff[0].split(','))):
            ff.append(line)

如何删除csv文件中的烦人数据

问题描述

2 个解决方案

解决方案1
1 已采纳 2019-05-14 14:41:02

解决方案2
0 2019-05-14 14:51:05

如何删除csv文件中的烦人数据

问题描述

2 个解决方案

解决方案1 1 已采纳 2019-05-14 14:41:02

解决方案2 0 2019-05-14 14:51:05

解决方案1
1 已采纳 2019-05-14 14:41:02

解决方案2
0 2019-05-14 14:51:05