簡體   English   中英

基於第一個值的列表索引超出范圍python2.7循環

[英]List index out of Range python2.7 looping based on first value

美好的一天可能有人請幫助我嘗試根據第一個值“token”循環遍歷 txt 文件,它當前僅在我指定 tockens 時才會循環,例如 {600001130260} 文件中有多個 tockens 都在第一行 [0 ] id 喜歡它來迭代每個標記/行並提取指定的信息。

數據文件如下所示

600001130260|005|||IN|2197|01||20160905210028
600001130260|100|005|00|VAT|VAT|VAT @ 14%|2,150.14
600001130260|100|013|00|TOT|CTOT|Total Due|86,578.93
600001130260|100|014|00|DD|DD|Due Date|2015/09/22|2015/10/15
600001130260|200|019|01||YDACON|Daily average consumption 79.325 kWh||
28002385859|000|||||LT|||T0IQ04960000000016400000000000000||
28002385859|100|005|00|CUR|CUR|Current Charges (Excl. VAT)|304.48
28002385859|100|006|00|VAT|VAT|VAT @ 14%|10.62
28002385859|100|013|00|TOT|CTOT|Total Due|26,451.75
28002385859|100|014|00|DD|DD|Due Date|2015/09/23
28002385859|150||23,149.02|1,686.37|1,233.57|382.79|0.00|26,451.75

這是我的代碼

file1 = open(r"C:\Users\isaac.gumbi\Documents\jhb\Full test file.txt", 'r')
file2 = ""
with file1 as f:
    for line in f:
        tokens = line.split("|")

        keys = {'600001130260','118002191517','CTOT', 'CUR', 'Due Date', 
               'VAT', '020', '030', '010', '040', 'STOT', '000', '005',
                '050', '0100', 'BBF', 'INT','CIN', 'CTOT', 'DD', 'YVLEVY', 
                'YRREM'}

        if len(tokens) and tokens[0] in keys and tokens[5] == 'CTOT':
            Total_due = ' '.join(tokens[7:8])
            if Total_due == '' : Total_Due = "null"
            print ("Total_due", Total_due)

這是我目前的 output

('Total_due', '86,578.93\n')
('Total_due', '79,191.18\n')

我希望它給我 output total_due 而無需我在 [0] 中指定令牌

您想要Total Due的所有值嗎? 如果是這樣,您可以簡單地執行以下操作:

sep = "|"
value_name = "Total Due"
result = []

with open("thefile.txt", 'r') as f:
    for line in f:
        tokens = line.split(sep)
        try:
            ind_total_due = tokens.index(value_name) + 1
            result.append((value_name, tokens[ind_total_due]))
        except ValueError:
            continue

結果將是:

[('Total Due', '86,578.93'),
 ('Total Due', '26,451.75'),
 ('Total Due', '3,483.28'),
 ('Total Due', '983.04'),
 ('Total Due', '- 197,358.33')]

第一個“令牌”似乎是一個唯一標識符。 如果您想要 csv 導出和多列支持,您可以使用 go 進行如下操作:

token_sep = "|"
csv_sep = ";"

# Lambda function that whill format total due
float_formater = lambda string : float(
    string.replace(" ", "").replace(",", "")
)

# Attributes you want to parse
col_names = (
    ("Total Due", float_formater, 1),
    ("Due Date", None, 1),
)

# Python dictionary which associate to each identifier, a total due
# and a due date
records = {}

with open("thefile.txt", 'r') as f:
    for line in f:

        tokens = line.strip().split(token_sep)

        # We assume the first token is an identifier
        unique_id = tokens[0]

        # For each new identifier we create a new record and initialize 
        # total due and due date to an empty string
        if unique_id and unique_id not in records:
            records[unique_id] = {col_name: "" for col_name, _ in col_names}

        # then we look for values we are interesting in. If we find one, we can
        # update one value of the record
        for col_name, formatter, index_val in col_names:
            try:
                ind_col = tokens.index(col_name) + index_val 
                value = tokens[ind_col]

                if formatter:
                    value = formatter(value)

                records[unique_id][col_name] = value


            except ValueError:
                continue

# For an easier csv export we reformat the record dict to a list of values
list_values = [
    (unique_id,) + tuple((values[col] for col, _ in col_names))
    for unique_id, values in records.items()
]

# We can then easily write all the records one by one
with open("mycsv.csv", "w") as f:
    f.write(csv_sep.join(["id"] + [c for c, _ in col_names]))
    for values in list_values:
        print(values)
        f.write("\n")
        f.write(csv_sep.join(map(str, values)))

mycsv.csv

id;Total Due;Due Date
112002209769;3483.28;2015/09/23
142002121343;-197358.33;
600001130260;86578.93;2015/09/22
28002385859;26451.75;2015/09/23
100002232416;983.04;2015/09/23

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM