[英]Python Sorting Text File From Highest To Lowest Based On Column Values
我有一個非常大的文本文件,其中包含如下幾行數據:
('#DownWithAssad', '1')
('#DownYoTLParty', '1')
('#Download', '8')
('#Download:', '2')
('#Downloads', '2')
('#DownstairsMixtape', '1')
('#DowntonAbbey', '12')
('#DowntonAbbey?', '1')
('#DowntonPBS', '23')
('#Downtonabbey', '1')
('#DowntownAbbey', '1')
這似乎是一個簡單的問題,但是我想將數據從最高到最低進行軟化,因此它看起來像:
('#DowntonPBS', '23')
('#DowntonAbbey', '12')
('#Download', '8')
('#Download:', '2')
('#Downloads', '2')
('#DownstairsMixtape', '1')
('#DownWithAssad', '1')
('#DownYoTLParty', '1')
('#DowntonAbbey?', '1')
('#Downtonabbey', '1')
('#DowntownAbbey', '1')
我認為可以消除方括號()並使用以下方法拆分數據:
import sys
f = open(sys.argv[1])
for line in f:
line = str(line)[1 : -1]
for sect in line.split(','):
print sect
但是我不確定從這里去哪里。
您可以使用ast.literal_eval
輕松解析文本文件:
with open(datafile) as f:
file_sorted = sorted((ast.literal_eval(x) for x in f),
key=lambda z:(int(z[1]),z[0]),
reverse=True)
這個怎么運作:
(ast.literal_eval(x) for x in f) #turn each line in your file into a tuple
key=lambda z:(int(z[1]),z[0]) #function to determine how things are sorted. Basically
#sort as tuples: `( int(z[1]),z[0] )`
reverse=True #descending order instead of ascending
這就是您要嘗試做的事情。 請注意,以這種方式解析行非常脆弱(格式錯誤的行可能會破壞它)
from operator import itemgetter
import sys
result=[]
with open(sys.argv[1]) as f:
for line in f:
line = str(line.strip())[1: -1]
sect1, sect2 = line.split(', ')
sect1 = sect1[1: -1]
sect2 = int(sect2[1: -1])
result.append((sect1, sect2))
for line in sorted(result, key=itemgetter(1), reverse=True):
print line
解析它的更好方法是使用literal_eval
或正則表達式。 您是否知道在字符串中使用引號或逗號時是否有特殊處理?
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.