如何按字母顺序排列 Python 中的文件？

Question

我正在尝试获取按姓氏字母顺序排列的总统列表，即使它正在绘制的文件当前列出了名字、姓氏、就职日期和离任日期。

这就是我所拥有的，任何关于我需要做什么的帮助。 我搜索了一些答案，其中大多数超出了我的理解水平。 我觉得我错过了一些小东西。 我试图将它们全部分解成一个列表，然后对它们进行排序，但我无法让它工作，所以这就是我开始的地方。

INPUT_FILE = 'presidents.txt'
OUTPUT_FILE = 'president_NEW.txt'
OUTPUT_FILE2 = 'president_NEW2.txt'

def main():
  infile = open(INPUT_FILE)
  outfile = open(OUTPUT_FILE, 'w')
  outfile2 = open(OUTPUT_FILE2,'w')

  stuff = infile.readline()

  while stuff:
    stuff = stuff.rstrip()
    data = stuff.split('\t')

    president_First = data[1]
    president_Last = data[0]
    start_date = data[2]
    end_date = data[3]

    sentence = '%s %s was president from %s to %s' % \
              (president_First,president_Last,start_date,end_date)
    sentence2 = '%s %s was president from %s to %s' % \
               (president_Last,president_First,start_date, end_date)

    outfile2.write(sentence2+ '\n')
    outfile.write(sentence + '\n')

    stuff = infile.readline()

  infile.close()
  outfile.close()

main()

Answer 1

你应该做的是将总统放在一个列表中，对该列表进行排序，然后打印出结果列表。

在你的 for 循环之前添加：

presidents = []

提取名称/日期后，将此代码放在 for 循环中

president = (last_name, first_name, start_date, end_date)
presidents.append(president)

在 for 循环之后

presidents.sort() # because we put last_name first above
# it will sort by last_name

然后打印出来：

for president in presidents
    last_name, first_name, start_date, end_date = president
    string1 = "..."

听起来您试图将它们分解成一个列表。 如果您对此有疑问，请向我们展示该尝试产生的代码。 这是解决问题的正确方法。

其他的建议：

只有几点可以让您的代码更简单。 随意忽略或根据需要使用它：

president_First=data[1]
president_Last= data[0]
start_date=data[2]
end_date=data[3]

可以写成：

president_Last, president_First, start_date, end_date = data


stuff=infile.readline()

和

while stuff:
    stuff=stuff.rstrip()
    data=stuff.split('\t')
    ...
    stuff = infile.readline()

可以写成：

 for stuff in infile:
     ...

Answer 2

#!/usr/bin/env python

# this sounds like a homework problem, but ...

from __future__ import with_statement # not necessary on newer versions

def main():
    # input
    with open('presidents.txt', 'r') as fi:
        # read and parse
        presidents = [[x.strip() for x in line.split(',')] for line in fi]
        # sort
        presidents = sorted(presidents, cmp=lambda x, y: cmp(x[1], y[1]))
    # output
    with open('presidents_out.txt', 'w') as fo:
        for pres in presidents:
            print >> fo, "president %s %s was president %s %s" % tuple(pres)

if __name__ == '__main__':
    main()

Answer 3

我试图将它们全部分解成一个列表，然后对它们进行排序

你说的“他们”是什么意思？

将行分解为项目列表是一个好的开始：这意味着您将数据视为一组值（其中一个是姓氏），而不仅仅是一个字符串。 但是，仅对该列表进行排序是没有用的； Python 将从行中取出 4 个字符串（名字、姓氏等）并将它们按顺序排列。

您想要做的是拥有这些列表的列表，并按姓氏对其进行排序。

Python 的列表提供了一种sort方法来对它们进行排序。 当您将其应用于总统信息列表列表时，它会对这些列表进行排序。 但是列表的默认排序将逐项比较它们（首先是第一项，如果第一项相等，则为第二项，等等）。 您想按姓氏进行比较，这是子列表中的第二个元素。 （即元素 1；记住，我们从 0 开始计算列表元素。）

幸运的是，很容易给 Python 更具体的排序指令。 我们可以将排序 function 传递给一个key参数，它是一个 function，它将项目“翻译”成我们想要对它们进行排序的值。 是的，在 Python 中，所有内容都是 object -包括函数- 因此将 function 作为参数传递是没有问题的。 因此，我们要“按姓氏”排序，因此我们将传递一个 function ，它接受总统信息列表并返回姓氏（即元素[1] ）。

幸好这个是Python，而且“含电池”； 我们甚至不必自己写 function。 我们得到了一个神奇的工具，它可以创建返回序列的第 n 个元素的函数（这就是我们想要的）。 它被称为itemgetter （因为它生成了一个 function 来获取序列的第 n 个项目 - “项目”是更常见的 Python 术语；“元素”是一个更通用的 CS 术语），它存在于operator模块中。

顺便说一句，还有更简洁的方法来处理文件打开/关闭，我们不需要编写显式循环来处理文件读取 - 我们可以直接遍历文件（ for line in file:给我们文件的行依次循环，每次循环一个），这意味着我们可以只使用list comprehension （查找它们）。

import operator
def main():
  # We'll set up 'infile' to refer to the opened input file, making sure it is automatically
  # closed once we're done with it. We do that with a 'with' block; we're "done with the file"
  # at the end of the block.
  with open(INPUT_FILE) as infile:
    # We want the splitted, rstripped line for each line in the infile, which is spelled:
    data = [line.rstrip().split('\t') for line in infile]

  # Now we re-arrange that data. We want to sort the data, using an item-getter for
  # item 1 (the last name) as the sort-key. That is spelled:
  data.sort(key=operator.itemgetter(1))

  with open(OUTPUT_FILE) as outfile:
    # Let's say we want to write the formatted string for each line in the data.
    # Now we're taking action instead of calculating a result, so we don't want
    # a list comprehension any more - so we iterate over the items of the sorted data:
    for item in data:
      # The item already contains all the values we want to interpolate into the string,
      # in the right order; so we can pass it directly as our set of values to interpolate:
      outfile.write('%s %s was president from %s to %s' % item)

Answer 4

我确实在上面的 Karls 帮助下得到了这个工作，尽管由于我遇到了一些错误，我确实必须编辑代码才能让它为我工作。 我消除了这些并最终得到了这个。

import operator

INPUT_FILE = 'presidents.txt'

OUTPUT_FILE2= 'president_NEW2.txt'

def main():

with open(INPUT_FILE) as infile:
    data = [line.rstrip().split('\t') for line in infile]

data.sort(key=operator.itemgetter(0))

outfile=open(OUTPUT_FILE2,'w')   

for item in data:
    last=item[0]
    first=item[1]
    start=item[2]
    end=item[3]

    outfile.write('%s %s was president from %s to %s\n' % (last,first,start,end))

主要的（）

如何按字母顺序排列 Python 中的文件？

问题描述

4 个解决方案

解决方案1
2 2011-08-12 04:50:31

解决方案2
0 2011-08-12 05:41:54

解决方案3
0 2011-08-12 05:47:54

解决方案4
0 2011-08-12 23:49:35

如何按字母顺序排列 Python 中的文件？

问题描述

4 个解决方案

解决方案1 2 2011-08-12 04:50:31

解决方案2 0 2011-08-12 05:41:54

解决方案3 0 2011-08-12 05:47:54

解决方案4 0 2011-08-12 23:49:35

解决方案1
2 2011-08-12 04:50:31

解决方案2
0 2011-08-12 05:41:54

解决方案3
0 2011-08-12 05:47:54

解决方案4
0 2011-08-12 23:49:35