帮助Python中的if else循环

Question

嗨，这是我的问题。 我有一个计算列中数据平均值的程序。 例

Bob
1
2
3

输出是

Bob
2

有些数据对乔来说是“ na”。

Joe
NA
NA
NA

我希望此输出为NA

所以我写了一个if else循环

问题在于它不执行循环的第二部分，而只是打印出一个NA。 有什么建议么？

这是我的程序：

with open('C://achip.txt', "rtU") as f:
    columns = f.readline().strip().split(" ")
    numRows = 0
    sums = [0] * len(columns)

    numRowsPerColumn = [0] * len(columns) # this figures out the number of columns

    for line in f:
        # Skip empty lines since I was getting that error before
        if not line.strip():
            continue

        values = line.split(" ")
        for i in xrange(len(values)):
            try: # this is the whole strings to math numbers things
                sums[i] += float(values[i])
                numRowsPerColumn[i] += 1
            except ValueError:
                continue 

    with open('c://chipdone.txt', 'w') as ouf:
        for i in xrange(len(columns)):
           if numRowsPerColumn[i] ==0 :
               print 'NA' 
           else:
               print>>ouf, columns[i], sums[i] / numRowsPerColumn[i] # this is the average calculator

该文件如下所示：

Joe Bob Sam
1 2 NA
2 4 NA
3 NA NA
1 1  NA

最后的输出是名称和平均值

Joe Bob Sam 
1.5 1.5 NA

好吧，我尝试了罗杰的建议，现在我遇到了这个错误：

追溯（最近一次呼叫最近）：文件“ C：/avy14.py”，第5行，在f中的行：ValueError：对关闭文件的I / O操作

这是新代码：

使用open（'C：//achip.txt'，“ rtU”）作为f：列= f.readline（）。strip（）。split（“”）sums = [0] * len（columns）行= 0对于f中的行：line = line.strip（）如果不是line：继续

col +的行+ = 1，enumerate（line.split（））中的v：如果sums [col]不是None：如果v ==“ NA”：sums [col] =其他：sums [col] + = int （v）

使用open（“ c：/chipdone.txt”，“ w”）作为out：对于名称，zip中的总和（列，总和）：print >> out，名称，如果总和为None：print >> out，“ NA “ else：打印>> out，求和/行

Answer 1

with open("c:/achip.txt", "rU") as f:
  columns = f.readline().strip().split()
  sums = [0.0] * len(columns)
  row_counts = [0] * len(columns)

  for line in f:
    line = line.strip()
    if not line:
      continue

    for col, v in enumerate(line.split()):
      if v != "NA":
        sums[col] += int(v)
        row_counts[col] += 1

with open("c:/chipdone.txt", "w") as out:
  for name, sum, rows in zip(columns, sums, row_counts):
    print >>out, name,
    if rows == 0:
      print >>out, "NA"
    else:
      print >>out, sum / rows

获取列名称时，我也会使用split的无参数版本（它允许您使用多个空格分隔符）。

关于您的编辑以包括输入/输出样本，我保留了原始格式，输出为：

Joe 1.75
Bob 2.33333333333
Sam NA

此格式为（ColumnName，Avg）列的3行，但是您可以根据需要更改输出。 :)

Answer 2

使用numpy：

import numpy as np

with open('achip.txt') as f:
    names=f.readline().split()
    arr=np.genfromtxt(f)

print(arr)
# [[  1.   2.  NaN]
#  [  2.   4.  NaN]
#  [  3.  NaN  NaN]
#  [  1.   1.  NaN]]

print(names)
# ['Joe', 'Bob', 'Sam']

print(np.ma.mean(np.ma.masked_invalid(arr),axis=0))
# [1.75 2.33333333333 --]

Answer 3

使用您的原始代码，我将添加一个循环并编辑打印语句

    with open(r'C:\achip.txt', "rtU") as f:
    columns = f.readline().strip().split(" ")
    numRows = 0
    sums = [0] * len(columns)

    numRowsPerColumn = [0] * len(columns) # this figures out the number of columns

    for line in f:
        # Skip empty lines since I was getting that error before
        if not line.strip():
            continue

        values = line.split(" ")

        ### This removes any '' elements caused by having two spaces like
        ### in the last line of your example chip file above
        for count, v in enumerate(values):      
            if v == '':     
                values.pop(count)
        ### (End of Addition)

        for i in xrange(len(values)):
            try: # this is the whole strings to math numbers things
                sums[i] += float(values[i])
                numRowsPerColumn[i] += 1
            except ValueError:
                continue 

    with open('c://chipdone.txt', 'w') as ouf:
        for i in xrange(len(columns)):
           if numRowsPerColumn[i] ==0 :
               print>>ouf, columns[i], 'NA' #Just add the extra parts
           else:
               print>>ouf, columns[i], sums[i] / numRowsPerColumn[i]

此解决方案还以Roger格式而不是您想要的格式提供了相同的结果。

Answer 4

下面的解决方案更干净，代码行更少...

import pandas as pd

# read the file into a DataFrame using read_csv
df = pd.read_csv('C://achip.txt', sep="\s+")

# compute the average of each column
avg = df.mean()

# save computed average to output file
avg.to_csv("c:/chipdone.txt")

它们是实现此解决方案简单性的关键，是将输入文本文件读入数据框的方式。 熊猫read_csv允许您使用正则表达式指定sep / delimiter参数。 在这种情况下，我们使用“ \\ s +”正则表达式模式来确保列之间具有一个或多个空格。

一旦数据在数据框中，就可以使用简单的熊猫函数来计算平均值并将其保存到文件中。

帮助Python中的if else循环

问题描述

4 个解决方案

解决方案1
1 已采纳

解决方案2
0 2010-09-24 15:28:54

解决方案3
0 2010-09-24 16:22:21

解决方案4
0 2019-01-02 13:02:58

帮助Python中的if else循环

问题描述

4 个解决方案

解决方案1 1 已采纳

解决方案2 0 2010-09-24 15:28:54

解决方案3 0 2010-09-24 16:22:21

解决方案4 0 2019-01-02 13:02:58

解决方案1
1 已采纳

解决方案2
0 2010-09-24 15:28:54

解决方案3
0 2010-09-24 16:22:21

解决方案4
0 2019-01-02 13:02:58