熊猫数据框中行中的唯一文本

Question

I have a text file with text and numerical data in the format shown in the following picture:我有一个包含文本和数字数据的文本文件，格式如下图所示：

I am importing this file using pandas using the following command:我使用以下命令使用pandas导入此文件：

 df = pd.read_csv('dum.txt',sep='\t', header=[0,1], index_col=0)

In this file, I want to find the unique texts in the row called Tag ( ['Tag1', 'Tag1', 'Tag1', Tag1, 'Tag5'] ) as a python list.在这个文件中，我想在名为Tag ( ['Tag1', 'Tag1', 'Tag1', Tag1, 'Tag5'] ) 的行中找到作为 python 列表的唯一文本。 How can I do it?我该怎么做？

When I use df.columns , I get this:当我使用df.columns ，我得到了这个：

>>> df.columns
MultiIndex(levels=[[u'T1', u'T2', u'T3', u'T4', u'T5'], 
   [u'Tag1', u'Tag5']], labels=[[0, 1, 2, 3, 4], [0, 0, 
   0, 0, 1]], names=[u'Type', u'Tag'])

In the aforesaid example, how can I get the unique texts in the row called Tag ?在上述示例中，如何获取名为Tag的行中的唯一文本？ Thanks.谢谢。

Answer 1

用tolist做levels

df.columns.levels[1].tolist()

熊猫数据框中行中的唯一文本

问题描述

1 个解决方案

解决方案1
2 已采纳 2018-09-30 04:12:06

熊猫数据框中行中的唯一文本

问题描述

1 个解决方案

解决方案1 2 已采纳 2018-09-30 04:12:06

解决方案1
2 已采纳 2018-09-30 04:12:06