从不同列的唯一值编制索引

Question

I have a dataframe with a bunch of columns and rows, and I want to get the data in one column based on the unique values in another column. 我有一个包含一列列和行的数据框，并且我想根据另一列中的唯一值来获取一列中的数据。

  flag  name
0  1     bob
1  2     larry
2  1     alice
3  1     mary
4  3     peter
5  4     rick

if a use 如果使用

df['flag'].unique()

I get 1 2 3 4 我得到1 2 3 4

How do I get the names that correspond to those unique values? 如何获得与这些唯一值相对应的名称？

ie 即

  flag  name
0  1     bob
1  2     larry
4  3     peter
5  4     rick

It doesn't matter if I get bob, alice, or mary. 我得到鲍勃，爱丽丝还是玛丽都没关系。 I just need a name for that flag value. 我只需要该标志值的名称即可。

Answer 1

By using drop_duplicates 通过使用drop_duplicates

df.drop_duplicates(['flag'])
Out[1036]: 
   flag   name
0     1    bob
1     2  larry
4     3  peter
5     4   rick

Answer 2

Wen's answer is simpler, but another way is to use groupby() and then take the first entry per group using nth() : Wen的答案比较简单，但是另一种方法是使用groupby() ，然后使用nth()每个组的第一个条目：

import pandas as pd

df = pd.DataFrame({'flag':[1, 2, 1, 1, 3, 4],
                   'name':['bob', 'larry', 'alice', 'mary', 'peter', 'rick']})

print df.groupby('flag').nth(0)

Result: 结果：

       name
flag       
1       bob
2     larry
3     peter
4      rick

从不同列的唯一值编制索引

问题描述

2 个解决方案

解决方案1
2 已采纳 2017-11-03 21:43:50

解决方案2
0 2017-11-03 21:55:22

从不同列的唯一值编制索引

问题描述

2 个解决方案

解决方案1 2 已采纳 2017-11-03 21:43:50

解决方案2 0 2017-11-03 21:55:22

解决方案1
2 已采纳 2017-11-03 21:43:50

解决方案2
0 2017-11-03 21:55:22