使用子串系列的DataFrame的pandas groupby

Question

我有一個熊貓DataFrame，我想按其中一列的子字符串進行分組。 子字符串在另一個pandas系列（或列表）中給出。 我已經嘗試了很多方法，但是我根本無法使其正常工作。

我有這個：

tst = pd.DataFrame({'id': [0, 11, 222, 3333, 44444],
                    'bla': ['ab', 'ba', 'ca', 'bc', 'db']})
test = pd.Series(['a', 'b', 'c', 'd'])

我想組tst根據'A'， 'B'， 'C'， 'd'（從是否test ）被包含在tst['bla']

Answer 1

df.apply（）最好在這里。

import pandas as pd
def funcx(x, test_str):
    return test_str in x['bla']


tst = pd.DataFrame({'id': [0, 11, 222, 3333, 44444],
                'bla': ['ab', 'ba', 'ca', 'bc', 'db']})
test = pd.Series(['a', 'b', 'c', 'd'])
result = {}
for xstring in test:
    result[xstring] = tst.apply(funcx, args=( xstring), axis=1)

print result

給我們;

{'a': 0     True
1     True
2     True
3    False
4    False
dtype: bool, 'c': 0    False
1    False
2     True
3     True
4    False
dtype: bool, 'b': 0     True
1     True
2    False
3     True
4     True
dtype: bool, 'd': 0    False
1    False
2    False
3    False
4     True
dtype: bool}

然后可以用來選擇相關的行；

>>print tst[result['a']]
  bla   id
  0  ab    0
  1  ba   11
  2  ca  222

使用子串系列的DataFrame的pandas groupby

問題描述

1 個解決方案

解決方案1
0 已采納 2015-08-31 15:56:14

使用子串系列的DataFrame的pandas groupby

問題描述

1 個解決方案

解決方案1 0 已采納 2015-08-31 15:56:14

解決方案1
0 已采納 2015-08-31 15:56:14