在熊貓數據框列中的嵌套列表中轉換和求和元素

Question

我有一個這樣的 df 列：

col1
[[0.73, 0.43, 0.5, 0.0], [0.39, 0.5], [0.37], [0.38, 0.51, 0.0, 0.2]]
[[0.53, 0.33, 0.2, 0.0], [0.79, 0.5], [0.96], [0.88, 0.21, 0.0, 0.0]]

子列表可以是任意大小。 我正在嘗試將子列表中的數字轉換為浮點數（它們是字符串），然后創建一個對每個子列表求和的列，然后除以子列表中的項目數

第 1 行的總和：

(.73 + .43 + .5 + 0) / 4 =.415
(.39 + .5) / 2 = .445
(.37) / 1 = .37
(.38 + .51 + 0.0 + .2) / 4 = .272

對於第 2 行：

(.53 + .33 + .2 + 0) / 4 = .265
(.79 + .5) / 2 = .645
(.96) / 1 = .96
(.88 + .21 + 0.0 + 0.0) / 4 = .272

結果：

new_col
[[.415],[.445],[.37],[.272]]
[[.265],[.645],[.96],[.272]]

我嘗試了很多東西：

#something like this where it creates a column of the number of elements in each sublist and then uses that to divide the sum of each number

# this didn't work - just grabbed the first lists size
df1['words_in_company_name'] = df1['children_org_name_sublists'].str.len()

#this doesn't really work - i mean it shows the numbers per list, just not sure where to go from here
for i in df1.func_scores:
    length = []
    for j in i:
        print(j)

一種

Answer 1

只需apply np.mean

df['new_col'] = df.col.apply(lambda x : [[np.mean(y)] for y in x ])
df
Out[17]: 
                                                 col                               new_col
0  [[0.73, 0.43, 0.5, 0.0], [0.39, 0.5], [0.37], ...  [[0.415], [0.445], [0.37], [0.2725]]
1  [[0.53, 0.33, 0.2, 0.0], [0.79, 0.5], [0.96], ...  [[0.265], [0.645], [0.96], [0.2725]]

在熊貓數據框列中的嵌套列表中轉換和求和元素

問題描述

1 個解決方案

解決方案1
3 已采納 2020-09-17 00:05:22

在熊貓數據框列中的嵌套列表中轉換和求和元素

問題描述

1 個解決方案

解決方案1 3 已采納 2020-09-17 00:05:22

解決方案1
3 已采納 2020-09-17 00:05:22