查找列表中所有元素的百分位數

Question

例如，我有一個排序列表，

S = [0, 10.2, 345.9, ...]

如果 S 很大（500k+ 個元素），找到每個元素屬於哪個百分位的最佳方法是什么？

我的目標是存儲在看起來像這樣的數據庫表中：

Svalue | Percentile
-------------------
0      |     a
10.2.  |     b
345.9  |     c
...    |    ...

Answer 1

嘗試熊貓排名

import pandas as pd

df = pd.DataFrame()
df["Svalue"] = S
df["Percentile"] = df["Svalue"].rank(pct=True)

Answer 2

解決方案：

# Import and initialise pandas into session: 
import pandas as pd

# Store a scalar of the length of the list: list_length => list
list_length = len(S)

# Use a list comprehension to retrieve the indices of each element: idx => list
idx = [index for index, value in enumerate(S)]

# Divide each of the indices by the list_length scalar using a list 
# comprehension: percentile_rank => list
percentile_rank = [el / list_length for el in idx]

# Column bind separate lists into a single DataFrame in order to achieved desired format: df => pd.DataFrame
df = pd.DataFrame({"Svalue": S,  "Percentile": percentile_rank}) 

# Send the first 6 rows to console: stdout
df.head()

數據：

# Ensure list is sorted: S => list
S = sorted([0, 10.2, 345.9])

# Print the result: stdout
print(S)

查找列表中所有元素的百分位數

問題描述

2 個解決方案

解決方案1
3 2020-03-29 05:34:57

解決方案2
1 2020-03-29 05:16:09

查找列表中所有元素的百分位數

問題描述

2 個解決方案

解決方案1 3 2020-03-29 05:34:57

解決方案2 1 2020-03-29 05:16:09

解決方案1
3 2020-03-29 05:34:57

解決方案2
1 2020-03-29 05:16:09