Add a column that can separate the numeric values into low, median, and high segments?
pd.qcut does almost exactly what you need:
In [37]: number["quantile"] = pd.qcut(number["Number to segement"], [0, .25, .75, 1], labels=["<25%", "25-75%", ">75%"])
In [38]: number
Out[38]:
    Number to segement  quantile
0                   53    25-75%
1                   85    25-75%
2                   95      >75%
3                   96      >75%
4                   60    25-75%
5                   37    25-75%
6                   32      <25%
7                   42    25-75%
8                   50    25-75%
9                   32      <25%
10                  30      <25%
11                  51    25-75%
12                  88      >75%
13                  76    25-75%
14                  10      <25%
15                  51    25-75%
16                  97      >75%
17                   1      <25%
18                  23      <25%
19                  87      >75%
If you omit the labels argument, pd.qcut returns the intervals themselves, so you can see the cutoff values if you want them.
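As a minimal sketch of inspecting the cutoffs, the snippet below rebuilds the frame from the values printed in the output above (the construction of `number` is an assumption, since the original post does not show it) and calls pd.qcut without labels. It also passes `retbins=True`, which makes pd.qcut return the bin edges as a separate array.

```python
import pandas as pd

# Rebuilt from the printed output above; the original construction is not shown.
number = pd.DataFrame({
    "Number to segement": [53, 85, 95, 96, 60, 37, 32, 42, 50, 32,
                           30, 51, 88, 76, 10, 51, 97, 1, 23, 87]
})

# Without labels, each row is assigned the interval it falls into.
intervals = pd.qcut(number["Number to segement"], [0, .25, .75, 1])

# retbins=True additionally returns the bin edges (the cutoff values).
_, edges = pd.qcut(number["Number to segement"], [0, .25, .75, 1], retbins=True)

print(intervals.value_counts(sort=False))
print(edges)
```

For this data the 25% and 75% cutoffs come out at 32 and 85.5, which is why 85 lands in the middle segment and 87 in the top one.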