繁体 English 中英

Pandas基于两个现有变量创建一个新变量

[英]Pandas creating a new variable based on two existing variables

原文 2018-06-14 06:43:03 0 2 python/ pandas

我认为以下代码效率很低。 有没有更好的方法在熊猫中进行这种类型的常见重新编码？

df['F'] = 0
df['F'][(df['B'] >=3) & (df['C'] >=4.35)] = 1
df['F'][(df['B'] >=3) & (df['C'] < 4.35)] = 2
df['F'][(df['B'] < 3) & (df['C'] >=4.35)] = 3
df['F'][(df['B'] < 3) & (df['C'] < 4.35)] = 4

2 个解决方案

使用numpy.select并将布尔掩码缓存到变量以获得更好的性能：

m1 = df['B'] >= 3
m2 = df['C'] >= 4.35
m3 = df['C'] < 4.35
m4 = df['B'] < 3

df['F'] = np.select([m1 & m2, m1 & m3, m4 & m2, m4 & m3], [1,2,3,4], default=0)

在您的特定情况下，您可以利用布尔实际上是整数（False == 0，True == 1）并使用简单算术的事实：

df['F'] = 1 + (df['C'] < 4.35) + 2 * (df['B'] < 3)

请注意，这将忽略B和C列中的任何NaN，这些将被指定为高于您的限制。

基于两列创建新变量作为索引一列作为新变量名称python pandas或R.

[英]Creating new variables based on two columns as index one column as new variable names python pandas or R

大熊猫：在循环中创建现有变量的滞后变量

[英]pandas: creating lagged variables of existing variable in a loop

基于 pandas 中的现有列创建新列

[英]Creating new column based on existing column in pandas

根据 if 和现有列在 pandas 中创建新列

[英]creating new column in pandas based on if and existing column

根据不同索引的值减去两个变量，在 Pandas 中创建新列

[英]Creating New Columns in Pandas based on subtracting two variables based on value from different indexes

Python pandas 和 numpy：根据现有变量的多个条件为新变量分配数值

[英]Python pandas and numpy: assign numerical values to new variable based on multiple conditions for existing variables

基于两个参数创建新的 Pandas DataFrame

[英]Creating new Pandas DataFrame based on two parameters

Python / Pandas - 基于几个变量和if / elif / else函数创建新变量

[英]Python/Pandas - creating new variable based on several variables and if/elif/else function

根据其他两个变量的值创建一个变量

[英]Creating a variable based on the values of two other variables

无法根据现有变量创建新变量

[英]Trouble creating new variable based off of existing variable

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 基于两列创建新变量作为索引一列作为新变量名称python pandas或R. 大熊猫：在循环中创建现有变量的滞后变量基于 pandas 中的现有列创建新列根据 if 和现有列在 pandas 中创建新列根据不同索引的值减去两个变量，在 Pandas 中创建新列 Python pandas 和 numpy：根据现有变量的多个条件为新变量分配数值基于两个参数创建新的 Pandas DataFrame Python / Pandas - 基于几个变量和if / elif / else函数创建新变量根据其他两个变量的值创建一个变量无法根据现有变量创建新变量

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM