簡體 English 中英

Pandas基於兩個現有變量創建一個新變量

[英]Pandas creating a new variable based on two existing variables

原文 2018-06-14 06:43:03 7 2 python/ pandas

我認為以下代碼效率很低。 有沒有更好的方法在熊貓中進行這種類型的常見重新編碼？

df['F'] = 0
df['F'][(df['B'] >=3) & (df['C'] >=4.35)] = 1
df['F'][(df['B'] >=3) & (df['C'] < 4.35)] = 2
df['F'][(df['B'] < 3) & (df['C'] >=4.35)] = 3
df['F'][(df['B'] < 3) & (df['C'] < 4.35)] = 4

2 個解決方案

使用numpy.select並將布爾掩碼緩存到變量以獲得更好的性能：

m1 = df['B'] >= 3
m2 = df['C'] >= 4.35
m3 = df['C'] < 4.35
m4 = df['B'] < 3

df['F'] = np.select([m1 & m2, m1 & m3, m4 & m2, m4 & m3], [1,2,3,4], default=0)

在您的特定情況下，您可以利用布爾實際上是整數（False == 0，True == 1）並使用簡單算術的事實：

df['F'] = 1 + (df['C'] < 4.35) + 2 * (df['B'] < 3)

請注意，這將忽略B和C列中的任何NaN，這些將被指定為高於您的限制。

基於兩列創建新變量作為索引一列作為新變量名稱python pandas或R.

[英]Creating new variables based on two columns as index one column as new variable names python pandas or R

大熊貓：在循環中創建現有變量的滯后變量

[英]pandas: creating lagged variables of existing variable in a loop

基於 pandas 中的現有列創建新列

[英]Creating new column based on existing column in pandas

根據 if 和現有列在 pandas 中創建新列

[英]creating new column in pandas based on if and existing column

根據不同索引的值減去兩個變量，在 Pandas 中創建新列

[英]Creating New Columns in Pandas based on subtracting two variables based on value from different indexes

Python pandas 和 numpy：根據現有變量的多個條件為新變量分配數值

[英]Python pandas and numpy: assign numerical values to new variable based on multiple conditions for existing variables

基於兩個參數創建新的 Pandas DataFrame

[英]Creating new Pandas DataFrame based on two parameters

Python / Pandas - 基於幾個變量和if / elif / else函數創建新變量

[英]Python/Pandas - creating new variable based on several variables and if/elif/else function

根據其他兩個變量的值創建一個變量

[英]Creating a variable based on the values of two other variables

無法根據現有變量創建新變量

[英]Trouble creating new variable based off of existing variable

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 基於兩列創建新變量作為索引一列作為新變量名稱python pandas或R. 大熊貓：在循環中創建現有變量的滯后變量基於 pandas 中的現有列創建新列根據 if 和現有列在 pandas 中創建新列根據不同索引的值減去兩個變量，在 Pandas 中創建新列 Python pandas 和 numpy：根據現有變量的多個條件為新變量分配數值基於兩個參數創建新的 Pandas DataFrame Python / Pandas - 基於幾個變量和if / elif / else函數創建新變量根據其他兩個變量的值創建一個變量無法根據現有變量創建新變量

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM