Creating a new df with a for loop in Pandas

Question

Not sure if I am doing this right. This is my first post here, so please be gentle :)

See the picture below.

[Image: screenshot from my Jupyter Notebook]

What I am trying to do is create a new dataframe from the df_Grundinladdning['Datan'] column that includes only the rows containing the string "#TRANS".

Answer 1

Here's a way to do that:

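import pandas as pd

# Sample data standing in for the original dataframe
df = pd.DataFrame({"Datan": ["x", "TRANS y", "z", "TRANS u", "v", "TRANS w"]})
print(df)

# Keep only the rows whose "Datan" value contains "TRANS"
new_df = df[df.Datan.str.contains("TRANS")]
print(new_df)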

Results:

(original dataframe)
     Datan
0        x
1  TRANS y
2        z
3  TRANS u
4        v
5  TRANS w

(new dataframe)
     Datan
1  TRANS y
3  TRANS u
5  TRANS w
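
If the column can contain missing values, or the search text should be treated as a plain substring rather than a regular expression, the same filter can be made a little more defensive. This is a minimal sketch with made-up sample data (not the asker's real data): `na=False` counts missing entries as non-matches, and `regex=False` matches "#TRANS" literally.

```python
import pandas as pd

# Hypothetical sample data, including one missing value
df = pd.DataFrame({"Datan": ["x", "#TRANS y", None, "#TRANS u", "v"]})

# na=False: rows with missing values are treated as non-matches
# regex=False: "#TRANS" is matched as a literal substring
mask = df["Datan"].str.contains("#TRANS", na=False, regex=False)

# Boolean indexing keeps only the rows where the mask is True
new_df = df[mask]
print(new_df)
```

Without `na=False`, a missing value would produce a missing entry in the mask, which boolean indexing cannot use directly.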

Answer 2

The right method is described here. The loop, even if it had no syntax errors, would be very slow.
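
To make the comparison concrete, here is a rough sketch of a working loop next to the boolean-mask version. The sample data and the variable names (`matching_labels`, `loop_df`, `mask_df`) are assumptions for illustration, and the loop is not the asker's original code, which is only visible in the screenshot.

```python
import pandas as pd

# Hypothetical sample data
df = pd.DataFrame({"Datan": ["x", "#TRANS 1", "z", "#TRANS 2"]})

# Loop version: check each value in Python and collect the matching row labels
matching_labels = []
for label, value in df["Datan"].items():
    if "#TRANS" in value:
        matching_labels.append(label)
loop_df = df.loc[matching_labels]

# Vectorized version: build one boolean mask over the whole column
mask_df = df[df["Datan"].str.contains("#TRANS", regex=False)]

print(loop_df)
print(mask_df)
```

Both versions select the same rows here; the loop does its work one Python-level iteration at a time, which is why it tends to be much slower on large dataframes.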

Answer 3

You don't need to loop over the dataframe; you can get the resulting dataframe easily with this:

df_transOnly = df_Grundinladdning[df_Grundinladdning["Datan"].str.contains("#TRANS")]
df_transOnly  # display the dataframe

You will then get the dataframe you need, like this:

      Datan
5     #TRANS232
12    #TRANS455
20    #TRANS3144
104   #TRANS1234
500   #TRANS213
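
The filtered result keeps the original row labels (5, 12, 20, ...). If a standalone dataframe numbered from 0 is wanted, one possible follow-up is sketched below. The contents of `df_Grundinladdning` here are invented stand-in values, since the real data is only visible in the screenshot.

```python
import pandas as pd

# Hypothetical stand-in for df_Grundinladdning
df_Grundinladdning = pd.DataFrame(
    {"Datan": ["header", "#TRANS232", "other", "#TRANS455"]}
)

mask = df_Grundinladdning["Datan"].str.contains("#TRANS", regex=False)

# .copy() makes the result independent of the original dataframe;
# reset_index(drop=True) renumbers the rows starting from 0
df_transOnly = df_Grundinladdning[mask].copy().reset_index(drop=True)
print(df_transOnly)
```

Taking an explicit copy also avoids pandas' chained-assignment warning if columns of `df_transOnly` are modified later.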

3 answers

Answer 1
1 2020-05-23 15:22:40

Answer 2
0 2020-05-23 15:19:25

Answer 3
0 2020-05-23 15:24:41
