繁体 English 中英

在熊猫中使用isin的最快方法

[英]Fastest way to use isin in pandas

原文 2014-06-17 17:59:32 0 1 python/ pandas

我有两个带有ID列的csv，其中第一个csv中的ID是第二个csv中ID的子集。 为了节省空间，在读取第一个csv之后，我试图仅读取第二个csv中出现在第一个csv中的行，如下所示：

chunker = pd.read_csv(t_path)

df = pd.DataFrame()
for chunk in chunker:
    # keep_ids is a series of ids from previous table
    temp = chunk[chunk['Id'].isin(keep_ids)]
    df = df.append(temp, ignore_index=True)
df.reset_index()

我正在处理的文件多达30个演出，因此这可能有点慢。 有没有更快的方法来找到适当的ID（可能使用索引）？

编辑1：将块的索引设置为等于id列，然后仅保留与keep_ids成功合并的行，是否很快？

1 个解决方案

也许是这样的：

chunker = pd.read_csv(t_path, iterator=True, chunksize=1000)
df = pd.concat(chunk[chunk['Id'].isin(keep_ids) for chunk in chunker ])

有没有办法在多个列表中使用 pandas .isin() 函数？

[英]Is there a way to use pandas .isin() function with multiple lists?

如何在IF语句中使用pandas isin（）

[英]How to use pandas isin() with IF statement

将.isin 应用于 pandas 中每一行的有效方法

[英]Efficient way to apply .isin to each row in pandas

如何将熊猫 isin 用于多列

[英]how to use pandas isin for multiple columns

如何使用 isin 在 pandas dataframe 中填写值？

[英]How to use isin to fill in values in a pandas dataframe?

这是在熊猫队中分组的最快方式吗？

[英]Is this the fastest way to group in Pandas?

在 Pandas 中计算的最快方法？

[英]Fastest way to calculate in Pandas?

使用 pandas 循环通过 dataframe 时使用 if/else 语句的最快方法

[英]Fastest way to use if/else statements when looping through dataframe with pandas

有没有办法使用 isin() 作为 pandas 数据框中另一列的计算器函数？

[英]Is there a way of using isin() as calculator function for another column in pandas dataframe?

使用 pandas isin 过滤列时项目长度错误

[英]Item wrong length when use pandas isin to filter column

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 有没有办法在多个列表中使用 pandas .isin() 函数？如何在IF语句中使用pandas isin（）将.isin 应用于 pandas 中每一行的有效方法如何将熊猫 isin 用于多列如何使用 isin 在 pandas dataframe 中填写值？这是在熊猫队中分组的最快方式吗？在 Pandas 中计算的最快方法？使用 pandas 循环通过 dataframe 时使用 if/else 语句的最快方法有没有办法使用 isin() 作为 pandas 数据框中另一列的计算器函数？使用 pandas isin 过滤列时项目长度错误

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM