繁体 English 中英

通过使用熊猫将现有列向下移动1行来创建新列

[英]Creating a new column by shifting an existing column to 1 row down using pandas

原文 2018-05-09 02:54:00 1 1 python/ pandas

我正在从事体育运动。 目的是记录游戏中的当前eventdatetime和PreviousEventTime。 我在下面的链接中有一个样本数据集。

https://drive.google.com/open?id=1DUNrWPFwrkZHpq_KeA4rZCJ94sbpUEDI

在此文件中，有11列。 该事件是基于时间收集的。 对于此重新安排，我将使用以下列gsm_ID ， eventdatetime列

我想创建一个新列PreviousEventTime ，该列占用eventdatetime列的n-1行。 这意味着对于每个gsm_ID ，都会有第一个eventdatetime 。 与时间列相比，新列将代表下一个事件时间。

gsm_ID eventdatetime PreviousEventTime

2462794 08/11/2017 18:46 08/11/2017 18:45

2462794 08/11/2017 18:49 08/11/2017 18:46

2462794 08/11/2017 19:13 08/11/2017 18:49

2462794 08/11/2017 19:31 08/11/2017 19:13

2462794 08/11/2017 20:09 08/11/2017 19:31

2462795 08/12/2017 17:39 08/12/2017 16:30

2462795 08/12/2017 17:44 08/12/2017 17:39

上面的示例仅用于两个游戏。 您可以通过gsm_id进行区分 。 PreviousEventTime的for行将始终为matchdatetime。 我将有100场比赛。 但是该过程将如上述示例重复。

eventdata ['PreviousEventTime-1'] = eventdata.groupby(['gsm_id'])['eventdatetime'].shift(-1)

但这仅适用于第一个gsm_ID 。 它不适用于其他gsm_ID 。 上面脚本的输出如下：

您的建议将不胜感激。 问候，西风

1 个解决方案

排序正确解决了问题。 我添加了以下排序和索引：

eventdata = eventdata.set_index(['gsm_id']) .sort_index(ascending =True)

eventdata=eventdata.sort_values(['matchdatetime','time'],ascending=[True,True])

eventdata ['PreviousEventTime-1'] = eventdata.groupby(['gsm_id','matchdatetime'])['eventdatetime'].shift(1, axis = 0)

但是剩下的部分是用matchdatetime填充NaT。 谢谢大家给我的建议。 关于西风

使用Pandas df的字典基于现有列创建新列

[英]creating new column based on existing column using a dictionary for a Pandas df

基于 pandas 中的现有列创建新列

[英]Creating new column based on existing column in pandas

根据 if 和现有列在 pandas 中创建新列

[英]creating new column in pandas based on if and existing column

Pandas 使用现有列和字典的新列

[英]Pandas new column using a existing column and a dictionary

使用基于现有行值的条件在Pandas中创建新列并返回另一行的值

[英]Creating new column in Pandas with a condition based on existing row values and returning another row's values

从pandas中的每一行创建一个新列

[英]Creating a new column from each row in pandas

通过将现有列与十进制数相乘在 Pandas 中创建新列

[英]Creating new column in pandas by multiplying existing column with decimal number

Pandas：根据现有列的值创建新列

[英]Pandas: Creating new column based on values from existing column

基于现有列的下一行元素创建新列

[英]Creating a new column based on the element of next row of existing column

使用 pandas 移动值并创建新索引

[英]Shifting a value and creating a new index using pandas

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 使用Pandas df的字典基于现有列创建新列基于 pandas 中的现有列创建新列根据 if 和现有列在 pandas 中创建新列 Pandas 使用现有列和字典的新列使用基于现有行值的条件在Pandas中创建新列并返回另一行的值从pandas中的每一行创建一个新列通过将现有列与十进制数相乘在 Pandas 中创建新列 Pandas：根据现有列的值创建新列基于现有列的下一行元素创建新列使用 pandas 移动值并创建新索引

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM