如何从 python 中的 dataframe 列中删除每个第 n 个元素

Question

I am using python and have a pandas dataframe imported from a csv.我正在使用 python 并有一个从 Z628CB5675FF524F3E719BFEAAZE8 导入的 pandas dataframe。 I would like to remove every nth value from each entry in a specific column.我想从特定列中的每个条目中删除每个第 n 个值。

For example:例如：

the dataframe column to transform is called:要转换的 dataframe 列称为：
"Linestring" “线串”
- each entry has a varying float lengths and goes like this: Linestring(151.420 -33.540, 155.464722 -39.069046, 153.30925678 -33.08364825, 152.0998 -31.8090, 150.539067 -30.57578)每个条目都有不同的浮点长度，如下所示： Linestring(151.420 -33.540, 155.464722 -39.069046, 153.30925678 -33.08364825, 152.0998 -31.8090, 150.539067 -30.57578)
each entry has varying lengths每个条目都有不同的长度
I would like to remove say every two elements after each comma giving: Linestring(151.420 -33.540, 153.30925678 -33.08364825, 150.539067 -30.57578)我想在每个逗号给出后删除每两个元素： Linestring(151.420 -33.540, 153.30925678 -33.08364825, 150.539067 -30.57578)

Attached/linked is a visual guide of what I am after.附加/链接是我所追求的视觉指南。

Example problem and outcome示例问题和结果

Thanks a lot: :)非常感谢：：）

Answer 1

Try this.尝试这个。 I hope it'll help.我希望它会有所帮助。

df['Linestring'] = df.Linestring.apply(lambda x: ','.join(x.split(',')[::2]) if ','.join(x.split(',')[::2])[-1] == ')' else ','.join(x.split(',')[::2]) + ')')

Answer 2

I wrote a function to replace every nth value with None, you can then drop these values leaving you with a new data frame that does not include these dropped cells.我写了一个 function 用 None 替换每个第 n 个值，然后您可以删除这些值，留下一个不包含这些删除单元格的新数据框。 I hope this helps.我希望这有帮助。

import pandas as pd

df = pd.DataFrame({'Numbers': [10,15,33,22,17,77,9]}) #a dataframe with column and some values
print(df) #prints the original dataframe

def rmNth(dFrame, col, n): #dataframe, column, delete every 'nth' 
    rows = len(dFrame.axes[0]) #stores the number of rows
    x = n - 1 #used in the while loop

    while (x <= rows): #replace every nth cell with a null value
        dFrame.at[x ,col] = None
        x = x + n #increment x by n

    print(dFrame) #prints the dataframe showing all cells that will be removed are replaced with 'nan'
    newDF = dFrame.dropna() #remove null cells
    newDF.reset_index(drop=True, inplace=True) #reset the index
    return (newDF)

print(rmNth(df, "Numbers", 3)) #print the data frame with every 3rd value removed from the Numbers column

如何从 python 中的 dataframe 列中删除每个第 n 个元素

问题描述

2 个解决方案

解决方案1
0 2022-09-13 01:40:18

解决方案2
0 2022-09-13 02:42:45

如何从 python 中的 dataframe 列中删除每个第 n 个元素

问题描述

2 个解决方案

解决方案1 0 2022-09-13 01:40:18

解决方案2 0 2022-09-13 02:42:45

解决方案1
0 2022-09-13 01:40:18

解决方案2
0 2022-09-13 02:42:45