如何从数据框中删除离群值？

Question

I have a (268X4) df and found the outliers (22,1) for one column. 我有（268X4）df，发现一列的异常值（22,1）。 I want to remove those outliers from the df. 我想从df中删除那些离群值。 How do I do that? 我怎么做？

> df=df_nonull import pandas as pd   # to manipulate dataframes import
> numpy as np   # to manipulate arrays
> 
> # a number "a" from the vector "x" is an outlier if 
> # a > median(x)+1.5*iqr(x) or a < median-1.5*iqr(x)
> # iqr: interquantile range = third interquantile - first interquantile def 
>outliers(x): 
>        return np.abs(x- x.median()) > 1.5*(x.quantile(.75)-
>x.quantile(0.25))
> 
> # Give the outliers for the first column for example 
>outliers=df.StockValue[outliers(df.StockValue)]

Answer 1

You can only remove the whole row, njot a single cell like (22,1). 您只能删除整行，njot像（22,1）这样的单个单元格。 If you want to remove the complete row of the data. 如果要删除数据的完整行。

df = df.drop(df.index[[22]]) df = df.drop（df.index [[22]]）

如何从数据框中删除离群值？

问题描述

1 个解决方案

解决方案1
1 2017-05-26 16:06:07

如何从数据框中删除离群值？

问题描述

1 个解决方案

解决方案1 1 2017-05-26 16:06:07

解决方案1
1 2017-05-26 16:06:07