简体   繁体   English

数据质量 仅限数字列

[英]Data quality Numeric Columns only

I'm trying to setup a data quality check for numeric columns in a dataframe.我正在尝试为 dataframe 中的数字列设置数据质量检查。 I want to run the describe() to produce stats on each numeric columns.我想运行 describe() 以生成每个数字列的统计信息。 How can I filter out other columns to produce stats.如何过滤掉其他列以生成统计信息。 See line of code I'm using.请参阅我正在使用的代码行。

df1 = pandas.read_csv("D:/dc_Project/loans.csv") print(df1.describe(include=sorted(df1))) df1 = pandas.read_csv("D:/dc_Project/loans.csv") 打印(df1.describe(include=sorted(df1)))

Went with the following from a teammate: import pandas as pd import numpy as np从队友那里得到以下信息: import pandas as pd import numpy as np

df1 = pandas.read_csv("D:/dc_Project/loans.csv") df2=df1.select_dtypes(include=np.number) df1 = pandas.read_csv("D:/dc_Project/loans.csv") df2=df1.select_dtypes(include=np.number)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM