[英]Pandas analysis of data in csv
Im trying to code a statistical model using pandas. 我正在尝试使用熊猫编写统计模型。 First i wanna start by doing a simple mean and median check of the price. 首先,我想先做一个简单的均值和中位数价格检查。 I have shown data in csv below. 我已经在下面的csv中显示了数据。
what I have tried: 我尝试过的
import pandas
df = pandas.read_csv('audi-a4.csv', index_col=False, header=0);
serie = df.transpose() # here we convert the DataFrame into a Se
x = serie.describe()
print(serie)
print(x)
Data in CSV: CSV格式的数据:
NoteDate Fuel StartDate BestContactTime PriceDisplay IsBold Make Transmission Category Owners HasFreeShipping Year EngineSize AsAt IsDealer Doors ExteriorColour Odometer StartPrice NumberPlate ImportHistory EndDate Cylinders Title Vin CategoryPath WofExpires BodyStyle ListingLength Suburb StereoDescription ListingId Region Model RegistrationExpires
/Date(0)/ Petrol /Date(1457523875033)/ Evening $17,985 TRUE Audi Automatic 0001-0268-0271- 0 TRUE 2005 2000 /Date(1457854140345)/ FALSE 0 Dark Blue 61988 17985 ABC123 /Date(1458128675033)/ 0 Audi A4 Test887 2005 WAUZZZ8EX5A457186 /Trade-Me-Motors/Cars/Audi /Date(0)/ Sedan Whangarei 4554769 Northland A4 /Date(0)/
/Date(0)/ Petrol /Date(1457525091850)/ Evening $17,985 TRUE Audi Automatic 0001-0268-0271- 0 TRUE 2005 2000 /Date(1457854140345)/ FALSE 0 Dark Blue 61988 17985 ABC123 /Date(1458129891850)/ 0 Audi A4 Test biz1 2005 WAUZZZ8EX5A457186 /Trade-Me-Motors/Cars/Audi /Date(0)/ Sedan Whangarei 4554770 Northland A4 /Date(0)/
/Date(0)/ Petrol /Date(1457530941643)/ Evening $17,985 TRUE Audi Automatic 0001-0268-0271- 0 TRUE 2005 2000 /Date(1457854140345)/ FALSE 0 Dark Blue 61988 17985 ABC123 /Date(1458135741643)/ 0 Audi A4 Test123 2005 WAUZZZ8EX5A457186 /Trade-Me-Motors/Cars/Audi /Date(0)/ Sedan Whangarei 4554773 Northland A4 /Date(0)/
/Date(0)/ Petrol /Date(1457532268097)/ Evening $17,985 TRUE Audi Automatic 0001-0268-0271- 0 TRUE 2005 2000 /Date(1457854140345)/ FALSE 0 Dark Blue 61988 17985 ABC123 /Date(1458137068097)/ 0 Audi A4 Luxary Ship for Sale 2005 WAUZZZ8EX5A457186 /Trade-Me-Motors/Cars/Audi /Date(0)/ Sedan Whangarei 4554774 Northland A4 /Date(0)/
/Date(0)/ Petrol /Date(1457612061090)/ Evening $17,985 TRUE Audi Automatic 0001-0268-0271- 0 TRUE 2005 2000 /Date(1457854140345)/ FALSE 0 Dark Blue 61988 17985 ABC123 /Date(1458216861090)/ 0 Audi A4 Well Established, EST T/O $8.8M, Owne 2005 WAUZZZ8EX5A457186 /Trade-Me-Motors/Cars/Audi /Date(0)/ Sedan Whangarei 4557063 Northland A4 /Date(0)/
First, you're going to want to get those "/" and the "," out of your data frames. 首先,您将要从数据框中获取那些“ /”和“,”。 For the mean of a column, you can use the df.mean() function from pandas. 对于列的均值,可以使用pandas中的df.mean()函数。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.