How to print specific years on a massive csv file using Python

Question

import pandas

fileref = open('chart.csv')
f = pandas.read_csv(fileref)

f_set = f[f.year >= 2005]
print(f_set.groupby('y').namefromchart.nunique())

So I know f_set will print out my y cells with the corresponding years of 2005 through current. What if I want to print out the years 2002 through 2009?

Answer 1

It seems you need add second condition to boolean indexing :

import pandas as pd

f = pd.read_csv('chart.csv')
f_set = f[(f.year >= 2002) & (f.year < 2010)]
print (f_set.groupby('y').namefromchart.nunique())

Another solution with between , by default is inclusive :

f_set = f[f.year.between(2002,2009)]

Answer 2

consider the pd.DataFrame f

f = pd.DataFrame(dict(year=range(2000, 2011), A=np.random.rand(11)))

you can set the index and slice how you'd like

f.set_index('year').ix[2002:2009]

How to print specific years on a massive csv file using Python

Question

2 answers

solution1
2 2016-10-20 06:37:18

solution2
1 2016-10-20 06:44:14

How to print specific years on a massive csv file using Python

Question

2 answers

solution1 2 2016-10-20 06:37:18

solution2 1 2016-10-20 06:44:14

solution1
2 2016-10-20 06:37:18

solution2
1 2016-10-20 06:44:14