简体   繁体   中英

Pandas Dataframe Bar Plot - Qualitative Variable?

I am looking to create a simple bar plot, where the grouped qualitative values for "source" are the x-axis and the quantiative values of "retweet_count" are the y-axis.

I want the total number of retweets by source.

    _id                         created_at              geo     id      retweet_count   source  text
0   52c2388a0b434a166c94a648    2013-12-31 03:22:30     None    417858271957893121  0   web     @sirenasmaster I'm one of 4 people that I foll...
1   52c2388a0b434a166c94a649    2013-12-31 03:22:29     None    417858271350120449  0   Twitter for iPhone  Thanks Mack brown always looked up to you and ...
2   52c2388a0b434a166c94a64a    2013-12-31 03:22:26     None    417858255121948672  0   Twitter for iPhone  @CottonGent I never said case was awesome, but...
3   52c2388a0b434a166c94a64b    2013-12-31 03:22:20     None    417858229625163776  0   buzztap     Longhorn Digest (Scout) >> UA Game: DB C...
4   52c2388a0b434a166c94a64c    2013-12-31 03:22:15     None    417858211186626560  1   web     RT @rudyp_: Marvin is on it tonight. He's a tr...
5   52c2388a0b434a166c94a64d    2013-12-31 03:22:12     None    417858197550931970  2   Twitter for iPhone  RT @marvintran76: If you're a Longhorn and you...
6   52c2388a0b434a166c94a64e    2013-12-31 03:21:49     None    417858102042443776  1   Twitter for iPhone  Always will be a Longhorn Fan 👌🐂
7   52c2388a0b434a166c94a64f    2013-12-31 03:21:38     None    417858055955423232  0   Twitter for Android     Longhorn and duke basketball, and lakers are o...
8   52c2388a0b434a166c94a650    2013-12-31 03:21:34     None    417858036716142592  1   Twitter for iPhone  Marvin is on it tonight. He's a true longhorn ...
9   52c2388a0b434a166c94a651    2013-12-31 03:21:32     None    417858031880134657  0   web     @ChipBrownOB but twice as many as there are Lo..

If I use the following simple code, it gives me a bar for each individual source entry:

df.plot(x='source', y='retweet_count', kind='bar')

This is something I've been banging my head on but it probably right in front of me.

You can groupby and plot then:

>>> df.groupby('source')['retweet_count'].sum().plot(kind='bar')
<matplotlib.axes.AxesSubplot object at 0x039C8070>

结果

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM