How to add multiple annotations to a barplot

I would like to add percent values, in addition to counts, to my pandas barplot, but I have not been able to do so. My code is shown below; so far I can get the count values to display. Can somebody please help me add relative % values next to or below the count value displayed for each bar?

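import numpy as np  # needed for np.arange below
import matplotlib
import matplotlib.pyplot as plt
# Jupyter/IPython magic; remove this line when running as a plain script
%matplotlib inline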
plt.style.use('ggplot')

import seaborn as sns
sns.set_style("white")

fig = plt.figure()
fig.set_figheight(5)
fig.set_figwidth(10)

ax = fig.add_subplot(111)

counts = [29227, 102492,  53269, 504028, 802994]

y_ax = ('A','B','C','D','E')
y_tick = np.arange(len(y_ax))

ax.barh(range(len(counts)), counts, align = "center", color = "tab:blue")
ax.set_yticks(y_tick)
ax.set_yticklabels(y_ax, size = 8)

#annotate bar plot with values
for i in ax.patches:
    ax.text(i.get_width()+.09, i.get_y()+.3, str(round((i.get_width()), 1)), fontsize=8)

sns.despine()
plt.show();

The resulting plot shows only the count value next to each bar. How can I add a % value next to each displayed count?

With pandas

  • Tested with pandas v1.2.4

Imports and Load Data

import pandas as pd
import matplotlib.pyplot as plt

# create the dataframe from values in the OP
counts = [29227, 102492,  53269, 504028, 802994]
df = pd.DataFrame(data=counts, columns=['counts'], index=['A','B','C','D','E'])

# add a percent column
df['%'] = df.counts.div(df.counts.sum()).mul(100).round(2)

# display(df)
   counts      %
A   29227   1.96
B  102492   6.87
C   53269   3.57
D  504028  33.78
E  802994  53.82
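
As a quick sanity check, the percent column can be reproduced in plain Python. This is only an illustrative sketch, not part of the original answer; it assumes the same `counts` list and the same rounding to two decimals as the `df['%']` line above.

```python
# Counts taken from the question
counts = [29227, 102492, 53269, 504028, 802994]

# Each percent is the count's share of the total, rounded to 2 decimals
total = sum(counts)
percents = [round(100 * c / total, 2) for c in counts]

print(total)     # 1492010
print(percents)  # [1.96, 6.87, 3.57, 33.78, 53.82]
```

The values match the `%` column displayed above.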

Plot using matplotlib v3.4.2 or later (ax.bar_label requires matplotlib 3.4+)

ax = df.plot(kind='barh', y='counts', figsize=(10, 5), legend=False, width=.75,
             title='This is the plot generated by all code examples in this answer')

# customize the label to include the percent
labels = [f' {v.get_width()}\n {df.iloc[i, 1]}%' for i, v in enumerate(ax.containers[0])]

# set the bar label
ax.bar_label(ax.containers[0], labels=labels, label_type='edge', size=13)

ax.spines['right'].set_visible(False)
ax.spines['top'].set_visible(False)
plt.show()


Plot using matplotlib before version 3.4.2

# plot the dataframe
ax = df.plot(kind='barh', y='counts', figsize=(10, 5), legend=False, width=.75)
for i, y in enumerate(ax.patches):

    # get the percent label
    label_per = df.iloc[i, 1]
    
    # add the value label
    ax.text(y.get_width()+.09, y.get_y()+.3, str(round((y.get_width()), 1)), fontsize=10)
    
    # add the percent label here
    ax.text(y.get_width()+.09, y.get_y()+.1, str(f'{round((label_per), 2)}%'), fontsize=10)

ax.spines['right'].set_visible(False)
ax.spines['top'].set_visible(False)
plt.show()

Original Answer without pandas

  • Tested with matplotlib v3.3.4

import numpy as np  # needed for np.arange below
import matplotlib.pyplot as plt

fig, ax = plt.subplots(figsize=(10, 5))

counts = [29227, 102492,  53269, 504028, 802994]

# calculate percents
percents = [100*x/sum(counts) for x in counts]

y_ax = ('A','B','C','D','E')
y_tick = np.arange(len(y_ax))

ax.barh(range(len(counts)), counts, align = "center", color = "tab:blue")
ax.set_yticks(y_tick)
ax.set_yticklabels(y_ax, size = 8)

#annotate bar plot with values
for i, y in enumerate(ax.patches):
    label_per = percents[i]
    ax.text(y.get_width()+.09, y.get_y()+.3, str(round((y.get_width()), 1)), fontsize=10)
    # add the percent label here
    # ax.text(y.get_width()+.09, y.get_y()+.3, str(round((label_per), 2)), ha='right', va='center', fontsize=10)
    ax.text(y.get_width()+.09, y.get_y()+.1, str(f'{round((label_per), 2)}%'), fontsize=10)

ax.spines['right'].set_visible(False)
ax.spines['top'].set_visible(False)
plt.show()
  • You can play with the positioning.
  • Other formatting options mentioned by JohanC:
      • Print both parts of the text in one string with a \n in between to get a "natural" line spacing: str(f'{round((y.get_width()), 1)}\n{round((label_per), 2)}%')
      • Use ax.text(..., va='center') to vertically center the text, which also allows a slightly larger font.
      • Use ax.set_xlim(0, max(counts) * 1.18) to get a bit more space for the text.
      • Start each line of text with a space to get a natural "horizontal" padding: str(f' {round((label_per), 2)}%'), note the space before {.
      • y.get_width()+.09 is extremely close to y.get_width() when the values are in the tens of thousands, so the +.09 offset adds no visible padding.
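
The formatting suggestions in the list above can be combined into a single loop. The following is a minimal sketch under stated assumptions rather than code from the original answer: the helper name `make_label` and the font size of 12 are illustrative choices, and the bars are labelled by passing the category names directly to `ax.barh`.

```python
import matplotlib.pyplot as plt

counts = [29227, 102492, 53269, 504028, 802994]
names = ['A', 'B', 'C', 'D', 'E']
total = sum(counts)


def make_label(count, total):
    """Hypothetical helper: count and percent on two lines.

    Each line starts with a space, which acts as horizontal padding.
    """
    return f' {count}\n {round(100 * count / total, 2)}%'


fig, ax = plt.subplots(figsize=(10, 5))
bars = ax.barh(names, counts, align='center', color='tab:blue')

for bar, count in zip(bars, counts):
    # Anchor the text at the bar's right edge and vertical midpoint;
    # va='center' centers the two-line label on the bar.
    ax.text(bar.get_width(), bar.get_y() + bar.get_height() / 2,
            make_label(count, total), va='center', fontsize=12)

# Leave extra room on the right so the longest bar's label is not clipped
ax.set_xlim(0, max(counts) * 1.18)

ax.spines['right'].set_visible(False)
ax.spines['top'].set_visible(False)
plt.show()
```

Because the padding comes from the leading space in each label line, no data-unit offset such as +.09 is needed.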
