使用 matplotlib 在單個圖表上繪制兩個直方圖

Question

我使用文件中的數據創建了一個直方圖，沒有問題。 現在我想在同一個直方圖中疊加來自另一個文件的數據，所以我做這樣的事情

n,bins,patchs = ax.hist(mydata1,100)
n,bins,patchs = ax.hist(mydata2,100)

但問題是，對於每個區間，只有最高值的條出現，而其他的則隱藏。 我想知道如何用不同的顏色同時繪制兩個直方圖。

Answer 1

這里有一個工作示例：

import random
import numpy
from matplotlib import pyplot

x = [random.gauss(3,1) for _ in range(400)]
y = [random.gauss(4,2) for _ in range(400)]

bins = numpy.linspace(-10, 10, 100)

pyplot.hist(x, bins, alpha=0.5, label='x')
pyplot.hist(y, bins, alpha=0.5, label='y')
pyplot.legend(loc='upper right')
pyplot.show()

在此處輸入圖片說明

Answer 2

接受的答案給出了帶有重疊條的直方圖的代碼，但如果您希望每個條並排（就像我所做的那樣），請嘗試以下變體：

import numpy as np
import matplotlib.pyplot as plt

x = np.random.normal(1, 2, 5000)
y = np.random.normal(-1, 3, 2000)
bins = np.linspace(-10, 10, 30)

plt.hist([x, y], bins, label=['x', 'y'])
plt.legend(loc='upper right')
plt.show()

參考： http : //matplotlib.org/examples/statistics/histogram_demo_multihist.html

編輯 [2018/03/16]：更新為允許繪制不同大小的數組，如@stochastic_zeitgeist 所建議的

Answer 3

如果您有不同的樣本量，可能很難將分布與單個 y 軸進行比較。 例如：

import numpy as np
import matplotlib.pyplot as plt

#makes the data
y1 = np.random.normal(-2, 2, 1000)
y2 = np.random.normal(2, 2, 5000)
colors = ['b','g']

#plots the histogram
fig, ax1 = plt.subplots()
ax1.hist([y1,y2],color=colors)
ax1.set_xlim(-10,10)
ax1.set_ylabel("Count")
plt.tight_layout()
plt.show()

在這種情況下，您可以在不同的軸上繪制兩個數據集。 為此，您可以使用 matplotlib 獲取直方圖數據，清除軸，然后在兩個單獨的軸上重新繪制它（移動 bin 邊緣，使它們不重疊）：

#sets up the axis and gets histogram data
fig, ax1 = plt.subplots()
ax2 = ax1.twinx()
ax1.hist([y1, y2], color=colors)
n, bins, patches = ax1.hist([y1,y2])
ax1.cla() #clear the axis

#plots the histogram data
width = (bins[1] - bins[0]) * 0.4
bins_shifted = bins + width
ax1.bar(bins[:-1], n[0], width, align='edge', color=colors[0])
ax2.bar(bins_shifted[:-1], n[1], width, align='edge', color=colors[1])

#finishes the plot
ax1.set_ylabel("Count", color=colors[0])
ax2.set_ylabel("Count", color=colors[1])
ax1.tick_params('y', colors=colors[0])
ax2.tick_params('y', colors=colors[1])
plt.tight_layout()
plt.show()

Answer 4

作為Gustavo Bezerra 回答的補充：

如果你想每個直方圖進行歸一化（ normed為MPL <= 2.1和density為MPL> = 3.1），你不能只用normed/density=True ，你需要為每個代替值的權重：

import numpy as np
import matplotlib.pyplot as plt

x = np.random.normal(1, 2, 5000)
y = np.random.normal(-1, 3, 2000)
x_w = np.empty(x.shape)
x_w.fill(1/x.shape[0])
y_w = np.empty(y.shape)
y_w.fill(1/y.shape[0])
bins = np.linspace(-10, 10, 30)

plt.hist([x, y], bins, weights=[x_w, y_w], label=['x', 'y'])
plt.legend(loc='upper right')
plt.show()

作為比較，具有默認權重和density=True的完全相同的x和y向量：

Answer 5

您應該使用hist返回的值中的bins ：

import numpy as np
import matplotlib.pyplot as plt

foo = np.random.normal(loc=1, size=100) # a normal distribution
bar = np.random.normal(loc=-1, size=10000) # a normal distribution

_, bins, _ = plt.hist(foo, bins=50, range=[-6, 6], normed=True)
_ = plt.hist(bar, bins=bins, alpha=0.5, normed=True)

Answer 6

這是一個簡單的方法，當數據具有不同的大小時，在同一個圖上繪制兩個直方圖，它們的條並排：

def plotHistogram(p, o):
    """
    p and o are iterables with the values you want to 
    plot the histogram of
    """
    plt.hist([p, o], color=['g','r'], alpha=0.8, bins=50)
    plt.show()

Answer 7

還有一個與 joaquin 答案非常相似的選項：

import random
from matplotlib import pyplot

#random data
x = [random.gauss(3,1) for _ in range(400)]
y = [random.gauss(4,2) for _ in range(400)]

#plot both histograms(range from -10 to 10), bins set to 100
pyplot.hist([x,y], bins= 100, range=[-10,10], alpha=0.5, label=['x', 'y'])
#plot legend
pyplot.legend(loc='upper right')
#show it
pyplot.show()

給出以下輸出：

Answer 8

繪制兩個重疊的直方圖（或更多）會導致繪圖相當混亂。 我發現使用階梯直方圖（又名空心直方圖）可以大大提高可讀性。 唯一的缺點是在 matplotlib 中，步驟直方圖的默認圖例格式不正確，因此可以像以下示例中那樣對其進行編輯：

import numpy as np                   # v 1.19.2
import matplotlib.pyplot as plt      # v 3.3.2
from matplotlib.lines import Line2D

rng = np.random.default_rng(seed=123)

# Create two normally distributed random variables of different sizes
# and with different shapes
data1 = rng.normal(loc=30, scale=10, size=500)
data2 = rng.normal(loc=50, scale=10, size=1000)

# Create figure with 'step' type of histogram to improve plot readability
fig, ax = plt.subplots(figsize=(9,5))
ax.hist([data1, data2], bins=15, histtype='step', linewidth=2,
        alpha=0.7, label=['data1','data2'])

# Edit legend to get lines as legend keys instead of the default polygons
# and sort the legend entries in alphanumeric order
handles, labels = ax.get_legend_handles_labels()
leg_entries = {}
for h, label in zip(handles, labels):
    leg_entries[label] = Line2D([0], [0], color=h.get_facecolor()[:-1],
                                alpha=h.get_alpha(), lw=h.get_linewidth())
labels_sorted, lines = zip(*sorted(leg_entries.items()))
ax.legend(lines, labels_sorted, frameon=False)

# Remove spines
ax.spines['top'].set_visible(False)
ax.spines['right'].set_visible(False)

# Add annotations
plt.ylabel('Frequency', labelpad=15)
plt.title('Matplotlib step histogram', fontsize=14, pad=20)
plt.show()

如您所見，結果看起來很干凈。 當重疊兩個以上的直方圖時，這尤其有用。 根據變量的分布方式，這最多可用於大約 5 個重疊分布。 不僅如此，還需要使用另一種類型的繪圖，例如此處介紹的一種。

Answer 9

聽起來您可能只想要一個條形圖：

或者，您可以使用子圖。

Answer 10

以防萬一您有熊貓（ import pandas as pd ）或者可以使用它：

test = pd.DataFrame([[random.gauss(3,1) for _ in range(400)], 
                     [random.gauss(4,2) for _ in range(400)]])
plt.hist(test.values.T)
plt.show()

Answer 11

當您想從二維 numpy 數組繪制直方圖時，有一個警告。 您需要交換 2 個軸。

import numpy as np
import matplotlib.pyplot as plt

data = np.random.normal(size=(2, 300))
# swapped_data.shape == (300, 2)
swapped_data = np.swapaxes(x, axis1=0, axis2=1)
plt.hist(swapped_data, bins=30, label=['x', 'y'])
plt.legend()
plt.show()

Answer 12

這個問題之前已經回答過，但想添加另一個快速/簡單的解決方法，可能會幫助其他訪問者解決這個問題。

import seasborn as sns 
sns.kdeplot(mydata1)
sns.kdeplot(mydata2)

這里有一些有用的例子，用於 kde 與直方圖的比較。

Answer 13

受到所羅門回答的啟發，但要堅持與直方圖相關的問題，一個干凈的解決方案是：

sns.distplot(bar)
sns.distplot(foo)
plt.show()

確保首先繪制較高的直方圖，否則您需要設置 plt.ylim(0,0.45) 以便不會切斷較高的直方圖。

使用 matplotlib 在單個圖表上繪制兩個直方圖

問題描述

13 個解決方案

解決方案1
497 已采納 2011-07-29 13:33:44

解決方案2
228 2016-09-14 02:41:04

解決方案3
35 2017-12-11 10:05:59

解決方案4
15 2018-12-10 01:48:00

解決方案5
11 2018-07-31 14:48:37

解決方案6
7 2017-07-05 11:56:36

解決方案7
3 2020-04-30 08:06:26

解決方案8
3 2020-12-25 20:13:27

解決方案9
3 2011-07-29 09:50:25

解決方案10
2 2017-06-16 12:35:46

解決方案11
2 2019-12-05 15:44:26

解決方案12
1 2019-04-30 18:07:04

解決方案13
1 2019-06-18 03:55:22

使用 matplotlib 在單個圖表上繪制兩個直方圖

問題描述

13 個解決方案

解決方案1 497 已采納 2011-07-29 13:33:44

解決方案2 228 2016-09-14 02:41:04

解決方案3 35 2017-12-11 10:05:59

解決方案4 15 2018-12-10 01:48:00

解決方案5 11 2018-07-31 14:48:37

解決方案6 7 2017-07-05 11:56:36

解決方案7 3 2020-04-30 08:06:26

解決方案8 3 2020-12-25 20:13:27

解決方案9 3 2011-07-29 09:50:25

解決方案10 2 2017-06-16 12:35:46

解決方案11 2 2019-12-05 15:44:26

解決方案12 1 2019-04-30 18:07:04

解決方案13 1 2019-06-18 03:55:22

解決方案1
497 已采納 2011-07-29 13:33:44

解決方案2
228 2016-09-14 02:41:04

解決方案3
35 2017-12-11 10:05:59

解決方案4
15 2018-12-10 01:48:00

解決方案5
11 2018-07-31 14:48:37

解決方案6
7 2017-07-05 11:56:36

解決方案7
3 2020-04-30 08:06:26

解決方案8
3 2020-12-25 20:13:27

解決方案9
3 2011-07-29 09:50:25

解決方案10
2 2017-06-16 12:35:46

解決方案11
2 2019-12-05 15:44:26

解決方案12
1 2019-04-30 18:07:04

解決方案13
1 2019-06-18 03:55:22