[英]How do I make bokeh omit missing dates when using datetime as x-axis
我正在查看散景文檔中的燭台示例,可在此處找到:
https://github.com/bokeh/bokeh/blob/master/examples/plotting/file/candlestick.py
我試圖找出一種消除 x 軸中沒有數據的“空格”的好方法。
具體來說,對於示例中使用的 MSFT 等財務數據,沒有周末和節假日的數據。 當沒有日期數據時,有沒有辦法告訴散景不要在圖表中留下空白?
為方便起見,這里粘貼了上面鏈接中的示例代碼:
from math import pi
import pandas as pd
from bokeh.sampledata.stocks import MSFT
from bokeh.plotting import *
df = pd.DataFrame(MSFT)[:50]
df['date'] = pd.to_datetime(df['date'])
mids = (df.open + df.close)/2
spans = abs(df.close-df.open)
inc = df.close > df.open
dec = df.open > df.close
w = 12*60*60*1000 # half day in ms
output_file("candlestick.html", title="candlestick.py example")
figure(x_axis_type = "datetime", tools="pan,wheel_zoom,box_zoom,reset,previewsave",
width=1000, name="candlestick")
hold()
segment(df.date, df.high, df.date, df.low, color='black')
rect(df.date[inc], mids[inc], w, spans[inc], fill_color="#D5E1DD", line_color="black")
rect(df.date[dec], mids[dec], w, spans[dec], fill_color="#F2583E", line_color="black")
curplot().title = "MSFT Candlestick"
xaxis().major_label_orientation = pi/4
grid().grid_line_alpha=0.3
show() # open a browser
更新:從 Bokeh 0.12.6
您可以為軸上的主要刻度標簽指定覆蓋。
import pandas as pd
from bokeh.io import show, output_file
from bokeh.plotting import figure
from bokeh.sampledata.stocks import MSFT
df = pd.DataFrame(MSFT)[:50]
inc = df.close > df.open
dec = df.open > df.close
p = figure(plot_width=1000, title="MSFT Candlestick with Custom X-Axis")
# map dataframe indices to date strings and use as label overrides
p.xaxis.major_label_overrides = {
i: date.strftime('%b %d') for i, date in enumerate(pd.to_datetime(df["date"]))
}
# use the *indices* for x-axis coordinates, overrides will print better labels
p.segment(df.index, df.high, df.index, df.low, color="black")
p.vbar(df.index[inc], 0.5, df.open[inc], df.close[inc], fill_color="#D5E1DD", line_color="black")
p.vbar(df.index[dec], 0.5, df.open[dec], df.close[dec], fill_color="#F2583E", line_color="black")
output_file("custom_datetime_axis.html", title="custom_datetime_axis.py example")
show(p)
如果您有大量日期,這種方法可能會變得笨拙,並且可能需要自定義擴展。
BokehJS 界面的一些細節已經改變。 對於 Bokeh 0.11
和更新版本, __implementation__
現在應該是:
__implementation__ = """
_ = require "underscore"
Model = require "model"
p = require "core/properties"
class DateGapTickFormatter extends Model
type: 'DateGapTickFormatter'
doFormat: (ticks) ->
date_labels = @get("date_labels")
return (date_labels[tick] ? "" for tick in ticks)
@define {
date_labels: [ p.Any ]
}
module.exports =
Model: DateGapTickFormatter
"""
預計這不會進一步改變。
拉取請求 3314是為一個適用於 2015-12-05 的示例而提出的。 原始代碼在這里。 燭台示例的文檔仍然顯示與問題中 OP 相同的代碼。
包括在下面以供參考。
from math import pi
import pandas as pd
from bokeh.sampledata.stocks import MSFT
from bokeh.plotting import figure, show, output_file
from bokeh.models.formatters import TickFormatter, String, List
# In this custom TickFormatter, xaxis labels are taken from an array of date
# Strings (e.g. ['Sep 01', 'Sep 02', ...]) passed to the date_labels property.
class DateGapTickFormatter(TickFormatter):
date_labels = List(String)
__implementation__ = """
_ = require "underscore"
HasProperties = require "common/has_properties"
class DateGapTickFormatter extends HasProperties
type: 'DateGapTickFormatter'
format: (ticks) ->
date_labels = @get("date_labels")
return (date_labels[tick] ? "" for tick in ticks)
module.exports =
Model: DateGapTickFormatter
"""
df = pd.DataFrame(MSFT)[:50]
# xaxis date labels used in the custom TickFormatter
date_labels = [date.strftime('%b %d') for date in pd.to_datetime(df["date"])]
mids = (df.open + df.close)/2
spans = abs(df.close-df.open)
inc = df.close > df.open
dec = df.open > df.close
w = 0.5
output_file("custom_datetime_axis.html", title="custom_datetime_axis.py example")
TOOLS = "pan,wheel_zoom,box_zoom,reset,save"
p = figure(tools=TOOLS, plot_width=1000, toolbar_location="left")
# Using the custom TickFormatter. You must always define date_labels
p.xaxis[0].formatter = DateGapTickFormatter(date_labels = date_labels)
# x coordinates must be integers. If for example df.index are
# datetimes, you should replace them with a integer sequence
p.segment(df.index, df.high, df.index, df.low, color="black")
p.rect(df.index[inc], mids[inc], w, spans[inc], fill_color="#D5E1DD", line_color="black")
p.rect(df.index[dec], mids[dec], w, spans[dec], fill_color="#F2583E", line_color="black")
p.title = "MSFT Candlestick with custom x axis"
p.xaxis.major_label_orientation = pi/4
p.grid[0].ticker.desired_num_ticks = 6
show(p) # open a browser
由於使用dataframe
索引的代碼,您的數據必須按日期升序排序。 如果您有一個按日期降序排列的時間序列,則可以將其反轉以供上述代碼使用:
df.sort_values(by='date', inplace=True)
df.reset_index(drop=True, inplace=True)
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.