Windowing and Watermark in Apache Beam: Google Dataflow
I have a fixed window of 5 minutes (300 seconds), and I am working in event time.
beam.WindowInto(window.FixedWindows(300))
When I deploy this code, are the windows created immediately, even if I have not published any messages? Suppose I deploy at 6:30: are the windows automatically created as 6:30 to 6:35, 6:35 to 6:40, and so on?
If I publish a message to the topic with event timestamp = 6:31 (Unix seconds, i.e. 10,176589653) when the system time is 6:36, does that mean the watermark for that specific message is at 6:31, so that it misses its window (since the system time is 6:36 and allowed lateness = 0) and is rejected?
Windows are always created using UNIX time 0 as a base. This means that no matter whether you start the pipeline at 6:31, 6:32 or 6:35, the windows will always be [6:30, 6:35), [6:35, 6:40), and so on.
Note that this also applies to day-long windows: they would start at 00:00 UTC.
If you want to change this, there is an offset parameter.
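To make the alignment concrete, here is a minimal plain-Python sketch of the arithmetic behind epoch-aligned fixed windows. It is not Beam's own code: the helper names fixed_window and hhmm and the date 2023-01-01 are made up for illustration, and the only assumption is that a window start is the timestamp rounded down to a multiple of the window size, counted from UNIX time 0 plus the offset.

```python
from datetime import datetime, timezone


def fixed_window(ts_seconds, size, offset=0):
    """Return the [start, end) bounds, in Unix seconds, of the window containing ts_seconds.

    Hypothetical helper: windows are aligned to Unix time 0, shifted by `offset` seconds.
    """
    start = ts_seconds - (ts_seconds - offset) % size
    return start, start + size


def hhmm(ts_seconds):
    """Format Unix seconds as HH:MM in UTC (illustration only)."""
    return datetime.fromtimestamp(ts_seconds, tz=timezone.utc).strftime("%H:%M")


# Hypothetical event times on an arbitrary day, expressed in UTC.
t_0631 = int(datetime(2023, 1, 1, 6, 31, tzinfo=timezone.utc).timestamp())
t_0636 = int(datetime(2023, 1, 1, 6, 36, tzinfo=timezone.utc).timestamp())

# Default alignment: bounds depend only on the event timestamp,
# not on when the pipeline was started.
for t in (t_0631, t_0636):
    start, end = fixed_window(t, 300)
    print(f"event {hhmm(t)} -> window [{hhmm(start)}, {hhmm(end)})")

# With a 60-second offset the boundaries shift to 6:31, 6:36, ...
start, end = fixed_window(t_0631, 300, offset=60)
print(f"event {hhmm(t_0631)} with offset=60 -> window [{hhmm(start)}, {hhmm(end)})")
```

In the Beam Python SDK, the corresponding call would presumably be window.FixedWindows(300, offset=60); check the exact signature against the SDK version you are using.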