简体   繁体   English

使用列表和 pcollection 的区别

[英]Difference between using a list or a pcollection

Im building a pipeline in apache beam and I just got curious about this, whats the difference between applying a ptransform to a list and a pcollection, is the performance affected by this or is just that the pcollection is inmutable and is this a bad way to aproach a pipeline with apache beam?我在 apache beam 中构建了一个管道,我只是对此感到好奇,将 ptransform 应用于列表和 pcollection 之间的区别是什么,性能是否受此影响,或者只是 pcollection 是不可变的,这是一个糟糕的方法接近带有 apache 光束的管道?

By definition, a PCollection is a unbounded collection.根据定义,PCollection 是一个无界集合。 Immutable, and unbounded .不可变的,无限的。

The main difference with a list is mainly the unbounded characteristic and it's especially powerful when you are streaming data (from a large file, or from a unbounded source, like PubSub).与列表的主要区别主要在于无界特性,当您流式传输数据时(来自大文件或来自无界源,如 PubSub),它特别强大。

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何转换 PCollection<row> 使用 Java 到数据流 Apache 中的 Integer</row> - How to convert PCollection<Row> to Integer in Dataflow Apache beam using Java 是否可以在不使用侧输入的情况下在不同的 Output 接收器上编写单个 Pcollection? - Is it possible to write a single Pcollection at different Output sinks without using side inputs? 如何使用 2 个 DateTime 计数之间的差异找到百分比? - How can I find Percentage using difference between 2 DateTime counts? 如何使用 SQL 获取连续日期之间的最大差异 - How to get Maximum Difference between Dates in a row using SQL DynamoDbAsyncClient 和 AmazonDynamoDBAsyncClient 之间的区别 - Difference between DynamoDbAsyncClient and AmazonDynamoDBAsyncClient 使用“Container.GetItemLinqQueryable”和“Container.GetItemQueryIterator”有什么区别? - What is the difference between using "Container.GetItemLinqQueryable" and "Container.GetItemQueryIterator"? 使用 ServiceBusMessageBatch 和在 TransactionScope 中发送多条消息有什么区别? - What is the difference between using a ServiceBusMessageBatch and sending multiple messages in a TransactionScope? 向 PCollection 添加增量索引? - Add incremental index to a PCollection? boto3 list_objects 和 list_objects_v2 有什么区别? - What is the difference between boto3 list_objects and list_objects_v2? 如何转换 PCollection<tablerow> 到个人收藏<row>在 Apache 梁?</row></tablerow> - How to convert PCollection<TableRow> to PCollection<Row> in Apache Beam?
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM