简体   繁体   中英

Spliterator vs Stream.Builder

I read some questions how to create a finite Stream ( Finite generated Stream in Java - how to create one? , How do streams stop? ).

The answers suggested to implement a Spliterator . The Spliterator would implement the logic how to and which element to provide as next ( tryAdvance ). But there are two other non-default methods trySplit and estimateSize() which I would have to implement.

The JavaDoc of Spliterator says:

An object for traversing and partitioning elements of a source. The source of elements covered by a Spliterator could be, for example, an array, a Collection , an IO channel, or a generator function. ... The Spliterator API was designed to support efficient parallel traversal in addition to sequential traversal, by supporting decomposition as well as single-element iteration. ...

On the other hand I could implement the logic how to advance to the next element around a Stream.Builder and bypass a Spliterator . On every advance I would call accept or add and at the end build . So it looks quite simple.

What does the JavaDoc say?

A mutable builder for a Stream . This allows the creation of a Stream by generating elements individually and adding them to the Builder (without the copying overhead that comes from using an ArrayList as a temporary buffer.)

Using StreamSupport.stream I can use a Spliterator to obtain a Stream . And also a Builder will provide a Stream .

When should / could I use a Stream.Builder ?
Only if a Spliterator wouldn't be more efficient (for instance because the source cannot be partitioned and its size cannot be estimated)?

Note that you can extend Spliterators.AbstractSpliterator . Then, there is only tryAdvance to implement.

So the complexity of implementing a Spliterator is not higher.

The fundamental difference is that a Spliterator 's tryAdvance method is only invoked when a new element is needed. In contrast, the Stream.Builder has a storage which will be filled with all stream elements, before you can acquire a Stream.

So a Spliterator is the first choice for all kinds of lazy evaluations, as well as when you have an existing storage you want to traverse, to avoid copying the data.

The builder is the first choice when the creation of the elements is non-uniform, so you can't express the creation of an element on demand. Think of situations where you would otherwise use Stream.of(…) , but it turns out to be to inflexible.

Eg you have Stream.of(a, b, c, d, e) , but now it turns out, c and d are optional. So the solution is

Stream.Builder<MyType> builder = Stream.builder();
builder.add(a).add(b);
if(someCondition) builder.add(c).add(d);
builder.add(e).build()
   /* stream operations */

Other use cases are this answer , where a Consumer was needed to query an existing spliterator and push the value back to a Stream afterwards, or this answer , where a structure without random access (a class hierarchy) should be streamed in the opposite order.

On the other hand I could implement the logic how to advance to the next element around a Stream.Builder and bypass a Spliterator. On every advance I would call accept or add and at the end build . So it looks quite simple.

Yes and no. It is simple, but I don't think you understand the usage model:

A stream builder has a lifecycle, which starts in a building phase, during which elements can be added, and then transitions to a built phase, after which elements may not be added. The built phase begins when the build() method is called, which creates an ordered Stream whose elements are the elements that were added to the stream builder, in the order they were added.

( Javadocs )

In particular no, you would not invoke a Stream.Builder 's accept or add method on any stream advance. You need to provide all the objects for the stream in advance. Then you build() to get a stream that will provide all the objects you previously added. This is analogous to adding all the objects to a List , and then invoking that List 's stream() method.

If that serves your purposes and you can in fact do it efficiently then great! But if you need to generate elements on an as-needed basis, whether with or without limit, then Stream.Builder cannot help you. Spliterator can.

The Stream.Builder is a misnomer as streams can't really be built. Things that can be built are value objects - dto, array, collection. So if Stream.Builder is instead thought of as a buffer, it might help understand it better, eg:

buffer.add(a)
buffer.add(b)
buffer.stream()

This shows how similar it is to an ArrayList:

list.add(a)
list.add(b)
list.stream()

On the other hand, Spliterator is the basis of a stream and allows for efficient navigation over data sets (improved version of the Iterator).

So the answer is they should not be compared. Comparing Stream.Builder to Spliterator is the same as comparing ArrayList to Spliterator.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM