Is spring-batch for me, even though I don't have a use for ItemReader and ItemWriter?

spring-batch newbie: I have a series of batches that

  • read all new records (since the last execution) from some sql tables
  • upload all the new records to hadoop
  • run a series of map-reduce (pig) jobs on all the data (old and new)
  • download all the output to local and run some other local processing on all the output

The point is, I don't have any obvious "item". I don't want to deal with specific lines of text in my data; I work with all of it as one big chunk and don't want any commit intervals and such...

However, I do want to keep all these steps loosely coupled - as in, steps a+b+c might succeed for several days and accumulate processed data while step d keeps failing, and then when it finally succeeds it will read and process all of the output of its previous steps.

So: is my "item" a fictive "working item" that signifies the entire set of new data? Do I maintain a series of queues myself and pass these fictive working items between them?

thanks!

People always assume that the only use of Spring Batch is chunk processing. That is a huge feature, but what's overlooked is the visibility of the processing and the job control.

Give 5 people the same task with no Spring Batch and they're going to implement flow control and visibility their own way. Give 5 people the same task with Spring Batch and you may end up with custom tasklets all done differently, but getting access to the job metadata and starting and stopping jobs is going to be consistent. From my perspective it's a great tool for job management. If you already have your jobs written, you can implement them as custom tasklets if you don't want to rewrite them to conform to the 'item' paradigm; you'll still see benefits.
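
For illustration, a minimal custom tasklet could look like the sketch below. It just wraps existing logic in one shot; HadoopUploader.uploadNewRecords() is a hypothetical stand-in for code you already have:

    // a minimal sketch of a custom Tasklet wrapping existing job logic;
    // HadoopUploader is a hypothetical placeholder, not a real API
    import org.springframework.batch.core.StepContribution;
    import org.springframework.batch.core.scope.context.ChunkContext;
    import org.springframework.batch.core.step.tasklet.Tasklet;
    import org.springframework.batch.repeat.RepeatStatus;

    public class UploadToHadoopTasklet implements Tasklet {

        @Override
        public RepeatStatus execute(StepContribution contribution, ChunkContext chunkContext) throws Exception {
            // do the whole unit of work in one go - no items, no commit intervals
            new HadoopUploader().uploadNewRecords();
            return RepeatStatus.FINISHED; // the step completes after a single pass
        }
    }

Each tasklet step still records its own StepExecution in the job repository, so you keep restartability and visibility even without the item paradigm.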

I don't see the problem. Your scenario seems like a classic application of Spring Batch to me.

  • read all new records (since the last execution) from some sql tables

Here, an item is a record.
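
For instance, a JdbcCursorItemReader can stream exactly those rows. A minimal sketch, assuming a source_table with a created_at column and a lastRun timestamp that you track yourself (e.g. from the job metadata); it would live in a @Configuration class:

    import java.util.Map;
    import javax.sql.DataSource;
    import org.springframework.batch.item.database.JdbcCursorItemReader;
    import org.springframework.context.annotation.Bean;
    import org.springframework.jdbc.core.ColumnMapRowMapper;

    // sketch: stream the rows added since the last execution as Map items;
    // the table/column names and the lastRun parameter are assumptions
    @Bean
    public JdbcCursorItemReader<Map<String, Object>> newRecordsReader(DataSource ds, java.sql.Timestamp lastRun) {
        JdbcCursorItemReader<Map<String, Object>> reader = new JdbcCursorItemReader<>();
        reader.setDataSource(ds);
        reader.setSql("SELECT * FROM source_table WHERE created_at > ?");
        reader.setPreparedStatementSetter(ps -> ps.setTimestamp(1, lastRun));
        reader.setRowMapper(new ColumnMapRowMapper()); // each row becomes a Map, no domain class needed
        return reader;
    }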

  • upload all the new records to hadoop

Same here.
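
A corresponding ItemWriter could push each chunk to HDFS. A rough sketch using the Hadoop FileSystem API; note that ItemWriter.write receives the chunk as a List in Spring Batch 2-4, and the target path, line format, and append support on the cluster are all assumptions:

    import java.util.List;
    import java.util.Map;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.springframework.batch.item.ItemWriter;

    public class HdfsItemWriter implements ItemWriter<Map<String, Object>> {

        private final FileSystem fs;
        private final Path target;

        public HdfsItemWriter(Configuration hadoopConf, String targetPath) throws Exception {
            this.fs = FileSystem.get(hadoopConf);
            this.target = new Path(targetPath);
        }

        @Override
        public void write(List<? extends Map<String, Object>> items) throws Exception {
            // append one line per record; create the file on the first chunk
            try (FSDataOutputStream out = fs.exists(target) ? fs.append(target) : fs.create(target)) {
                for (Map<String, Object> item : items) {
                    out.writeBytes(item + "\n");
                }
            }
        }
    }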

  • run a series of map-reduce (pig) jobs on all the data (old and new)

Sounds like a StepListener or ChunkListener.
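
For example, a StepExecutionListener could kick off the pig scripts once the upload step completes. A sketch that just shells out to the pig CLI; the script path is a placeholder:

    import org.springframework.batch.core.ExitStatus;
    import org.springframework.batch.core.StepExecution;
    import org.springframework.batch.core.StepExecutionListener;

    public class PigJobsListener implements StepExecutionListener {

        @Override
        public void beforeStep(StepExecution stepExecution) {
            // nothing to prepare before the step
        }

        @Override
        public ExitStatus afterStep(StepExecution stepExecution) {
            try {
                // run the pig script over all the data (old and new)
                Process pig = new ProcessBuilder("pig", "-f", "/scripts/process_all.pig")
                        .inheritIO()
                        .start();
                return pig.waitFor() == 0 ? ExitStatus.COMPLETED : ExitStatus.FAILED;
            } catch (Exception e) {
                return ExitStatus.FAILED;
            }
        }
    }

Running the pig jobs as their own tasklet step instead of a listener would also work, and keeps them independently restartable.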

  • download all the output to local and run some other local processing on all the output

That's the next step.


The only problem I see is if you don't have domain objects for your records. But even then, you can work with maps or arrays while still using ItemReaders and ItemWriters.
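
Putting it together, a map-based chunk step might be wired like this, reusing the reader and writer sketches above (pre-5.0 StepBuilderFactory style; the bean name and the commit interval of 1000 are arbitrary choices):

    import java.util.Map;
    import org.springframework.batch.core.Step;
    import org.springframework.batch.core.configuration.annotation.StepBuilderFactory;
    import org.springframework.batch.item.database.JdbcCursorItemReader;
    import org.springframework.context.annotation.Bean;

    @Bean
    public Step fetchAndUploadStep(StepBuilderFactory steps,
                                   JdbcCursorItemReader<Map<String, Object>> reader,
                                   HdfsItemWriter writer) {
        return steps.get("fetchAndUpload")
                .<Map<String, Object>, Map<String, Object>>chunk(1000) // commit interval
                .reader(reader)
                .writer(writer)
                .build();
    }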
