I work at a place where scalding writes are augmented with a specific API to track dataset meta data. When converting from normal writes to these spec ...
I work at a place where scalding writes are augmented with a specific API to track dataset meta data. When converting from normal writes to these spec ...
I'm installing Scalding and sbt on my system but running command sbt assembly gives the following error: ...
I have a list of list of strings and I want to concatenate all the unique strings into a single (delimited space) string, something that flatMap allow ...
I am new to scalding world. My scalding job will have multiple stages, and I need to tune each stage individually. I have found that we might be able ...
Now that SpyGlass is no longer being maintained, what is the recommended way to access HBase using Scala/Scalding? A similar question was asked in 201 ...
My scalding job is translated into 9 map reduce jobs (m/r jobs). It's not easy for me to understand which part of code each m/r job represents. Is the ...
I am trying to upgrade a scalding job running on CDH 4.5 to CDH 5.5.1. The job uses json4s to parse through json data. I am getting the below error wh ...
I don't find any documention about MonoidAggregator. What is it for ? An example of its use: forAll return a MonoidAggregator. Whould it be roug ...
I have a Scalding job packed in fatjar and running on EMR Hadoop cluster. Recently I added new feature requiring DynamoDB connection inside map. But a ...
I have a scalding job that looks like this: import com.twitter.scalding.{Args, Csv, Job, TextLine} When it runs, I get the following error: Thi ...
I'm trying to get Scalding working on Zeppelin while using YARN. I followed the steps in the docs here to build the interpreter and set up the classpa ...
I experience an issue these days, i am trying to read from multiple files using scalding and create an output with a single file. My code is this: ...
I'd like to aggregate a bunch of values that belong to a particular category into an HLL data structure so I can carry out intersections and unions la ...
Suppose there is following map reduce job Mapper: setup() initializes some state map() add data to state, no output cleanup() ouput state to conte ...
Are the following two code blocks equivalent in terms of performances? and... Specifically, is Scalding going to optimize the code and execute a ...
I have a val in the format of TypedPipe[(Long, Long)], how do I switch left and right columns around? More clearly, how to create a new val with left ...
I am using scalding to do a simple word count type of things. I get an error when using partial function to expand on the tuple. The exact error messa ...
I'm building Scalding job using Scala 2.10.4. Its successfully creating the job. But when I run the job in my Hortonworks it throwing the following ex ...
I have a TypedTipe[(String, String, Long)] where the first String can assume only a limited (~10) number of values. I'd like to partition my output so ...
I have the following input tuple that I'd like to flatMap: (String, List[String]) E.G. Input: Needed output: Is there an elegant way to do thi ...