I'm new to Dataproc and am trying to submit a Pig job to Google Dataproc via gcloud with the property file below. Below is a sample of the Pig script ...
I need to add BigDecimals in Hadoop. I'm currently using Apache Pig BigDecimalWritable but Pig seems to be completely outdated. This version is 5 y ...
I want to split the string in a dataset that is joined by a forward slash (/) into a new line. The example dataset is: I want the result to be: Th ...
Spark, Hadoop, Tez, etc. all have a list of properties that can be manually configured. Example: yarn.nodemanager.resource.memory-mb or spark.execu ...
I'm just getting started with Pig and I'm facing lots of issues with running my first program. Any help is much appreciated. I've tried resolving usi ...
I have several files (around 10 files) which I would like to merge together in Pig: I am aware that I could merge two datasets together by: Is t ...
This is what I have done so far. The error I am getting: Input(s): Failed to read data from "hdfs://localhost:9000/mike/users.txt" Failed to read ...
I have a file called test.txt with the records as below (disregard the dots): (tab as field separator) My pig script (test.pig): I run the scri ...
Create employee and dept tables for the available files emp1.csv and dept.csv. Colnames: Emp: Empno, name, sal, did, branch, dno Dept: deptno, na ...
I have a "solution.pig" file which contains all load, join and dump queries. I need to run them by typing "solution.pig" in grunt> and save all the ...
Assume there is a text file named abalone_data with 3 attributes: name, gender and length, where M is male, F is female and I is infant. The questi ...
I'm trying to read a file with pig and I have the error indicated in the title. data = LOAD '/user/cloudera/pigexample/commands' USING PigSorage('\n' ...
I am migrating a Pig script to PySpark, and since I am new to PySpark I am stuck at data loading. My Pig script looks like: Bag1 = LOAD '/refined/em/em_re ...
My pig statement generates the following output: But I want to store above output as below in pig: Is there any way to extract the very first el ...
I have the data below in a .csv file: I saw in a Stack Overflow question a way to remove the header using FILTER. So, when I load this file ...
I am new to Pig programming. I have one txt file with a comma (,) as the delimiter. The amount columns, i.e. amt_IN and amy_OUT, are of type chararray with da ...
ERROR pig.Main: ERROR 2998: Unhandled internal error. com.google.common.base.Preconditions.checkArgument(ZLjava/lang/String;Ljava/lang/Object;)V WAR ...
Is it possible to use or query data from HDFS, using Pig, Drill, Tableau or some other tool, when that data was inserted/loaded through a Hive managed table; o ...
I have a requirement where I need to JOIN a tweets table with person names, i.e. filter the tweets that contain any person name. I have the following ...
(r1797386) compiled Jun 02 2017, 15:41:58 org.apache.hadoop.conf.Configuration.deprecation - mapred.job.tracker is deprecated. Instead, use mapre ...