Tag[hadoop-yarn] Recent Newest Questions

Dataproc CPU usage too low even though all the cores got used

Issue: I run a spark job that uses up all the cores on all the nodes and yet in the Dataproc CPU monitoring graph the CPU usage touches a max of 12% ...

Yarn allocates only 1 core per container. Running spark on yarn

Please ensure dynamic allocation is not killing your containers while you monitor the YARN UI. See the answer below Issue: I can start the SparkSessi ...

How can I get job configuration in command line?

I get get running apps with this yarn application -appStates RUNNING then I get one applicationID from list. then I can get status of app with this: ...

New datanode not tranferring data from existing hadoop cluster

I have followed up the tutriolpoint guide and completed every step on setting up a new node into an existing hadoop cluster. But I am facing difficult ...

I cannot run a haddop jar on Hadoop 3.0.0-cdh6.3.2

I have a machine with Hadoop 3.0.0-cdh6.3.2 installed . I ran this And show me this error: I set with this value I didn't change the yarn- ...

do we need to install spark on yarn to read data from HDFS into Py Spark?

I am having a Hadoop 3.1.1 multi-node cluster, i want to make use of PySpark to read files from my HDFS into PySpark for ETL operations and then load ...

Where are the spark intermediate files stored on the disk?

During a shuffle, the mappers dump their outputs to the local disk from where it gets picked up by the reducers. Where exactly on the disk are those f ...

Getting Java Heap out of memory issue in Pyspark

I have tried to read Multiple CSV files with a size of around 100MB using the pandas package and try to convert the file into Spark.sql.data frame and ...

How to kill the youngest task in Hadoop Fair Scheduler

I have very interesting use-case. I’m running Apache Hadoop distribution latest version, with yarn. The use-case is long computational jobs, that most ...

why i can't install any package in my project?

[this is the problem here] (https://i.stack.imgur.com/b4wFI.png) I have tried both git bash and PowerShell. But can't?? Please help me find this prob ...

hadoop multi node with spark sample job

I have just configured spark on my Hadoop cluster and i want to run the spark sample job. before that I want to understand what, this below job code s ...

Problem/issues with Yarn (yarn.js) version upgrade

When trying to upgrade the yarn version form “0.28.4” to “1.22.19” with to buffer an output in the given scenario , but it’s not working with updated ...

NoSuchMethodError: org/apache/hadoop/mapreduce/util/MRJobConfUtil.setTaskLogProgressDeltaThresholds

I am getting the following error while executing a mapreduce job in my hadoop cluster (distributed cluster). I found the error below in the applicati ...

How can i run multiple queries in parallel on hive with tez execution engine?

We want to run hive with tez for querying data in hdfs as multiple users will query hive so we need to configure hive in such a way so that the querie ...

Definition of yarn queue capacity

If I search for a generic definition of "capacity", Oxford languages says, "the maximum amount that something can contain". If I ask yarn for the stat ...

Hadoop MapReduce job failing in launch_container.sh

MapReduce job is failing with following error even though JAVA_HOME is set. I am trying to setup hadoop (3.3.4) on my Mac M1. I have set JAVA_HOME ...

YARN add new queue or clear default queue

I'm running YARN on an EMR cluster. mapred queue -list returns: How do I clear this queue or add a new one? I've been looking for a while now and ...

Kill process signal from Airflow to spark/yarn

Do someone know how next feature can be implemented? When you marked task in airflow as failed this stops Airflow process however this didn't stoped a ...

how to address HADOOP_CONF_DIR files(yarn-site.xml, ...) from client to remote hdfs directory

I have one unique Yarn cluster which is used by many remote clients that submits spark applications to it. I need to set HADOOP_CONF_DIR environment v ...

Metaplex-master on github only has Readme file

I am trying to set up a Solana candy machine. I am using the Hasplips Metaplex-master but it only has one readme file. Its supposed to have a js folde ...