I am trying to initialize an Apache Spark instance on Windows 10 to run a local test. My problem is that during the initialization of the Spark instance, I ...
I get the running apps with this: yarn application -appStates RUNNING, then I take one applicationID from the list. Then I can get the status of the app with this: ...
I'm trying to understand the details of terasort. After using Hadoop's teragen, I can successfully create the input data files. Many online docum ...
I am trying to get data from Azure Data Lake Gen2 (storage account) using .NET Spark (3.0.1) and Hadoop (3.2.2). I was following the Microsoft tutorial and ...
I followed the Tutorialspoint guide and completed every step for adding a new node to an existing Hadoop cluster. But I am facing difficult ...
Hi, I am new to Hive and Kerberos. I have some Hive jobs which run longer than the lifetime of the ticket. How can I configure Hive so that when I start hive sh ...
I want to drop the whole column 'index' (column name and values) in Impala: index case_id customer_id ...
I have a machine with Hadoop 3.0.0-cdh6.3.2 installed. I ran this, and it showed me this error: I set with this value I didn't change the yarn- ...
I understand, based on the slides, that in the context of Hadoop, ZooKeeper is used for storing information about the Master and the status of different tasks ...
SQL novice here, trying to perform a delete operation using Hive syntax and ACID transformations. I have two delta files that I have brought in as dataf ...
I want to train a model on a compute node but using the data (parquet format) from a storage cluster (HDFS). And I cannot copy-paste the whole dataset ...
I'm trying to run this command: hadoop jar hadoop-*test*.jar TestDFSIO -read -nrFiles 100 -fileSize 80MB -resFile /TestDFSIOwrite.txt But I'm getting zsh ...
I'm trying to export a table from Teradata into a file in my HDFS using TDCH. I'm using the parameters below: It's working, but I need the header ...
I'm a Hadoop newcomer trying to export data from Hive to Oracle. Can Sqoop update data in an Oracle table? Let's say the Oracle table has columns A, B, C, D, E I ...
I have a directory "SmallFiles" that contains 8 files. I archived them using "hadoop archive -archiveName myArch.har -p /Files/SmallFiles /Files" then ...
I can navigate from node to node with an ssh connection without any problems, for example from parasilo-1 to parasilo-10. cat ~/.ssh/id_rsa.pub >> ...
How many files does a PySpark parquet write generate? I have read that the output is one file per in-memory partition. However, this does not seem to ...
After creating a Hadoop cluster that provides data to a Cassandra database, I would like to integrate into the Hadoop architecture some Machine Learni ...
I am building a Flink pipeline and based on live input data need to read records from archive files in a RichFlatMapFunction (e.g. each day I want to ...
I'm trying to open a video file that is stored on HDFS with OpenCV VideoCapture. This is a case using Hadoop RecordReader and I can find the file but ...