Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

Monitor a cluster of nodes

Running EMR example, getting 301 Error

how to create a symlink on a hdfs cluster?

hadoop hdfs symlink

Hive, Beeline: Peer indicated failure: GSS initiate failed

hadoop hive

org.datanucleus.exceptions.NucleusUserException: Error : Could not find API definition for name "JDO"

Physical memory usage keeps increasing for Spark application on YARN

Spark-submit how to set the user.name

hadoop apache-spark hadoop2

Running tensorflow with file on HDFS (cannot find libhdfs.so)

python hadoop tensorflow

Hadoop streaming job using Mxnet failing in AWS Emr

Hive: Unable to insert data in table with 100 or more partition columns Error: in column "PART_NAME" that has maximum length of 767

hadoop hive cloudera

java.io.InvalidClassException: org.apache.spark.internal.io.HadoopMapReduceCommitProtocol; local class incompatible

Ingest log files from edge nodes to Hadoop

Java Read Parquet File to JSON Output

Is there a compatibility matrix for Hadoop components?

apache-spark hadoop

Got error when run command `hbase classpath`

hadoop hbase

How to "update" a column using pig latin

hadoop apache-pig

Flume agent - can I specify compression like gzip or bz2?

hadoop agent cloudera flume

Looking for overall review on Hadoop

hadoop cloud mapreduce hdfs

Computing set intersection and set difference of the records of two files with hadoop

Unable to increase Max Application Master Resources

docker hadoop hadoop-yarn