Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

org.apache.hadoop.mapred.LocalClientProtocolProvider not found

java hadoop apache-pig

Hbase master keeps dying, claims a hbase:namespace already exists

hadoop hbase

Load large csv in hadoop via Hue would only store a 64MB block

hadoop hive cloudera hue impala

What is the difference between apache Ambari Server and Agent

hadoop bigdata

RHbase/thrift install issue

r hadoop osx-yosemite

Standard practices for logging in MapReduce jobs

Hive transform using Python: Unable to initialize custom script

python hadoop hive

Key of object type in the hadoop mapper

java hadoop mapreduce

Hadoop setting the HADOOP_HOME correctly to bin/hadoop it gives command not found

hadoop

Spark NotSerializableException

java hadoop apache-spark

What happens when the intermediate output does not fit in RAM in Spark

hadoop apache-spark rdd

Startin HBase Shell - Zookeeper exists but fails

Why my BroadcastHashJoin is slower than ShuffledHashJoin in Spark

hadoop apache-spark hive

Connect to Impala using impyla client with Kerberos auth

Error Loading CSV data into a Hive table

hadoop hive hiveql

Spark coalesce relationship with number of executors and cores

Is Hive faster than Spark?

Spark SQL "Limit"

Java Copying File in HDFS to another Directory in HDFS

java hadoop hdfs

Is it the driver or the workers who reads the text file when sc.textfile is used?