New posts in hadoop

What are some good resources for studying Hadoop's source code?

hadoop

Out of memory error in Mapreduce shuffle phase

hadoop mapreduce

Difference between String.getBytes() and Bytes.toBytes(String data)

java hadoop hbase

List directories (and their subdirectories) with hadoop command line

hadoop command-line

What additional benefit does Yarn bring to the existing map reduce?

Is caching the only advantage of spark over map-reduce?

caching hadoop apache-spark

Difference between HDFS and NFS? [closed]

What do job.setOutputKeyClass and job.setOutputReduceClass refer to?

java hadoop mapreduce

Pig: Get top n values per group

hadoop hdfs apache-pig

Timestamp Based Scans in HBase?

hadoop hbase

select count distinct using pig latin

hadoop apache-pig

Connection Error in Apache Pig

hadoop apache-pig

yarn is not honouring yarn.nodemanager.resource.cpu-vcores

In a hadoop cluster, should hive be installed on all nodes?

"Combiner" Class in a mapreduce job

Dropping multiple tables with same prefix in Hive

hadoop hive hiveql

Is Snappy splittable or not splittable?

hadoop snappy

Aggregate Resource Allocation for a job in YARN

hadoop hadoop-yarn

Passing arguments to Hadoop mappers

hadoop mapreduce

Apache Helix vs YARN