
New posts in hadoop

Why do we need ZooKeeper in the Hadoop stack?

Ports are not available: listen tcp 0.0.0.0/50070: bind: An attempt was made to access a socket in a way forbidden by its access permissions

SparkSQL vs Hive on Spark - Difference and pros and cons?

Why does spark-shell fail with NullPointerException?

scala hadoop apache-spark

Thrift, Avro, Protocol Buffers - Are they all dead?

Setting the number of map tasks and reduce tasks

hadoop mapreduce

How to get started with Big Data Analysis [closed]

python r hadoop bigdata

Free Large datasets to experiment with Hadoop

resources hadoop opendata

Datanode process not running in Hadoop

Datanode does not start correctly

hadoop hadoop2

Cascading examples failed to compile?

Spark on yarn concept understanding

Cleanest way in Gradle to get the path to a jar file in the gradle dependency cache

jar hadoop dependencies gradle

What is the best way to start and stop the Hadoop ecosystem from the command line?

hadoop

How to get the input file name in the mapper in a Hadoop program?

hadoop mapreduce

Why is HBase a better choice than Cassandra with Hadoop?

Schema evolution in parquet format

How to write 'map only' hadoop jobs?

hadoop mapreduce

COLLECT_SET() in Hive, keep duplicates?

Default Namenode port of HDFS is 50070. But I have come across 8020 or 9000 in some places [closed]

hadoop hdfs