New posts in hadoop

Hadoop partitioner

Performance issue in Hive version 0.13.1

Hadoop backup and recovery tool and guidance

hadoop

How to insert JSON in HDFS using Flume correctly

json hadoop flume flume-ng

Connecting to Hive via Beeline using Kerberos keytab

hadoop hive kerberos keytab

Is there a way to change the replication factor of RDDs in Spark?

Add Yarn cluster configuration to Spark application

Hadoop in the AWS free tier?

How to read parquet files using `ssc.fileStream()`? What are the types passed to `ssc.fileStream()`?

How can I read in a binary file from hdfs into a Spark dataframe?

Install Spark on an existing Hadoop cluster

linux hadoop apache-spark

Distributed alternatives to hadoop

Conditional join in Hive

hadoop hive

Oozie coordinator action rerun from fail nodes

How to configure monopolistic FIFO application queue in YARN?

hadoop hadoop-yarn

pickle.PicklingError: args[0] from __newobj__ args has the wrong class with hadoop python

How to set a range for the LIMIT clause in Hive

hadoop hive

Failed to start NameNode

java hadoop

Spark: unable to load native-hadoop library for platform

java apache-spark hadoop

Is there a canonical problem that provably can't be aided with map/reduce?

hadoop mapreduce apache-pig