Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

"Connection refused" Error for Namenode-HDFS (Hadoop Issue)

hadoop hdfs

What is the maximum value for mapreduce.task.io.sort.mb?

Why Hadoop or Spark? There is ElasticSearch

How can I debug a pig script

hadoop apache-pig bigdata

How can I list subdirectories recursively for HDFS?

list hadoop find hdfs

Duplicate columns in Spark Dataframe

r csv hadoop apache-spark sparkr

Structure Difference between partitioning and bucketing in hive

Hadoop HDFS maximum file size

hadoop hdfs

Partition Hive table by existing field?

Hadoop read multiple lines at a time

hadoop

Hadoop slowstart configuration

hadoop

Why is Maven trying to compile my code as -source 1.3?

maven hadoop mahout

Name Node stores what?

hadoop mapreduce hdfs bigdata

Hadoop log4j not working as No appenders could be found for logger (org.apache.hadoop.metrics2.lib.MutableMetricsFactory)

hadoop log4j

GlusterFS or Ceph as backend for Hadoop

hadoop ceph glusterfs

Spark + Scala transformations, immutability & memory consumption overheads

scala hadoop apache-spark

Difference between 'distcp' and 'distcp -update'?

hadoop mapreduce hdfs

Filter a string on the basis of a word

hadoop apache-pig

How can I concatenate two files in hadoop into one using Hadoop FS shell?

shell hadoop concatenation

What does CPU Time for a Hadoop Job signify?

hadoop timing benchmarking