Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

How to list all row keys in an hbase table?

rest hadoop hbase stargate

Run a sqoop job on a specific queue

hadoop queue sqoop

HDFS fsck command shows health as corrupt for '/'

hadoop hdfs

How Mapper and Reducer works together "without" sorting?

set applicationTags property in YARN for jobs submitted by CLI

hadoop hadoop-yarn

Streamsets solrcloud on CDH 5.7 unable to connect to Solr

How to control the number of hadoop streaming output files

hadoop hadoop-streaming

Usecases/experience of JavaScript for HPC (High Performance Computing)

javascript hadoop hpc

Hadoop Streaming Command Failure with Python Error

How to connect Apache Spark with Yarn from the SparkContext?

Apache Spark Dataframe How to turn off partial aggregation when using groupBy?

Importing multi-level directories of logs in hadoop/pig

hadoop hdfs apache-pig

Hadoop: How to output different format types in the same job?

hadoop mapreduce gzip lzo

Generating multiple lines output from single line input in pig

hadoop apache-pig

Resources for learning about NoSql/Non relational databases