
New posts in hadoop-streaming

Sorting by value in Hadoop from a file

How to resolve java.lang.RuntimeException: PipeMapRed.waitOutputThreads(): subprocess failed with code 2?

EMR How to join files into one?

How to decide when to use a Map-Side Join or a Reduce-Side Join while writing MR code in Java?

Hadoop Configuration Error

Hadoop Throws ClassCastException for the keytype of java.nio.ByteBuffer

Running the Python Code on Hadoop Failed

Can I force my reducers (copy phase) to start only when all mappers are completed?

Amazon Elastic MapReduce - SIGTERM

Python MapReduce Hadoop Streaming Job that requires multiple input files?

Hive FAILED: ParseException line 2:0 cannot recognize input near ''macaddress'' 'CHAR' '(' in column specification

hadoop, python, subprocess failed with code 127

POC for Hadoop in real time scenario

Map Reduce output to CSV or do I need Key Values?

hadoop 2.4.0 streaming generic parser options using TAB as separator

Processing images using hadoop

Pass directories not files to hadoop-streaming?

What is the difference between Rack-local map tasks and Data-local map tasks?

Python hadoop streaming : Setting a job name

How to get the name of input file in MRjob