New posts in hadoop

Computing set intersection and set difference of the records of two files with hadoop

Python Streaming: how to reduce to multiple outputs? (It's possible with Java, though)

Does hive instantiate a new UDF object for each record?

hadoop hive

Handling big data sets (Neo4j, MongoDB, Hadoop)

mongodb hadoop neo4j

How to maintain data entry id in Mahout K-means clustering

apache hadoop mahout k-means

Setting additional classpath for a hadoop tool

jar hadoop classpath

Is Mapper Object of Hadoop Shared across Multiple Threads?

multithreading hadoop

Adding Data Node to hadoop cluster

hadoop

Is it possible to run hadoop fs -getmerge in S3?

Hive JDBC getConnection does not return

jdbc hadoop hive

Converting CSV to SequenceFile

hadoop mahout sequencefile

Hadoop mapreduce has "Cannot resolve the host name" error

hadoop mapreduce

Hive: adding rows to existing table

hadoop hive

MapReduce: keep input ordering

hadoop mapreduce

Hive action failing in Oozie (on Cloudera CDH 4.1.1)

hadoop hive cloudera oozie

Split reduced data into output and new input in Hadoop

java hadoop split mapreduce

Hadoop - how to use and reduce multiple inputs?

java hadoop mapreduce

InputSplit customization in Hadoop

hadoop

Strange Jackson Illegal character ((CTRL-CHAR, code 0)) Exception in Map Reduce Combiner

Permission Denied error while running start-dfs.sh