New posts in hadoop

Accessing hadoop from remote machine

java hadoop

Using Spark for sequential row-by-row processing without map and reduce

hadoop apache-spark pyspark

How to use Mahout in a Windows environment?

windows cygwin hadoop mahout

Java or Python distributed compute job (on a student budget)?

java python nlp hadoop nltk

Can I get individually sorted Mapper outputs from Hadoop when using zero Reducers?

hadoop mapreduce

Hadoop Streaming Job Failed (Not Successful) in Python

Hadoop seems to modify my key object during an iteration over values of a given reduce call

rsync files to hadoop

hadoop rsync

NullPointerException from Hadoop's JobSplitWriter / SerializationFactory when calling InputSplit's getClass()

Enum value implementing Writable interface of Hadoop

java hadoop enums

Doubts about page rank

Merging two datasets in Pig

hadoop apache-pig piglet

HBase region server shutdown

hadoop hbase

Can I rename the Oozie job name dynamically?

hadoop oozie

Hadoop MapReduce, Java implementation questions

java hadoop mapreduce

How to attach a debugger to a remote Hadoop instance

debugging hadoop jdb

Error connecting: <class 'thrift.transport.TTransport.TTransportException'> Could not connect to localhost:21000

hadoop hive impala

Which to use: Impala on HDFS, Impala on HBase, or just HBase?

hadoop hbase hdfs impala

PySpark --py-files doesn't work

python hadoop apache-spark emr

Viewing the number of blocks for a file in hadoop

hadoop hdfs