Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

Map Reduce Slot Definition

read json key-values with hive/sql and spark

Export HDFS file with custom delimiter into Mysql via Sqoop

mysql hadoop hdfs sqoop

How can I use Oozie workflow configuration property in the workflow itself?

hadoop hive oozie

Remove directory level when transferring from HDFS to S3 using S3DistCp

Why HDFS not preferred with applications that require low latency?

hadoop apache-spark hdfs hawq

java.lang.VerifyError with Hadoop

java hadoop

Hadoop on Windows. YARN fails to start with java.lang.UnsatisfiedLinkError

hadoop hadoop-yarn

Hdfs file timestamp

datetime hadoop hdfs

How to append ORC file

java hadoop hive orc

YARN shell command to get number of containers and vcores used by running applications

hadoop hadoop-yarn

Unable to connect to HIVE2 via JAVA

java hadoop jdbc hive hiveql

Accessing hadoop from remote machine

java hadoop

Using Spark for sequential row-by-row processing without map and reduce

hadoop apache-spark pyspark

How to use Mahout in a Windows environment?

windows cygwin hadoop mahout

Java or Python distributed compute job (on a student budget)?

java python nlp hadoop nltk

Can I get invidually sorted Mapper outputs from Hadoop when using zero Reducers?

hadoop mapreduce

Hadoop Streaming Job Failed (Not Successful) in Python

Hadoop seems to modify my key object during an iteration over values of a given reduce call

Viewing the number of blocks for a file in hadoop

hadoop hdfs