Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

Difference between reduce task and a reducer

NullPointerException in Spark RDD map when submitted as a spark job

Why extracting an argument in spark to local variable is considered safer?

multiple insert into a table using Apache Spark

what is driver memory and executor memory in spark? [duplicate]

apache-spark bigdata

Creating Impala external table from a partitioned file structure

hadoop bigdata cloudera impala

Basic addition in Tensorflow?

Appending the datetime to the end of every line in a 600 million row file

awk sed bigdata

Table with heavy writes and some reads in Cassandra. Primary key searches taking 30 seconds. (Queue)

cassandra bigdata

key validation class type in cassandra UTF8 or LongType?

java nosql cassandra bigdata

Store pig result in a text file

hadoop apache-pig hdfs bigdata

How do you ingest Spring boot logs directly into elastic

append multiple columns to existing dataframe in spark

Programmatically set the MaxItemsInObjectGraph

c# .net wcf bigdata

Pythonic way to send contents of a file to a pipe and count # lines in a single step

python bash shell awk bigdata

Should I move to NoSQL? (big data)

Hive: Fatal error when trying to create dynamic partitions

hadoop hive bigdata hiveql

What is the exact difference between Spark Local and Standalone mode? [duplicate]