Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

How to sort an RDD of tuples with 5 elements in Spark Scala?

scala sorting apache-spark rdd

Spark ExecutorLostFailure

apache-spark

Stack Overflow while processing several columns with a UDF

first_value windowing function in pyspark

Advantage of setting name to RDD

scala apache-spark

Copy schema from one dataframe to another dataframe

In Apache Spark 2.0.0, is it possible to fetch a query from an external database (rather than grab the whole table)?

check if a row value is null in spark dataframe

Replace all ":" with "_" in Spark dataframe [duplicate]

Querying json object in dataframe using Pyspark

Scala & Spark: Cast multiple columns at once

scala apache-spark

How to parse CSV file with UTF-8 encoding?

csv apache-spark unicode

Spark on YARN + Secured hbase

How to use --num-executors option with spark-submit?

apache-spark hadoop-yarn

How to Generate Parquet File Using Pure Java (Including Date & Decimal Types) And Upload to S3 [Windows] (No HDFS)

Pyspark 'NoneType' object has no attribute '_jvm' error

DataFrame object has no attribute 'col'

apache-spark

Pandas scalar UDF failing, IllegalArgumentException

Storing a Graph in Spark Graphx with HDFS

apache-spark spark-graphx

Apache Spark Exception in thread "main" java.lang.NoClassDefFoundError: scala/collection/GenTraversableOnce$class