Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-dataset

Spark Streamming : Reading data from kafka that has multiple schema

Spark SQL alternatives to groupby/pivot/agg/collect_list using foldLeft & withColumn so as to improve performance

No Java class corresponding to Product with Serializable with Base found

Spark-SQL Joining two dataframes/ datasets with same column name

java.lang.UnsupportedOperationException: Error in spark when writing

Spark using recursive case class

using DataSet.repartition in Spark 2 - several tasks handle more than one partition

Spark Java - Collect multiple columns into array column

Spark Datasets - strong typing

Using stat.bloomFilter in Spark 2.0.0 to filter another dataframe

spark convert dataframe to dataset using case class with option fields

How to create a Dataset of Maps?

Spark Dataset equivalent for scala's "collect" taking a partial function

How to convert Dataset into JavaPairRDD?

How to create a Dataset from custom class Person?

Array Intersection in Spark SQL

How to join two spark dataset to one with java objects?

How to create a custom Encoder in Spark 2.X Datasets?

How to change case of whole column to lowercase?