apache-spark-sql tutorials

Spark SQL performance - JOIN on value BETWEEN min and max

Nov 01, 2021

Cannot create dataframe from list: pyspark

Sep 18, 2022

python apache-spark pyspark apache-spark-sql

UDF to extract only the file name from path in Spark SQL

Jul 11, 2022

java scala apache-spark apache-spark-sql spark-dataframe

How to find mean of grouped Vector columns in Spark SQL?

Sep 29, 2018

apache-spark apache-spark-sql aggregate-functions user-defined-functions apache-spark-ml

Apache Spark subtract days from timestamp column

May 26, 2020

apache-spark dataframe apache-spark-sql timestamp

How to extract number from string column?

Dec 13, 2019

scala apache-spark apache-spark-sql

filter only not empty arrays dataframe spark [duplicate]

Oct 30, 2022

scala apache-spark apache-spark-sql

Filter out rows with NaN values for certain column

Oct 29, 2022

scala apache-spark apache-spark-sql

Calculate a grouped median in pyspark

Nov 14, 2022

apache-spark pyspark apache-spark-sql

GenericRowWithSchema exception in casting ArrayBuffer to HashSet in DataFrame to RDD from Hive table

Jul 04, 2019

scala apache-spark hive apache-spark-sql apache-spark-1.3

JSON file parsing in Pyspark

Sep 08, 2022

apache-spark dataframe pyspark apache-spark-sql pyspark-sql

How to check if array column is inside another column array in PySpark dataframe

Jun 26, 2022

apache-spark dataframe pyspark apache-spark-sql pyspark-sql

How to concatenate/append multiple Spark dataframes column wise in Pyspark?

Jul 02, 2022

python apache-spark pyspark apache-spark-sql pyspark-sql

How to convert empty arrays to nulls?

Aug 20, 2022

apache-spark pyspark apache-spark-sql pyspark-sql

How to create a Dataset from custom class Person?

Jan 23, 2017

apache-spark apache-spark-sql apache-spark-dataset

Spark getnewargs error ... Method or([class java.lang.String]) does not exist

Apr 15, 2022

apache-spark pyspark apache-spark-sql

How to set YARN queue for spark-shell?

Aug 21, 2022

apache-spark apache-spark-sql

Pyspark: Replace all occurrences of a value with null in dataframe

Sep 05, 2022

apache-spark pyspark apache-spark-sql pyspark-dataframes

How do I use "not rlike" in spark-sql?

Mar 23, 2022

scala apache-spark apache-spark-sql

Count the number of non-null values in a Spark DataFrame

Aug 13, 2022

scala apache-spark apache-spark-sql count null

New posts in apache-spark-sql