apache-spark-sql tutorials

Selecting only numeric/string columns names from a Spark DF in pyspark

Dec 21, 2017

PySpark - Adding a Column from a list of values using a UDF

Oct 03, 2019

python list apache-spark pyspark apache-spark-sql

spark partition data writing by timestamp

Oct 25, 2022

scala apache-spark apache-spark-sql

spark error RDD type not found when creating RDD

Jun 12, 2019

scala apache-spark apache-spark-sql

What is the best way to define custom methods on a DataFrame?

May 28, 2021

scala apache-spark apache-spark-sql

Apply same function to all fields of spark dataframe row

Aug 25, 2022

apache-spark apache-spark-sql

Pyspark: Replacing value in a column by searching a dictionary

Nov 03, 2022

python apache-spark dataframe pyspark apache-spark-sql

Making histogram with Spark DataFrame column

Aug 11, 2022

python pandas apache-spark pyspark apache-spark-sql

how to cast all columns of dataframe to string

Nov 10, 2022

apache-spark pyspark apache-spark-sql

Spark streaming multiple sources, reload dataframe

Nov 17, 2021

postgresql apache-spark spark-streaming apache-spark-sql

Spark java Issue creating row with java.util.Map type

Jul 17, 2020

java apache-spark apache-spark-sql

Efficient text preprocessing using PySpark (clean, tokenize, stopwords, stemming, filter)

Apr 18, 2020

python apache-spark pyspark apache-spark-sql text-processing

Is Spark SQL UDAF (user defined aggregate function) available in the Python API?

Feb 13, 2017

apache-spark apache-spark-sql spark-dataframe

Caching ordered Spark DataFrame creates unwanted job

Nov 17, 2022

python apache-spark pyspark apache-spark-sql pyspark-sql

How to change the attributes order in Apache SparkSQL `Project` operator?

Oct 02, 2021

scala apache-spark apache-spark-sql

Hive partitioned table reads all the partitions despite having a Spark filter

Apr 11, 2022

scala apache-spark hive apache-spark-sql

How to cache a Spark data frame and reference it in another script

Oct 07, 2017

apache-spark pyspark apache-spark-sql pyspark-sql

Spark DataFrame mapPartitions

Oct 27, 2022

python apache-spark pyspark apache-spark-sql

Apache Spark SQL UDAF over window showing odd behaviour with duplicate input

Sep 24, 2021

apache-spark apache-spark-sql

java.sql.SQLException: No suitable driver found when loading DataFrame into Spark SQL

Aug 12, 2020

scala jdbc apache-spark apache-spark-sql

New posts in apache-spark-sql