apache-spark-sql tutorials

Pyspark: How to chain Column.when() using a dictionary with reduce?

Feb 06, 2026

Spark Iceberg table merge into update all

Feb 08, 2026

apache-spark-sql apache-iceberg

Pyspark convert array of key/value structs into single struct

Feb 07, 2026

apache-spark pyspark apache-spark-sql

Incomprehensible result of a comparison between a string and null value in PySpark

Feb 08, 2026

apache-spark pyspark apache-spark-sql null

Aggregate data from different micro batches in Spark streaming

Feb 06, 2026

apache-spark pyspark spark-streaming apache-spark-sql

How to change the schema of a DataFrame (to fix the names of some nested fields)?

Feb 07, 2026

scala apache-spark apache-spark-sql

Pyspark - from_unixtime not showing the correct datetime

Feb 06, 2026

apache-spark pyspark timestamp apache-spark-sql

How to convert from SparkR to sparklyr?

Feb 05, 2026

r apache-spark-sql sparkr sparklyr

Spark SQL - How do i set a variable within the query, to re-use throughout?

Feb 06, 2026

apache-spark apache-spark-sql azure-databricks

Create a column in a PySpark dataframe using a list whose indices are present in one column of the dataframe

Feb 04, 2026

python arrays pyspark apache-spark-sql

Convert a JSON string to a struct column without schema in Spark

Feb 04, 2026

scala apache-spark struct apache-spark-sql schema

Adaptive Query Execution and Shuffle Partitions

Feb 05, 2026

apache-spark pyspark apache-spark-sql spark3

How to get length of complex datatype column in hive

Feb 04, 2026

hadoop hive apache-spark-sql

Comparing two array columns in Scala Spark

Feb 05, 2026

scala apache-spark apache-spark-sql subset array-column

Read spark csv with empty values without converting to null

Feb 04, 2026

dataframe apache-spark apache-spark-sql

Window function acts not as expected when I use Order By (PySpark)

Feb 04, 2026

pyspark apache-spark-sql window-functions

Filter column with two different schemas in spark scala

Feb 04, 2026

apache-spark apache-spark-sql

New posts in apache-spark-sql