apache-spark-sql tutorials

Aggregate data from different micro batches in Spark streaming

Feb 06, 2026

How to change the schema of a DataFrame (to fix the names of some nested fields)?

Feb 07, 2026

scala apache-spark apache-spark-sql

Pyspark - from_unixtime not showing the correct datetime

Feb 06, 2026

apache-spark pyspark timestamp apache-spark-sql

How to convert from SparkR to sparklyr?

Feb 05, 2026

r apache-spark-sql sparkr sparklyr

Spark SQL - How do i set a variable within the query, to re-use throughout?

Feb 06, 2026

apache-spark apache-spark-sql azure-databricks

Create a column in a PySpark dataframe using a list whose indices are present in one column of the dataframe

Feb 04, 2026

python arrays pyspark apache-spark-sql

Convert a JSON string to a struct column without schema in Spark

Feb 04, 2026

scala apache-spark struct apache-spark-sql schema

Adaptive Query Execution and Shuffle Partitions

Feb 05, 2026

apache-spark pyspark apache-spark-sql spark3

How to get length of complex datatype column in hive

Feb 04, 2026

hadoop hive apache-spark-sql

Comparing two array columns in Scala Spark

Feb 05, 2026

scala apache-spark apache-spark-sql subset array-column

Read spark csv with empty values without converting to null

Feb 04, 2026

dataframe apache-spark apache-spark-sql

Window function acts not as expected when I use Order By (PySpark)

Feb 04, 2026

pyspark apache-spark-sql window-functions

Filter column with two different schemas in spark scala

Feb 04, 2026

apache-spark apache-spark-sql

.isin() with a column from a dataframe

Feb 04, 2026

pyspark apache-spark-sql

Does ordering a column before partitioning make a difference

Feb 04, 2026

apache-spark pyspark apache-spark-sql databricks partitioning

Does SparkSession always use Hive Context?

Feb 02, 2026

apache-spark hive apache-spark-sql

Can I use Spark DataFrame inside regular Spark map operation?

Feb 01, 2026

apache-spark pyspark apache-spark-sql

New posts in apache-spark-sql