apache-spark tutorials and guides

Why does Zeppelin fail with "mismatched input ';' expecting <EOF>" in %spark.sql paragraph?

Feb 28, 2026

org.apache.spark.sql.AnalysisException: cannot resolve given input column

Feb 28, 2026

apache-spark dataframe apache-spark-sql

Scala: Convert xml dataframe to csv file

Feb 28, 2026

xml scala csv intellij-idea apache-spark

How to append collection as new column to DataFrame with many columns?

Feb 28, 2026

scala dataframe apache-spark functional-programming apache-spark-sql

Missing data when ordering Pyspark Window

Feb 28, 2026

apache-spark pyspark apache-spark-sql

How to implement Slowly Changing Dimensions (SCD2) Type 2 in Spark using SQL Join

Feb 27, 2026

apache-spark apache-spark-sql

How to flatten long dataset to wide format (pivot) with no join?

Feb 27, 2026

apache-spark pyspark apache-spark-sql

Efficiently calculate top-k elements in spark

Feb 27, 2026

apache-spark apache-spark-sql window-functions rank approximation

Shutdown Hook for spark batch application

Feb 27, 2026

scala apache-spark

Pyspark java.lang.OutOfMemoryError: Requested array size exceeds VM limit

Feb 26, 2026

python scala hadoop apache-spark pyspark

How To Apply Multiple Conditions on Case-Otherwise Statement Using Spark Dataframe API

Feb 24, 2026

r apache-spark dataframe apache-spark-sql

What does the sbt assembly documentation mean by "already part of the container?"

Feb 25, 2026

scala apache-spark jar sbt sbt-assembly

Left outer join not emitting null values when joining two streams in spark structured streaming 2.3.0

Feb 26, 2026

scala apache-spark spark-structured-streaming

Streaming query not showing any progress in Spark

Feb 26, 2026

scala apache-spark spark-structured-streaming

In Spark scala dataframe how do i get week end date based on week number

Feb 25, 2026

scala apache-spark

How to use columns to create queries (e.g. WHERE clause)?

Feb 25, 2026

apache-spark pyspark apache-spark-sql

Why Spark streaming creates batches with 0 events?

Feb 25, 2026

apache-spark

New posts in apache-spark