apache-spark tutorials and guides

How can I use databricks utils functions in PyCharm? I can't find appropriate pip package

Jun 26, 2026

spark-submit python file ‘home/.python-eggs’ permission denied

Jun 26, 2026

python apache-spark

Decimal precision for Spark Dataset case class Encoder

Jun 25, 2026

scala apache-spark apache-spark-dataset

Compare column values in consecutive rows in Scala

Jun 26, 2026

scala apache-spark dataframe dataset pyspark

How can i sort a Vector of objects in Scala?

Jun 25, 2026

scala sorting apache-spark

How to convert from Pandas' DatetimeIndex to DataFrame in PySpark?

Jun 26, 2026

apache-spark pyspark apache-spark-sql

SparkSQL: not found value expr

Jun 26, 2026

scala apache-spark sbt apache-spark-sql

generating DataFrames in for loop in Scala Spark cause out of memory

Jun 25, 2026

sql scala apache-spark

Best practice for keeping local vs. test vs. production configuration properties in Spark/Scala

Jun 25, 2026

scala configuration apache-spark sbt

Casting RDD to a different type (from float64 to double)

Jun 24, 2026

python apache-spark pyspark types rdd

How to convert binary to string (UUID) without UDF in Apache Spark (PySpark)?

Jun 24, 2026

python apache-spark pyspark binary

Dot product in pyspark dataframes with MLLIB

Jun 25, 2026

python apache-spark pyspark apache-spark-mllib

Convert pyspark dataframe into list of python dictionaries

Jun 25, 2026

python apache-spark pyspark

java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.azurebfs.SecureAzureBlobFileSystem not found

Jun 25, 2026

java apache-spark hadoop kubernetes azure-data-lake-gen2

Is Star Schema (data modelling) still relevant with the Lake House pattern using Databricks?

Jun 25, 2026

apache-spark bigdata databricks azure-databricks databricks-sql

Delta lake and ADLS Gen2 transactions

Jun 24, 2026

azure apache-spark databricks azure-data-lake azure-data-lake-gen2

Adding new column using other existing columns Spark/Scala

Jun 24, 2026

scala dataframe apache-spark apache-spark-sql

more efficient way to get monthly counts in Python/Pyspark

Jun 23, 2026

python sql apache-spark pyspark apache-spark-sql

New posts in apache-spark