apache-spark tutorials and guides

Spark Launcher waiting for job completion infinitely

Feb 20, 2022

How to turn off INFO from logs in PySpark with no changes to log4j.properties?

Sep 15, 2022

python apache-spark pyspark

how to use Regexp_replace in spark

Sep 15, 2022

scala apache-spark apache-spark-sql regexp-replace

Spark Implicit $ for DataFrame

Nov 25, 2018

scala apache-spark implicit-conversion

spark off heap memory config and tungsten

Mar 07, 2022

apache-spark apache-spark-sql spark-dataframe apache-spark-2.0 off-heap

It is possible to start an embedded instance of apache Spark node?

Oct 15, 2022

java mapreduce apache-spark

Is caching the only advantage of spark over map-reduce?

May 15, 2022

caching hadoop apache-spark

When does shuffling occur in Apache Spark?

Aug 17, 2022

mapreduce apache-spark

Stackoverflow due to long RDD Lineage

Dec 31, 2021

scala apache-spark rdd

How to check version of Spark and Scala in Zeppelin?

Oct 18, 2022

scala apache-spark version apache-zeppelin

ETL in Java Spring Batch vs Apache Spark Benchmarking

Mar 23, 2022

spring spring-boot apache-spark spring-batch etl

Modify collection inside a Spark RDD foreach

Sep 12, 2022

scala apache-spark rdd

PySpark — UnicodeEncodeError: 'ascii' codec can't encode character

Sep 15, 2022

python python-2.7 apache-spark pyspark

Replace missing values with mean - Spark Dataframe

Sep 15, 2022

scala apache-spark dataframe apache-spark-sql imputation

Spark-Submit: --packages vs --jars

Sep 22, 2022

java scala apache-spark cassandra

How do you perform basic joins of two RDD tables in Spark using Python?

Aug 29, 2022

python join apache-spark pyspark rdd

Spark RDD default number of partitions

Oct 19, 2022

scala apache-spark

How can I get the current SparkSession in any place of the codes?

Aug 01, 2022

scala apache-spark

Not able to import Spark Implicits in ScalaTest

Sep 15, 2022

scala apache-spark apache-spark-sql implicit scalatest

How to read only n rows of large CSV file on HDFS using spark-csv package?

Sep 15, 2022

apache-spark pyspark hdfs apache-spark-sql spark-csv

New posts in apache-spark