apache-spark tutorials and guides

How do Spark scheduler pools work when running on YARN?

Feb 19, 2022

Converting the date pattern in a Spark DataFrame

Nov 09, 2022

scala apache-spark spark-dataframe

How to convert RDD[Row] to RDD[String]

Oct 19, 2019

scala apache-spark

What is the fastest way to count the number of entries in a DataFrame?

Jun 17, 2022

scala apache-spark apache-spark-sql

Apache Spark startup error on Alpine Linux Docker

Aug 28, 2021

apache-spark docker alpine alpine-linux

Spark Scala Dataframe convert a column of Array of Struct to a column of Map

Nov 11, 2022

scala apache-spark apache-spark-sql

Dummy encoding using PySpark [duplicate]

Nov 12, 2022

apache-spark encoding pyspark dummy-variable

How to create a Dataset of Maps?

Nov 19, 2022

scala apache-spark apache-spark-sql apache-spark-dataset apache-spark-encoders

Spark Structured Streaming with HBase integration

Aug 28, 2022

scala apache-spark apache-kafka hbase spark-streaming

How does Spark 2.0 handle column nullability?

Jun 13, 2022

apache-spark pyspark apache-spark-sql apache-spark-2.0

Spark: Extracting summary for a ML logistic regression model from a pipeline model

Sep 27, 2022

python apache-spark pyspark pipeline logistic-regression

PySpark: add a character in the middle of a string

Oct 01, 2022

python apache-spark split pyspark

How to implement Functor[Dataset]

Jan 11, 2022

scala apache-spark scala-cats scala-implicits apache-spark-encoders

Understanding Kryo serialization buffer overflow error

Nov 17, 2022

scala apache-spark kryo

Using UDF ignores condition in when

Oct 15, 2022

python apache-spark pyspark spark-dataframe user-defined-functions

Spark: select with key in map

Apr 19, 2022

apache-spark apache-spark-sql

How to bucketize a group of columns in PySpark?

Jun 29, 2022

python apache-spark pyspark

ERROR : User did not initialize spark context

May 06, 2022

apache-spark hadoop

Why does Spark's Word2Vec return a vector?

Jul 15, 2022

java apache-spark machine-learning word2vec apache-spark-ml

Set spark configuration

Feb 27, 2022

python-3.x apache-spark pyspark apache-spark-sql
