apache-spark-sql tutorials

Splitting a row in a PySpark Dataframe into multiple rows

Nov 03, 2021

What is an optimized way of joining large tables in Spark SQL

Nov 07, 2022

apache-spark apache-spark-sql

Where is the reference for options for writing or reading per format?

Jun 24, 2021

apache-spark apache-spark-sql apache-spark-1.6

Spark - Creating Nested DataFrame

Oct 29, 2020

python apache-spark dataframe pyspark apache-spark-sql

spark sql current timestamp function

Sep 16, 2022

apache-spark apache-spark-sql

Spark 2.0: Relative path in absolute URI (spark-warehouse)

Mar 06, 2021

windows apache-spark pyspark apache-spark-sql pyspark-sql

Convert comma separated string to array in pyspark dataframe

Apr 06, 2022

python apache-spark dataframe pyspark apache-spark-sql

How do I convert a WrappedArray column in spark dataframe to Strings?

Sep 10, 2022

scala apache-spark dataframe apache-spark-sql user-defined-functions

Use collect_list and collect_set in Spark SQL

Sep 16, 2022

apache-spark hive apache-spark-sql

Spark, Scala, DataFrame: create feature vectors

Jun 20, 2019

scala apache-spark apache-spark-sql apache-spark-ml

How to filter based on array value in PySpark?

Nov 12, 2022

python apache-spark dataframe pyspark apache-spark-sql

How to use groupBy to collect rows into a map?

Sep 24, 2022

apache-spark apache-spark-sql

Does SparkSQL support subquery?

Mar 06, 2019

sql apache-spark subquery apache-spark-sql

How to filter column on values in list in pyspark?

Sep 16, 2022

apache-spark pyspark apache-spark-sql spark-dataframe pyspark-sql

Spark Scala: Cannot up cast from string to int as it may truncate

Feb 13, 2022

scala apache-spark apache-spark-sql

Convert a pandas dataframe to a PySpark dataframe [duplicate]

Sep 16, 2022

python-3.x pandas pyspark apache-spark-sql pyspark-sql

Spark SQL case insensitive filter for column conditions

Sep 16, 2022

apache-spark apache-spark-sql

How to add multiple columns using UDF?

Oct 31, 2022

apache-spark pyspark apache-spark-sql

Spark SQL broadcast hash join

Jan 14, 2018

apache-spark apache-spark-sql

Writing more than 50 millions from Pyspark df to PostgresSQL, best efficient approach

Oct 17, 2022

postgresql apache-spark pyspark apache-spark-sql bigdata

New posts in apache-spark-sql