apache-spark-sql tutorials

How to use approxQuantile by group?

Sep 02, 2021

apache-spark apache-spark-sql apache-spark-dataset

How to set jdbc/partitionColumn type to Date in spark 2.4.1

Sep 05, 2022

apache-spark apache-spark-sql databricks

PySpark DataFrame - Join on multiple columns dynamically

Oct 29, 2021

python apache-spark dataframe pyspark apache-spark-sql

pyspark createdataframe: string interpreted as timestamp, schema mixes up columns

Feb 22, 2020

apache-spark pyspark apache-spark-sql pyspark-sql

Use Map to replace column values in Spark

Sep 02, 2022

scala apache-spark apache-spark-sql

How to check if a Spark data frame struct Array contains a specific value

Oct 20, 2022

apache-spark apache-spark-sql

Round double values and cast as integers

Apr 13, 2022

python apache-spark pyspark apache-spark-sql rounding

reading data from URL using spark databricks platform

Aug 29, 2022

scala apache-spark pyspark apache-spark-sql databricks

Spark: What is the difference between repartition and repartitionByRange?

Aug 23, 2022

apache-spark pyspark apache-spark-sql

How to rename column names in spark SQL

Apr 23, 2022

dataframe apache-spark-sql spark-dataframe

Merge two spark sql columns of type Array[string] into a new Array[string] column

Nov 15, 2022

scala apache-spark apache-spark-sql user-defined-functions

Split Time Series pySpark data frame into test & train without using random split

Aug 11, 2022

python pyspark apache-spark-sql rdd

Methods of max() and sum() undefined in the Java Spark Dataframe API (1.4.1)

Mar 23, 2022

java apache-spark-sql spark-dataframe

How can we JOIN two Spark SQL dataframes using a SQL-esque "LIKE" criterion?

Oct 19, 2022

python apache-spark apache-spark-sql pyspark

Spark SQL and MySQL- SaveMode.Overwrite not inserting modified data

Oct 29, 2022

mysql apache-spark dataframe apache-spark-sql

How to create SQLContext in spark using scala?

Oct 28, 2022

scala apache-spark sbt apache-spark-sql

Why spark tell me “ name 'sqlContext' is not defined ”, how can I use sqlContext?

Feb 04, 2022

apache-spark apache-spark-sql

How to zip two array columns in Spark SQL

Sep 16, 2022

python pandas apache-spark pyspark apache-spark-sql

Spark SQL has no SparkSqlParser.scala file when compiling in intelliJ idea

Apr 18, 2022

scala intellij-idea apache-spark apache-spark-sql

Why does posexplode fail with "AnalysisException: The number of aliases supplied in the AS clause does not match the number of columns..."?

Oct 03, 2022

apache-spark apache-spark-sql spark-dataframe

New posts in apache-spark-sql