apache-spark-sql tutorials

How to use "cube" only for specific fields on Spark dataframe?

May 05, 2021

How to split comma separated string and get n values in Spark Scala dataframe?

Oct 25, 2022

scala apache-spark dataframe apache-spark-sql spark-dataframe

PySpark equivalent of function "typedLit" from Scala API

Aug 22, 2022

scala apache-spark pyspark apache-spark-sql

Spark DataFrames with Parquet and Partitioning

Sep 08, 2019

apache-spark apache-spark-sql parquet

Group by and order by in Spark SQL

Oct 14, 2022

apache-spark apache-spark-sql

CodeGen grows beyond 64 KB error when normalizing large PySpark dataframe

Dec 09, 2021

apache-spark pyspark apache-spark-sql pyspark-sql window-functions

Read parquet into spark dataset ignoring missing fields [duplicate]

Dec 14, 2019

apache-spark apache-spark-sql parquet apache-spark-dataset apache-spark-2.0

How to get the number of records written (using DataFrameWriter's save operation)?

Nov 03, 2022

scala apache-spark apache-spark-sql

Connection from Spark to snowflake

Jun 21, 2022

apache-spark apache-spark-sql databricks snowflake-cloud-data-platform

Pyspark: How to convert a spark dataframe to json and save it as json file?

Nov 02, 2022

python-3.x pyspark apache-spark-sql pyspark-sql

Comparing two data frames in Spark (performance)

Sep 15, 2022

java scala performance apache-spark apache-spark-sql

How we save a Huge pyspark dataframe?

Apr 08, 2022

apache-spark pyspark apache-spark-sql

Implementing a recursive algorithm in pyspark to find pairings within a dataframe

Oct 26, 2022

python apache-spark pyspark apache-spark-sql

Spark SQL 1.5 build failure

Sep 15, 2022

maven build apache-spark apache-spark-sql

How to get an Iterator of Rows using Dataframe in SparkSQL

Aug 31, 2022

apache-spark apache-spark-sql

How to perform "Lookup" operation on Spark dataframes given multiple conditions

Nov 02, 2022

scala apache-spark dataframe apache-spark-sql lookup

Use the result from Cross tab (spark dataframe) for chi-square test in SparkMlib

Oct 18, 2020

python apache-spark pyspark apache-spark-sql apache-spark-mllib

Zeppelin - Cannot query with %sql a table I registered with pyspark

Jun 10, 2022

apache-spark pyspark apache-spark-sql apache-zeppelin

Bulk data migration through Spark SQL

Dec 22, 2019

apache-spark apache-spark-sql spark-dataframe

SparkSQL on HBase Tables

May 08, 2022

apache-spark hadoop apache-spark-sql hbase

New posts in apache-spark-sql