apache-spark-sql tutorials

How to select all columns instead of hard coding each one?

Nov 27, 2019

apache-spark pyspark apache-spark-sql

How to delete rows in a table created from a Spark dataframe?

Sep 15, 2022

apache-spark pyspark apache-spark-sql

how to calculate max value in some columns per row in pyspark

Aug 10, 2022

python apache-spark pyspark apache-spark-sql

Where is the union() method on the Spark DataFrame class?

Mar 29, 2021

java apache-spark dataframe apache-spark-sql

Dividing complex rows of dataframe to simple rows in Pyspark

Aug 28, 2022

python apache-spark dataframe pyspark apache-spark-sql

pyspark py4j.Py4JException: Method and([class java.lang.Integer]) does not exist

Mar 26, 2022

apache-spark pyspark apache-spark-sql

How to limit decimal values to 2 digits before applying agg function?

Oct 31, 2022

scala apache-spark apache-spark-sql apache-spark-1.5

Find column index by searching column header of a Dataset in Apache Spark Java

Sep 13, 2022

java apache-spark apache-spark-sql apache-spark-dataset

Spark Failure : Caused by: org.apache.spark.shuffle.FetchFailedException: Too large frame: 5454002341

Sep 26, 2022

apache-spark apache-spark-sql hadoop-yarn

Spark java.lang.ClassCastException: scala.collection.mutable.WrappedArray$ofRef cannot be cast to java.util.ArrayList

May 22, 2022

scala apache-spark apache-spark-sql

How to filter a Spark dataframe by a boolean column?

Nov 20, 2022

python apache-spark filter apache-spark-sql

Can I read a CSV represented as a string into Apache Spark using spark-csv

May 25, 2022

apache-spark apache-spark-sql spark-csv

How to calculate Median in spark sqlContext for column of data type double

May 19, 2022

apache-spark hive apache-spark-sql

How to replace NULL to 0 in left outer join in SPARK dataframe v1.6

May 24, 2022

scala apache-spark apache-spark-sql apache-spark-1.6

How to register UDF to use in SQL and DataFrame?

May 20, 2022

scala apache-spark apache-spark-sql user-defined-functions

How to check if a Hive table exists using PySpark

May 20, 2022

python-2.7 pyspark apache-spark-sql

Spark Dataset unique id performance - row_number vs monotonically_increasing_id

Jun 08, 2022

scala apache-spark apache-spark-sql apache-spark-dataset

Convert between spark.SQL DataFrame and pandas DataFrame [duplicate]

Sep 11, 2022

apache-spark apache-spark-sql apache-zeppelin

Get the last element from Apache Spark SQL split() Function

May 11, 2022

apache-spark-sql

Why does DataFrame.saveAsTable("df") save table to different HDFS host?

Dec 21, 2019

hadoop apache-spark hdfs apache-spark-sql

New posts in apache-spark-sql