apache-spark-sql tutorials

Spark : Size exceeds Integer.MAX_VALUE When Joining 2 Large DFs

Mar 30, 2021

scala apache-spark apache-spark-sql

Changing column data type to factor with sparklyr

Sep 05, 2022

r apache-spark dplyr apache-spark-sql sparklyr

How to add jdbc drivers to classpath when using PySpark?

Aug 23, 2022

pyspark apache-spark-sql

When to execute REFRESH TABLE my_table in spark?

Oct 26, 2022

apache-spark hive apache-spark-sql

PySpark.sql.filter not performing as it should

May 15, 2022

python-2.7 apache-spark pyspark apache-spark-sql spark-dataframe

What problems can arise from a Spark non-deterministic Pandas UDF

Oct 23, 2022

python pandas apache-spark pyspark apache-spark-sql

Derby version mismatch between Spark and Hive : Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient

Nov 04, 2022

apache-spark apache-spark-sql

Spark SQL package not found

Dec 08, 2018

java maven apache-spark apache-spark-sql

Re-using A Schema from JSON within a Spark DataFrame using Scala

Mar 09, 2022

json scala apache-spark apache-spark-sql

How to do non-random Dataset splitting on Apache Spark?

Jun 06, 2022

apache-spark apache-spark-sql apache-spark-dataset apache-spark-2.0

How to find first non-null values in groups? (secondary sorting using dataset api)

Feb 06, 2022

apache-spark apache-spark-sql apache-spark-dataset

Can we able to use mulitple sparksessions to access two different Hive servers

Sep 08, 2022

scala apache-spark hive apache-spark-sql

Does Spark do one pass through the data for multiple withColumn?

Oct 20, 2022

scala apache-spark apache-spark-sql

java.lang.AssertionError: assertion failed: No plan for HiveTableRelation

Jul 17, 2021

scala apache-spark amazon-s3 hive apache-spark-sql

Spark : Union can only be performed on tables with the compatible column types. Struct<name,id> != Struct<id,name>

Sep 19, 2022

apache-spark struct apache-spark-sql union

How to use transform higher-order function?

Feb 10, 2022

apache-spark apache-spark-sql

Why is Scala's Symbol not accepted as a column reference?

Oct 21, 2019

scala apache-spark-sql

Zeppelin SqlContext registerTempTable issue

Sep 15, 2022

scala apache-spark apache-spark-sql apache-zeppelin

Saving / exporting transformed DataFrame back to JDBC / MySQL

Apr 11, 2022

apache-spark apache-spark-sql apache-spark-1.5

Spark - How can get the Logical / Physical Query execution using - Thirft - Hive Interactor

Jan 30, 2022

apache-spark apache-spark-sql spark-dataframe

New posts in apache-spark-sql