
New posts in apache-spark-sql

Spark SQL top n per group

Add months to date column in Spark dataframe

How to select multiple columns of dataset, given a list of column names?

Spark decimal type precision loss

How to find the max String length of a column in Spark using dataframe?

Spark: How to aggregate/reduce records based on time difference?

Joining two spark dataframes on time (TimestampType) in python

SPARK SQL Equivalent of Qualify + Row_number statements

Spark dataframes: Extract a column based on the value of another column

Avro Schema to spark StructType

How to load specific Hive partition in DataFrame Spark 1.6?

Convert PySpark dataframe column type to string and replace the square brackets

Spark DataSet filter performance

Creating/accessing dataframe inside the transformation of another dataframe

How to concatenate a string to a column in Spark?

How to create a Row from a given case class?

Parquet vs Delta format in Azure Data Lake Gen 2 store

Spark SQL: automatic schema from csv

How to use countDistinct in Scala with Spark?

How to implement NOT IN for two DataFrames with different structure in Apache Spark