Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

Scala Patch function

scala apache-spark

How to change the schema of a DataFrame (to fix the names of some nested fields)?

Pyspark - from_unixtime not showing the correct datetime

How to add a new column to a Delta Lake table?

explode a row of spark dataset into several rows with added column using flatmap

Spark fails to merge parquet files (INTEGER -> DECIMAL)

Spark SQL - How do i set a variable within the query, to re-use throughout?

Spark - Csv data split with scala

scala csv apache-spark

Spark "Failed to construct kafka consumer" via SSL

R dplyr filter rows on numeric values for given column

r apache-spark dplyr

Convert a JSON string to a struct column without schema in Spark

How to parse string to array in Spark?

arrays json apache-spark

Adaptive Query Execution and Shuffle Partitions

How to partition a single RDD into multiple RDD in spark [duplicate]

scala apache-spark

Building spark-jobserver Using SBT and Scala