
New posts in apache-spark-sql

Spark: What is the difference between repartition and repartitionByRange?

How to rename column names in Spark SQL

Merge two spark sql columns of type Array[string] into a new Array[string] column

Split Time Series pySpark data frame into test & train without using random split

Methods of max() and sum() undefined in the Java Spark Dataframe API (1.4.1)

How can we JOIN two Spark SQL dataframes using a SQL-esque "LIKE" criterion?

Spark SQL and MySQL- SaveMode.Overwrite not inserting modified data

How to create SQLContext in spark using scala?

Why does Spark tell me "name 'sqlContext' is not defined", and how can I use sqlContext?

How to zip two array columns in Spark SQL

Spark SQL has no SparkSqlParser.scala file when compiling in IntelliJ IDEA

Why does posexplode fail with "AnalysisException: The number of aliases supplied in the AS clause does not match the number of columns..."?

Meaning of Exchange in Spark Stage

How to convert timestamp column to epoch seconds?

Spark DataFrame: Computing row-wise mean (or any aggregate operation)

Spark SQL - Select all AND computed columns?

How do I truncate a PySpark dataframe of timestamp type to the day?

Spark Scala: How to convert DataFrame[vector] to DataFrame[f1: Double, ..., fn: Double]

Remove blank space from data frame column values in Spark