New posts in apache-spark-sql

How to query datasets in avro format?

Hive and SparkSQL do not support datetime type?

sql hive apache-spark-sql

What's the difference between Dataset.col() and functions.col() in Spark?

How to transpose/pivot the rows data to column in Spark Scala? [duplicate]

Counting number of nulls in pyspark dataframe by row

spark: How does salting work in dealing with skewed data

How to calculate size of dataframe in spark scala

compute string length in Spark SQL DSL

How to get default property values in Spark

Spark 2.0 DataSets groupByKey and divide operation and type safety

Spark Dataframes- Reducing By Key

How to use Scala UDF in PySpark?

Scala/Spark dataframes: find the column name corresponding to the max

Apache Spark how to append new column from list/array to Spark dataframe

How to flatten columns of type array of structs (as returned by Spark ML API)?

Spark: Return empty column if column does not exist in dataframe

Apache Spark startsWith in SQL expression

Spark AnalysisException when "flattening" DataFrame in Spark SQL

How to find the max value of multiple columns?

Spark Convert Data Frame Column to dense Vector for StandardScaler() "Column must be of type org.apache.spark.ml.linalg.VectorUDT"