Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-sql

save dataframe as external hive table

Apache Spark - Backend servers

What is the faster way to count the number of entries in a data frame?

Spark Scala Dataframe convert a column of Array of Struct to a column of Map

How to create a Dataset of Maps?

How does Spark 2.0 handle column nullability?

Spark: select with key in map

Set spark configuration

select latest record from spark dataframe

apache-spark-sql

PySpark explode stringified array of dictionaries into rows

Convert UTC timestamp to local time based on time zone in PySpark

Stream-Static Join: How to refresh (unpersist/persist) static Dataframe periodically

Spark DataFrame created from JavaRDD<Row> copies all columns data into first column

How is it possible to add new column to existing Dataframe in Spark SQL

Broadcast not happening while joining dataframes in Spark 1.6

How to drop rows with too many NULL values?

Pyspark : Custom window function

How to add new columns to DataFrame given their names when they are missing?

How to write rows asynchronously in Spark Streaming application to speed up batch execution?

spark-sql Table or view not found error