Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-sql

spark convert dataframe to dataset using case class with option fields

How do I flatMap a row of arrays into multiple rows?

UPDATE Cassandra table using spark cassandra connector

Spark DataFrame filtering: retain element belonging to a list

When registering a table using the %pyspark interpreter in Zeppelin, I can't access the table in %sql

SparkSQL sql syntax for nth item in array

How do I collect a List of Strings from spark DataFrame Column after a GroupBy operation?

Spark remove duplicate rows from DataFrame [duplicate]

save dataframe as external hive table

Apache Spark - Backend servers

What is the faster way to count the number of entries in a data frame?

Spark Scala Dataframe convert a column of Array of Struct to a column of Map

How to create a Dataset of Maps?

How does Spark 2.0 handle column nullability?

Spark: select with key in map

Set spark configuration

select latest record from spark dataframe

apache-spark-sql

PySpark explode stringified array of dictionaries into rows

Convert UTC timestamp to local time based on time zone in PySpark

Stream-Static Join: How to refresh (unpersist/persist) static Dataframe periodically