Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-sql

How to calculate the power of 2 for the column of DataFrame

why does spark appends 'WHERE 1=0' at the end of sql query

Save the parquet output file with fixed size in spark

Spark's .count() function is different to the contents of the dataframe when filtering on corrupt record field

How do I groupby and concat a list in a Dataframe Spark Scala

Spark & Scala: saveAsTextFile() exception

contains pyspark SQL: TypeError: 'Column' object is not callable

How to show my existing column name instead '_c0', '_c1', '_c2', '_c3', '_c4' in first row?

Spark Parquet read error : java.io.EOFException: Reached the end of stream with XXXXX bytes left to read

Using pyspark, how to expand a column containing a variable map to new columns in a DataFrame while keeping other columns?

Pyspark filter dataframe if column does not contain string

Weird behaviour with spark-submit

How does Spark DataFrame handles Pandas DataFrame that is larger than memory

java.lang.UnsupportedOperationException: 'Writing to a non-empty Cassandra Table is not allowed

How to convert DataFrame columns from string to float/double in PySpark 1.6?

How to select constant values from Dataframe coding in Java

How to indicate the database in SparkSQL over Hive in Spark 1.3

pyspark, Compare two rows in dataframe

How to specify multiple tables in Spark SQL?

Spark SQL - JAVA syntax of CASE-THEN?