Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

how to fix Illegal Parquet type: INT64 (TIMESTAMP_MICROS) error

Skip/Take with Spark SQL

Gather in sparklyr

r apache-spark dplyr sparklyr

Spark 1.3.0: Running Pi example on YARN fails

How to materialize an RDD explicitly in Spark

apache-spark

Get field values from a structtype in pyspark dataframe

apache-spark pyspark

Read a csv into an RDD using Spark 2.0

Programmatically Rename All But One Column Spark Scala

java.lang.NoClassDefFoundError: com/amazonaws/auth/AWSCredentialsProvider

Why printing inside foreach doesn't reflect an order of elements

scala apache-spark

How to submit a job via REST API?

Flatten nested array in Spark DataFrame

python apache-spark pyspark

Joining rows from two dataframes with the closest point

Is Spark.read.csv() an Action or Transformation

python apache-spark pyspark

Can I give dataproc's log4j.properties file having log4j.appender.file.File as gcs path?