apache-spark tutorials and guides

Why inconsistent results using subtraction in reduce?

Dec 25, 2022

scala apache-spark

What is the difference between spark.task.cpus and --executor-cores

Dec 27, 2022

multithreading apache-spark

How to modify/transform the column of a dataframe?

Dec 26, 2022

python apache-spark pyspark apache-spark-sql

Why result of Spark reduceByKey is not consistent

Dec 25, 2022

scala hadoop apache-spark

Count of List values in spark - dataframe

Dec 27, 2022

scala apache-spark apache-spark-sql datastax-enterprise cassandra-2.1

Use library in Spark-shell

Dec 26, 2022

scala apache-spark

PySpark - Are Spark DataFrame Arrays Different Than Python Lists?

Dec 26, 2022

python apache-spark dataframe pyspark apache-spark-sql

Spark schema from case class with correct nullability

Dec 25, 2022

apache-spark apache-spark-sql apache-spark-ml apache-spark-dataset spark-csv

Difference between translate and regexp_replace

Dec 26, 2022

apache-spark apache-spark-sql

Joining more than 2 Tables In Spark SQL

Dec 26, 2022

apache-spark apache-spark-sql

Scala String Variable Substitution

Dec 26, 2022

scala apache-spark apache-spark-sql

Reading multiple csv files at different folder depths

Dec 26, 2022

scala csv apache-spark dataframe wildcard

How to replace elements of a breeze matrix in Scala based on some condition?

Dec 26, 2022

scala apache-spark scala-breeze

Why Spark ML ALS algorithm print RMSE = NaN?

Dec 26, 2022

scala apache-spark machine-learning

Getting a date x days back from a custom date in Scala

Dec 26, 2022

scala apache-spark

How to create DataFrame with nulls using toDF?

Dec 25, 2022

scala apache-spark apache-spark-sql

Using custome UDF withColumn in a Spark Dataset<Row>; java.lang.String cannot be cast to org.apache.spark.sql.Row

Dec 25, 2022

java apache-spark apache-spark-sql user-defined-functions apache-spark-dataset

Spark job fails on java 9 NumberFormatException for input string ea

Dec 26, 2022

java scala apache-spark java-9

How can dataframereader read http?

Dec 26, 2022

scala apache-spark intellij-idea apache-spark-sql hdfs

Spark Dataframe - Implement Oracle NVL Function while joining

Dec 25, 2022

scala apache-spark apache-spark-sql

New posts in apache-spark