Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in apache-spark

PicklingError: Could not serialize object: IndexError: tuple index out of range

Dec 01, 2025

python apache-spark pyspark rdd

How to load data into spark dataframe from text file without knowing the schema of the data?

Dec 01, 2025

java apache-spark apache-spark-sql

spark conditional replacement of values

Nov 30, 2025

apache-spark apache-spark-sql

Add all the dates (week) between two dates in new Row in spark Scala

Nov 30, 2025

scala apache-spark apache-spark-sql partitioning

Create a new column by replacing comma-separated column's values with a lookup based on another dataframe

Nov 29, 2025

python apache-spark pyspark apache-spark-sql

How is task distributed in spark

Nov 30, 2025

apache-spark distributed-system

How to read a Json file with a specific format with Spark Scala?

Nov 29, 2025

json scala apache-spark

How to get the latest date from listed dates along with the total count?

Nov 29, 2025

scala apache-spark apache-spark-sql

Spark saving RDD[(Int, Array[Double])] to text file got strange result

Nov 29, 2025

apache-spark apache-spark-mllib

How to make predictions with Linear Regression Model?

Nov 28, 2025

java apache-spark linear-regression apache-spark-ml

How to broadcast large variable to local disk of each node in Spark

Nov 29, 2025

hadoop apache-spark broadcast

Spark history server filter jobs by user id or time

Nov 29, 2025

apache-spark apache-spark-sql spark-streaming

Spark not able to find checkpointed data in HDFS after executor fails

Nov 29, 2025

apache-spark spark-streaming spark-checkpoint

Does PySpark code run in JVM or Python subprocess?

Nov 28, 2025

python apache-spark pyspark

Spark read JDBC from SAS IOM

Nov 29, 2025

apache-spark sas

Spark + Yarn: How to retain logs of lost-executors

Nov 28, 2025

hadoop logging apache-spark hadoop-yarn

« Newer Entries Older Entries »