New posts in apache-spark
How to flatten long dataset to wide format (pivot) with no join?
Feb 27, 2026
apache-spark
pyspark
apache-spark-sql
Efficiently calculate top-k elements in spark
Feb 27, 2026
apache-spark
apache-spark-sql
window-functions
rank
approximation
Shutdown Hook for spark batch application
Feb 27, 2026
scala
apache-spark
Pyspark java.lang.OutOfMemoryError: Requested array size exceeds VM limit
Feb 26, 2026
python
scala
hadoop
apache-spark
pyspark
How To Apply Multiple Conditions on Case-Otherwise Statement Using Spark Dataframe API
Feb 24, 2026
r
apache-spark
dataframe
apache-spark-sql
What does the sbt assembly documentation mean by "already part of the container?"
Feb 25, 2026
scala
apache-spark
jar
sbt
sbt-assembly
Left outer join not emitting null values when joining two streams in spark structured streaming 2.3.0
Feb 26, 2026
scala
apache-spark
spark-structured-streaming
Streaming query not showing any progress in Spark
Feb 26, 2026
scala
apache-spark
spark-structured-streaming
In a Spark Scala DataFrame, how do I get the week end date based on the week number?
Feb 25, 2026
scala
apache-spark
How to use columns to create queries (e.g. WHERE clause)?
Feb 25, 2026
apache-spark
pyspark
apache-spark-sql
Why does Spark Streaming create batches with 0 events?
Feb 25, 2026
apache-spark
PySpark direct streaming from Kafka
Feb 23, 2026
apache-spark
apache-kafka
pyspark
spark-streaming
Convert Rows or Columns to a DataFrame
Feb 25, 2026
scala
apache-spark
apache-spark-sql
data-manipulation
SparkR on Windows - Spark SQL is not built with Hive support
Feb 25, 2026
r
apache-spark
hive
sparkr
Must Spark Streaming finish processing the previous batch of data before it can process the next batch?
Feb 25, 2026
apache-spark
spark-streaming
Programmatically reduce log in a spark shell
Feb 25, 2026
scala
shell
apache-spark
Get multiple columns within a map: RDD
Feb 25, 2026
scala
apache-spark
rdd
Python Spark: how to find cumulative sum by group using the RDD API
Feb 25, 2026
python
apache-spark
pyspark
rdd
Creating a new scala class that relies on GraphFrames without serialization issues
Feb 24, 2026
scala
apache-spark
apache-spark-sql