Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
SparkSession does not pull down packages from repo in pytest suite
Oct 31, 2025
apache-spark
pyspark
pytest
StringType issue: Exception in thread "main" scala.MatchError: org.apache.spark.sql.types.StringType@
Nov 01, 2025
java
scala
apache-spark
Not able to retain the corrupted rows in pyspark using PERMISSIVE mode
Oct 31, 2025
python
csv
apache-spark
pyspark
Spark Join of 2 dataframes which have 2 different column names in list
Oct 31, 2025
scala
apache-spark
join
Understanding lambda function inputs in Spark for RDDs
Oct 31, 2025
python
apache-spark
lambda
pyspark
Create dictionary of each row in polars Dataframe
Oct 31, 2025
python
apache-spark
python-polars
How to decrease total timing processing of Spark SQL Execution plan
Oct 31, 2025
apache-spark
pyspark
apache-spark-sql
databricks
sql-execution-plan
Spark memory cache keeps increasing even with unpersist
Oct 30, 2025
apache-spark
caching
pyspark
memory-management
amazon-emr
How to deduplicate messages while streaming kafka using Spark Streaming?
Oct 31, 2025
apache-spark
duplicates
apache-kafka
spark-streaming
How to write streaming data to S3?
Oct 31, 2025
scala
amazon-web-services
apache-spark
amazon-s3
spark-streaming
How can I retrieve the alias for a DataFrame in Spark
Oct 31, 2025
apache-spark
apache-spark-sql
Logging in spark structured streaming
Oct 30, 2025
apache-spark
spark-structured-streaming
Join two RDDs on custom function - SPARK
Oct 30, 2025
python
join
apache-spark
pyspark
cluster-computing
Spark 2.3.1 AWS EMR not returning data for some columns yet works in Athena/Presto and Spectrum
Oct 31, 2025
apache-spark
amazon-emr
Is getNumPartitions an RDD action or transformation?
Oct 31, 2025
apache-spark
rdd
Why I get null results from date_format() PySpark function?
Oct 30, 2025
python
apache-spark
pyspark
Databricks - Failure starting repl. Try detaching and re-attaching the notebook
Oct 30, 2025
python
apache-spark
pyspark
databricks
cluster-computing
« Newer Entries
Older Entries »