Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Create a map column in Apache Spark from other columns
Oct 22, 2025
scala
apache-spark
apache-spark-sql
Spark Dataset cache is using only one executor
Oct 23, 2025
apache-spark
hadoop-yarn
parquet
replace for loop to parallel process in pyspark
Oct 23, 2025
python
apache-spark
pyspark
apache-spark-sql
How does toLocalIterator works?
Oct 23, 2025
apache-spark
hadoop
pyspark
hadoop2
Pyspark JSON string parsing - Error: ValueError: 'json' is not in list - no Pandas
Oct 23, 2025
json
apache-spark
pyspark
Load data with where clause in spark dataframe
Oct 22, 2025
scala
apache-spark
How to specify sql dialect when creating spark dataframe from JDBC?
Oct 23, 2025
apache-spark
jdbc
apache-spark-sql
apache-spark-2.0
vitess
Maximum number of concurrent tasks in 1 DPU in AWS Glue
Oct 23, 2025
amazon-web-services
apache-spark
apache-spark-sql
aws-glue
When will Spark clean the cached RDDs automatically?
Oct 23, 2025
apache-spark
caching
apache-spark-sql
rdd
Spark: Distribute low number of compute-intensive tasks via UDF
Oct 23, 2025
python
apache-spark
pyspark
databricks
azure-databricks
Dynamically infer Schema of returned object from UDF in pySpark
Oct 21, 2025
python
apache-spark
pyspark
apache-spark-sql
In build.sbt, dependencies in parent project not reflected in child modules
Oct 23, 2025
scala
apache-spark
module
sbt
Stop hadoop/EMR/AWS creating S3 paths with _$folder$ extensions
Oct 22, 2025
hadoop
amazon-web-services
amazon-s3
apache-spark
emr
How to write a Spark dataframe into Kinesis Stream?
Oct 22, 2025
scala
apache-spark
apache-kafka
kafka-consumer-api
amazon-kinesis
Is there a command to convert existing parquet data to Iceberg table in place?
Oct 22, 2025
apache-spark
delta-lake
apache-iceberg
Writing Parquet in Azure Blob Storage: "One of the request inputs is not valid"
Oct 22, 2025
scala
apache-spark
hadoop
azure-blob-storage
parquet
"The associated location already exists" when saving a Spark DataFrame with mode('overwrite') set
Oct 23, 2025
apache-spark
apache-spark-sql
Read fixed width file using schema from json file in pyspark
Oct 21, 2025
python
apache-spark
pyspark
apache-spark-sql
« Newer Entries
Older Entries »