New posts in apache-spark
How to get columns from an org.apache.spark.sql row by name?
Oct 26, 2025
scala
apache-spark
apache-spark-sql
spark-streaming
How should I load a file on S3 using Spark?
Oct 25, 2025
python
apache-spark
amazon-s3
pyspark
Combining csv files with mismatched columns
Oct 25, 2025
csv
apache-spark
pyspark
apache-spark-sql
data-analysis
Suppress messages from spark-submit when loading packages
Oct 25, 2025
apache-spark
ivy
spark-submit
How to create a table with a nested map on Databricks using SQL
Oct 24, 2025
sql
arrays
apache-spark
apache-spark-sql
databricks
Transposing a Spark DataFrame from row to column in PySpark and appending it with another DataFrame
Oct 23, 2025
python
dataframe
apache-spark
pyspark
transpose
Convert date to ISO week date in Spark
Oct 23, 2025
apache-spark
date
pyspark
apache-spark-sql
spark3
How can I append to the same file in HDFS (Spark 2.11)?
Oct 23, 2025
apache-spark
apache-spark-sql
spark-streaming
How to merge two rows in Spark SQL?
Oct 25, 2025
scala
apache-spark
apache-spark-sql
Writing Spark dataframe in ORC format with Snappy compression
Oct 24, 2025
amazon-s3
apache-spark
dataframe
orc
How to convert an RDD list of lists into one list in PySpark
Oct 24, 2025
list
apache-spark
pyspark
Can't use "update" in outputMode() when writing stream data in Spark
Oct 23, 2025
apache-spark
pyspark
databricks
delta-lake
Why does the Spark query plan show more partitions whenever cache (persist) is used?
Oct 23, 2025
apache-spark
pyspark
Split a column in multiple columns using Spark SQL
Oct 24, 2025
sql
apache-spark
apache-spark-sql
Google Dataproc Pyspark - BigQuery connector is super slow
Oct 24, 2025
apache-spark
pyspark
google-bigquery
google-cloud-dataproc
Databricks notebook time out error when calling other notebooks: com.databricks.WorkflowException: java.net.SocketTimeoutException: Read timed out
Oct 24, 2025
apache-spark
apache-spark-sql
databricks
socket-timeout-exception
How to check Spark configuration from command line?
Oct 23, 2025
linux
scala
hadoop
apache-spark
Parallelizing a for loop with map and reduce in Spark with PySpark
Oct 23, 2025
python
apache-spark
pyspark
Run Spark locally with IntelliJ
Oct 23, 2025
scala
apache-spark
How to prevent processing files twice with Spark DataFrames
Oct 24, 2025
apache-spark
amazon-s3
apache-spark-sql
aws-glue