Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
spark streaming: read CSV string from kafka, write to parquet
Feb 02, 2026
python
csv
apache-spark
apache-kafka
spark-structured-streaming
Can I use Spark DataFrame inside regular Spark map operation?
Feb 01, 2026
apache-spark
pyspark
apache-spark-sql
How to execute hql files with multiple SQL queries per single file?
Feb 02, 2026
scala
hadoop
apache-spark
hive
apache-spark-sql
How spark works when a join is followed by a coalesce
Feb 02, 2026
apache-spark
apache-spark-sql
using pyspark how to reject bad (malformed) records from csv file and save these rejected records in a new file
Feb 02, 2026
apache-spark
pyspark
apache-spark-sql
Merge multiple JSON file to single JSON and parquet file
Feb 02, 2026
scala
apache-spark
pyspark
apache-spark-sql
Spark ML Naive Bayes predict multiple classes with probabilities
Jan 30, 2026
apache-spark
pyspark
apache-spark-mllib
Run spark-shell command in shell script
Feb 02, 2026
mysql
unix
apache-spark
What's the meaning of the "Stages" on Spark UI for Streaming Scenarios
Feb 02, 2026
apache-spark
spark-streaming
SPARK + Standalone Cluster: Cannot start worker from another machine
Feb 01, 2026
apache-spark
Hadoop configuration in sparkR
Feb 02, 2026
r
hadoop
amazon-s3
apache-spark
sparkr
Spark count & percentage for every column values Exception handling and loading to Hive DB
Jan 30, 2026
scala
apache-spark
hadoop
hive
apache-spark-sql
How to convert int64 datatype columns of parquet file to timestamp in SparkSQL data frame?
Feb 01, 2026
apache-spark
hive
pyspark
apache-spark-sql
Poor weak scaling of Apache Spark join operation
Feb 01, 2026
performance
scala
apache-spark
distributed-computing
« Newer Entries
Older Entries »