Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Spark Structured Streaming with Kafka SASL/PLAIN authentication
Sep 06, 2022
apache-spark
apache-kafka
spark-structured-streaming
Job 65 cancelled because SparkContext was shut down
Dec 05, 2021
apache-spark
hadoop
pyspark
apache-spark-sql
apache-zeppelin
PySpark - pass a value from another column as the parameter of spark function
Oct 29, 2022
apache-spark
pyspark
apache-spark-sql
NoClassDefFoundError: org/apache/spark/sql/internal/connector/SimpleTableProvider when running in Dataproc
May 22, 2022
apache-spark
sbt
google-cloud-dataproc
PySpark data skewness with Window Functions
Sep 25, 2022
apache-spark
pyspark
In spark, what does the parameter "minPartitions" works in SparkContext.textFile(path, minPartitions)?
Jun 20, 2022
apache-spark
How to query when connecting mongodb with apache-spark
Sep 24, 2022
mongodb
hadoop
apache-spark
Hadoop DistributedCache functionality in Spark
Aug 30, 2022
hadoop
apache-spark
distribute
distributed-cache
Merge more than 32 files in Google Cloud Storage
Jan 06, 2020
google-cloud-storage
apache-spark
google-compute-engine
reduceByKey using Scala object as key
Dec 03, 2019
scala
apache-spark
reduce
launching a spark program using oozie workflow
Nov 18, 2022
scala
apache-spark
workflow
oozie
custom join with non equal keys
Nov 10, 2021
join
apache-spark
Ordering an RDD[String]
Aug 29, 2022
scala
apache-spark
Apache Spark app workflow
Jun 24, 2022
apache-spark
workflow
How to create collection of RDDs out of RDD?
Nov 13, 2022
scala
apache-spark
How do I install Python libraries automatically on Dataproc cluster startup?
May 12, 2022
hadoop
apache-spark
google-cloud-platform
google-cloud-dataproc
Spark Streaming on EC2: Exception in thread "main" java.lang.ExceptionInInitializerError
Feb 10, 2022
scala
maven
amazon-ec2
apache-spark
spark-streaming
Spark difference between maven Artifacts spark-core_2.10 and spark-core_2.11
Dec 11, 2020
maven
apache-spark
Apache Spark: Driver (instead of just the Executors) tries to connect to Cassandra
Oct 26, 2022
scala
apache-spark
cassandra
Efficient grouping by key using mapPartitions or partitioner in Spark
Nov 13, 2022
apache-spark
grouping
partition
« Newer Entries
Older Entries »