Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in spark-streaming

What is the difference between RUNNING and LOADING states for executors in web UI?

java.io.InvalidClassException: org.apache.spark.deploy.ApplicationDescription; local class incompatible

Spark streaming job doesn't delete shuffle files

Spark Streaming Kafka Stream batch execution

How to specify the group id of kafka consumer for spark structured streaming?

Does DStream's RDD pull entire data created for the batch interval at one shot?

wordCounts.dstream().saveAsTextFiles("LOCAL FILE SYSTEM PATH", "txt"); does not write to file

how to convert rdd to list effectively without using collect function

Class org.apache.hadoop.fs.s3a.auth.IAMInstanceCredentialsProvider not found when trying to write data on S3 bucket from Spark

Yarn Heap usage growing over time

What is the correct way to use the "topics" parameter in KafkaUtils.createstream()?