Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in spark-streaming

How to save latest offset that Spark consumed to ZK or Kafka and can read back after restart

Spark Streaming get warn "replicated to only 0 peer(s) instead of 1 peers"

In Apache Spark, how to convert a slow RDD/dataset into a stream?

Spark Streaming: foreachRDD update my mongo RDD

Spark Streaming mapWithState seems to rebuild complete state periodically

How to fix "java.io.NotSerializableException: org.apache.kafka.clients.consumer.ConsumerRecord" in Spark Streaming Kafka Consumer?

What is the correct way to start/stop spark streaming jobs in yarn?

spark ssc.textFileStream is not streamining any files from directory

How to specify which java version to use in spark-submit command?

SBT Test Error: java.lang.NoSuchMethodError: net.jpountz.lz4.LZ4BlockInputStream

Amazon s3a returns 400 Bad Request with Spark

Spark off heap memory leak on Yarn with Kafka direct stream

How to create Spark RDD from an iterator?

Kafka topic partitions to Spark streaming

java.lang.NoClassDefFoundError: org/apache/spark/streaming/twitter/TwitterUtils$ while running TwitterPopularTags

Spark Dataframe validating column names for parquet writes

Spark streaming with Kafka - createDirectStream vs createStream

How to stop spark streaming when the data source has run out

AbstractMethodError creating Kafka stream