Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Combine results from batch RDD with streaming RDD in Apache Spark
Nov 09, 2022
cassandra
apache-spark
apache-kafka
spark-streaming
real time log processing using apache spark streaming
Mar 31, 2022
apache-spark
apache-kafka
flume
spark-streaming
Spark streaming DStream RDD to get file name
Nov 03, 2022
scala
apache-spark
Create Spark DataFrame in Spark Streaming from JSON Message on Kafka
Nov 09, 2017
scala
apache-spark
dataframe
apache-kafka
Spark forcing log4j
Jul 27, 2021
java
scala
hadoop
apache-spark
logback
Accessing HDFS HA from spark job (UnknownHostException error)
Nov 12, 2022
scala
apache-spark
hdfs
mesos
mesosphere
Spark worker memory
Oct 24, 2022
apache-spark
Why is a Spark Row object so big compared to equivalent structures?
Mar 27, 2019
apache-spark
Understanding Spark shuffle spill
Oct 25, 2022
apache-spark
How to transform RDD, Dataframe or Dataset straight to a Broadcast variable without collect?
Oct 16, 2022
scala
apache-spark
dataframe
apache-spark-sql
More efficient way to loop through PySpark DataFrame and create new columns
Nov 15, 2022
python
apache-spark
pyspark
Dag-scheduler-event-loop java.lang.OutOfMemoryError: unable to create new native thread
Dec 24, 2021
java
apache-spark
Passing a map with struct-type key into a Spark UDF
Oct 25, 2022
scala
apache-spark
Handling microseconds in Spark Scala
Oct 31, 2022
java
scala
datetime
apache-spark
apache-spark-sql
How to change user in hdfs using sparkSubmit in java
Aug 30, 2022
java
hadoop
apache-spark
Spark how to use a UDF with a Join
Nov 17, 2022
apache-spark
join
dataframe
user-defined-functions
How to validate Spark SQL expression without executing it?
Nov 06, 2022
apache-spark
apache-spark-sql
how to process data in chunks/batches with kafka streams?
Nov 16, 2022
java
scala
apache-spark
apache-kafka
apache-kafka-streams
Spark: UDF executed many times
Aug 17, 2022
scala
apache-spark
apache-spark-sql
Problems when writing parquet with timestamps prior to 1900 in AWS Glue 3.0
Jun 28, 2022
amazon-web-services
apache-spark
pyspark
aws-glue
« Newer Entries
Older Entries »