Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Apache Spark UDF that returns dynamic data types
Oct 25, 2022
scala
apache-spark
apache-spark-sql
user-defined-functions
How to save bucketed DataFrame?
Jun 13, 2022
apache-spark
apache-spark-sql
how to list spark-packages added to the spark context?
Jul 04, 2022
apache-spark
sparkr
UDF to map words to term Index in Spark
Mar 14, 2022
apache-spark
pyspark
apache-spark-sql
user-defined-functions
apache-spark-ml
how does YARN "Fair Scheduler" work with spark-submit configuration parameter
Aug 21, 2022
hadoop
apache-spark
hadoop-yarn
how to change column value in spark sql
Sep 05, 2022
sql
apache-spark
pyspark
apache-spark-sql
How to write streaming dataset to Kafka?
Mar 08, 2022
apache-spark
apache-kafka
spark-structured-streaming
Kafka with Spark 2.1 Structured Streaming - cannot deserialize
Oct 24, 2022
apache-spark
pyspark
deserialization
apache-spark-sql
spark-streaming
I am getting an error while creating a simple RDD in Spark
Jan 31, 2022
python
apache-spark
rdd
Spark Pipeline error
Jun 18, 2021
python
apache-spark
pyspark
pyspark-sql
spring autoconfiguration class is missing in META-INF/spring.factories
Feb 18, 2022
java
spring
maven
apache-spark
NoClassDefFoundError: Could not initialize XXX class after deploying on spark standalone cluster
Oct 20, 2022
scala
apache-spark
deployment
spark-streaming
spark-submit
How to cache partitioned dataset and use in multiple queries?
Jun 20, 2022
java
apache-spark
apache-spark-sql
Pyspark udf high memory utilization
Nov 04, 2022
apache-spark
pyspark
Enum equivalent in Spark Dataframe/Parquet
May 12, 2022
apache-spark
parquet
Cumulative distinct count with Spark SQL
Nov 08, 2022
sql
apache-spark
apache-spark-sql
pyspark.sql.utils.IllegalArgumentException: "Error while instantiating 'org.apache.spark.sql.hive.HiveSessionStateBuild in windows 10
Sep 06, 2022
apache-spark
pyspark
How handle categorical features in the latest Random Forest in Spark?
Sep 03, 2022
apache-spark
apache-spark-mllib
random-forest
apache-spark-ml
feature-engineering
Why is difference between sqlContext.read.load and sqlContext.read.text?
Sep 15, 2022
apache-spark
pyspark
apache-spark-sql
spark-csv
Which would be a quicker (and better) tool for querying data stored in the Parquet format - Spark SQL, Athena or ElasticSearch?
Aug 21, 2022
performance
apache-spark
elasticsearch
etl
amazon-athena
« Newer Entries
Older Entries »