New posts in apache-spark
Multiple SparkSessions in single JVM
Sep 24, 2022
apache-spark
Spark dataframe filter
Sep 27, 2022
scala
apache-spark
apache-spark-sql
Spark Dataframe groupBy and sort results into a list
Nov 03, 2022
apache-spark
dataframe
apache-spark-sql
Concatenating string by rows in pyspark
Sep 15, 2022
python
apache-spark
pyspark
How to do opposite of explode in PySpark?
Oct 23, 2022
apache-spark
pyspark
apache-spark-sql
Spark 2.2.1 incompatible Jackson version 2.8.8
Mar 19, 2022
java
eclipse
scala
maven
apache-spark
Passing command line arguments to Spark-shell
Sep 13, 2022
apache-spark
How to drop multiple column names given in a list from Spark DataFrame?
Sep 13, 2022
apache-spark
dataframe
pyspark
apache-spark-sql
pyspark-sql
Failed to start master for Spark in Windows
Sep 16, 2022
apache-spark
windows-10
How to exit spark-submit after the submission
Mar 25, 2022
apache-spark
hadoop-yarn
Spark Random Forests: Different results with same seed
Oct 25, 2022
scala
apache-spark
machine-learning
random-forest
Does Spark support Partition Pruning with Parquet Files
Sep 13, 2022
apache-spark
amazon-s3
hive
parquet
Spark Kafka Direct DStream - How many executors and RDD partitions in yarn-cluster mode if num-executors is set?
Sep 07, 2022
apache-spark
apache-kafka
spark-streaming
Spark: efficiency of dataframe checkpoint vs. explicitly writing to disk
Aug 30, 2022
scala
apache-spark
apache-spark-sql
Why does Spark's OneHotEncoder drop the last category by default?
Aug 29, 2022
apache-spark
machine-learning
pyspark
one-hot-encoding
bigdata
Does collect_list() maintain relative ordering of rows?
Nov 17, 2022
scala
apache-spark
apache-spark-sql
org.apache.spark.SparkException: Job aborted due to stage failure: Task from application
Sep 26, 2022
apache-spark
"sparkContext was shut down" while running spark on a large dataset
Sep 28, 2022
scala
apache-spark
hadoop-yarn
apache-spark-sql
Total size of serialized results of tasks is bigger than spark.driver.maxResultSize
Sep 14, 2022
apache-spark
pyspark
Spark 2.0 deprecates 'DirectParquetOutputCommitter', how to live without it?
Jan 23, 2022
hadoop
apache-spark
amazon-s3
amazon-emr
parquet