Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Does collect_list() maintain relative ordering of rows?
Nov 17, 2022
scala
apache-spark
apache-spark-sql
org.apache.spark.SparkException: Job aborted due to stage failure: Task from application
Sep 26, 2022
apache-spark
"sparkContext was shut down" while running spark on a large dataset
Sep 28, 2022
scala
apache-spark
hadoop-yarn
apache-spark-sql
Total size of serialized results of tasks is bigger than spark.driver.maxResultSize
Sep 14, 2022
apache-spark
pyspark
Spark 2.0 deprecates 'DirectParquetOutputCommitter', how to live without it?
Jan 23, 2022
hadoop
apache-spark
amazon-s3
amazon-emr
parquet
What is the best way to remove accents with Apache Spark dataframes in PySpark?
Sep 12, 2022
python
apache-spark
pyspark
apache-spark-sql
unicode-normalization
Hash function in spark
Sep 08, 2022
scala
apache-spark
hash
apache-spark-sql
Spark - Which instance type is preferred for AWS EMR cluster? [closed]
Sep 12, 2022
amazon-ec2
apache-spark
emr
Spark losing println() on stdout
May 07, 2020
scala
apache-spark
println
accumulator
How to stop a running SparkContext before opening the new one
Sep 08, 2022
scala
apache-spark
How to merge multiple feature vectors in DataFrame?
Sep 12, 2022
apache-spark
machine-learning
apache-spark-sql
apache-spark-ml
Spark train test split
Oct 18, 2022
apache-spark
apache-spark-mllib
train-test-split
Stopping a Running Spark Application
Feb 28, 2022
apache-spark
Where are the Spark logs on EMR?
Jun 30, 2016
scala
apache-spark
emr
ImportError: No module named numpy on spark workers
Sep 15, 2022
python
numpy
apache-spark
pyspark
PySpark converting a column of type 'map' to multiple columns in a dataframe
Sep 12, 2022
python
apache-spark
dataframe
pyspark
apache-spark-sql
Accessing Spark SQL RDD tables through the Thrift Server
Oct 27, 2019
apache-spark
apache-spark-sql
Spark save(write) parquet only one file
Aug 25, 2022
scala
apache-spark
parquet
Using Grouped Map Pandas UDFs with arguments
Sep 26, 2022
python
apache-spark
pyspark
pandas-groupby
How to use custom classes with Apache Spark (pyspark)?
Sep 12, 2022
python
apache-spark
python-module
pyspark
« Newer Entries
Older Entries »