New posts in apache-spark
How to wait until all executors are allocated before Spark application starts on YARN?
May 07, 2022
apache-spark
hadoop-yarn
amazon-emr
Build Spark SQL query dynamically
Oct 14, 2022
scala
apache-spark
apache-spark-sql
Why does Spark on YARN in cluster mode fail with "Exception in thread "Driver" java.lang.NullPointerException"?
Jan 01, 2021
apache-spark
nullpointerexception
emr
PySpark: create dataframe from random uniform distribution
May 04, 2022
python
apache-spark
pyspark
How to force a certain partitioning in a PySpark DataFrame?
Oct 03, 2021
apache-spark
pyspark
partitioning
Coalesce columns in spark dataframe
Feb 20, 2020
scala
apache-spark
null
apache-spark-sql
user-defined-functions
Dataframe: how to groupBy/count then order by count in Scala
Nov 11, 2022
scala
apache-spark
Error using Spark: 'save' does not support bucketing right now
Apr 26, 2022
apache-spark
apache-spark-sql
partitioning
parquet
How to find installation directory of Apache Spark package in Homebrew?
Oct 20, 2022
macos
apache-spark
homebrew
Get index of item in array that is a column in a Spark dataframe
Nov 10, 2022
apache-spark
pyspark
Correct Parquet file size when storing in S3?
Oct 26, 2022
apache-spark
hdfs
parquet
Optimal file size and parquet block size
Feb 12, 2022
apache-spark
amazon-s3
parquet
Adding external jars in EMR Notebooks
Jun 09, 2022
scala
apache-spark
jupyter-notebook
amazon-emr
Spark/Hadoop throws exception for large LZO files
May 13, 2020
hadoop
apache-spark
elastic-map-reduce
lzo
simple mapping partitions job in (py)spark
Jan 15, 2022
python
ipython
apache-spark
Deploy mode in "SPARK-SUBMIT"
Oct 16, 2022
apache-spark
hadoop-yarn
Load Spark data locally: Incomplete HDFS URI
Jun 21, 2022
scala
sbt
apache-spark
Requirements for converting Spark dataframe to Pandas/R dataframe
May 14, 2019
pandas
apache-spark
dataframe
hadoop
apache-spark-sql
creating spark data structure from multiline record
Oct 31, 2022
python
apache-spark
pyspark
How to use secondary user actions to improve recommendations with Spark ALS?
Feb 09, 2018
apache-spark
apache-spark-mllib
mahout-recommender