Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
How to set spark driver maxResultSize when in client mode in pyspark?
Oct 18, 2025
python
apache-spark
driver
pyspark
Pyspark - Split a column and take n elements
Oct 18, 2025
apache-spark
pyspark
apache-spark-sql
How to concatenate a string and a column in a dataframe in spark?
Oct 17, 2025
apache-spark
dataframe
apache-spark-sql
Does an RDD need to be cached if used more than once?
Oct 17, 2025
python
scala
hadoop
apache-spark
rdd
Call a function for each row of a dataframe in pyspark[non pandas]
Oct 17, 2025
apache-spark
apache-spark-sql
pyspark
Remove element from pyspark array based on element of another column
Oct 18, 2025
apache-spark
pyspark
apache-spark-sql
Error when importing udf from module -> SparkContext should only be created and accessed on the driver
Oct 16, 2025
python
apache-spark
pyspark
runtime-error
pyspark.ml: Type error when computing precision and recall
Oct 18, 2025
python
apache-spark
machine-learning
pyspark
apache-spark-ml
Is there a way to find out which port the Spark web UI is using?
Oct 17, 2025
apache-spark
pyspark
jupyter-notebook
Reading from one Hadoop cluster and writing to another Hadoop custer
Oct 18, 2025
apache-spark
hadoop
hdfs
Scala read Json file as Json
Oct 16, 2025
scala
apache-spark
What is the purpose of global temporary views?
Oct 18, 2025
apache-spark
apache-spark-sql
Reuse Spark session across multiple Spark jobs
Oct 18, 2025
apache-spark
pyspark
apache-spark-sql
PySpark: TypeError: StructType can not accept object 0.10000000000000001 in type <type 'numpy.float64'>
Oct 18, 2025
python
numpy
apache-spark
pyspark
apache-spark-sql
How to pass multiple Columns as features in a Logistic Regression Classifier in Spark? [duplicate]
Oct 17, 2025
python
apache-spark
machine-learning
pyspark
logistic-regression
Implicit schema for pandas_udf in PySpark?
Oct 17, 2025
python
apache-spark
pyspark
user-defined-functions
Spark: how to write dataframe to S3 efficiently
Oct 17, 2025
amazon-web-services
apache-spark
amazon-s3
pyspark
Creating data frame out of sequence using toDF method in Apache Spark
Oct 17, 2025
scala
apache-spark
apache-spark-sql
rdd
Does Spark Dynamic Allocation depend on external shuffle service to work well?
Oct 17, 2025
apache-spark
spark-shuffle
Older Entries »