New posts in apache-spark
Reuse Spark session across multiple Spark jobs
Oct 18, 2025
apache-spark
pyspark
apache-spark-sql
PySpark: TypeError: StructType can not accept object 0.10000000000000001 in type <type 'numpy.float64'>
Oct 18, 2025
python
numpy
apache-spark
pyspark
apache-spark-sql
How to pass multiple Columns as features in a Logistic Regression Classifier in Spark? [duplicate]
Oct 17, 2025
python
apache-spark
machine-learning
pyspark
logistic-regression
Implicit schema for pandas_udf in PySpark?
Oct 17, 2025
python
apache-spark
pyspark
user-defined-functions
Spark: how to write dataframe to S3 efficiently
Oct 17, 2025
amazon-web-services
apache-spark
amazon-s3
pyspark
Creating data frame out of sequence using toDF method in Apache Spark
Oct 17, 2025
scala
apache-spark
apache-spark-sql
rdd
Does Spark Dynamic Allocation depend on external shuffle service to work well?
Oct 17, 2025
apache-spark
spark-shuffle
Convert a Spark Vector of features into an array
Oct 17, 2025
arrays
scala
apache-spark
vector
apache-spark-sql
PySpark: How to write a dataframe partitioned by year/month/day/hour sub-directories?
Oct 17, 2025
apache-spark
pyspark
apache-spark-sql
How to allow pyspark to run code on emr cluster
Oct 17, 2025
apache-spark
pyspark
port
devops
amazon-emr
InvalidQueryException: Consistency level LOCAL_ONE is not supported for this operation. Supported consistency levels are: LOCAL_QUORUM
Oct 16, 2025
amazon-web-services
apache-spark
cassandra
datastax-java-driver
spark-cassandra-connector
Turning a continuous variable into categorical in Spark
Oct 17, 2025
scala
apache-spark
recode
How to get Kafka header's value to Spark Dataset as a single column?
Oct 17, 2025
scala
apache-spark
apache-kafka
spark-structured-streaming
When using Spark Structured Streaming, how to get just the aggregation result of the current batch, like Spark Streaming?
Oct 17, 2025
apache-spark
spark-streaming
spark-structured-streaming
How to load a spark-nlp pre-trained model from disk
Oct 17, 2025
scala
apache-spark
nlp
apache-spark-mllib
johnsnowlabs-spark-nlp
Pyspark error with UDF: py4j.Py4JException: Method __getnewargs__([]) does not exist error
Oct 17, 2025
python
apache-spark
pyspark
databricks
SparkJob on GCP dataproc failing with error - java.lang.NoSuchMethodError: io.netty.buffer.PooledByteBufAllocator.<init>(ZIIIIIIZ)V
Oct 16, 2025
apache-spark
google-cloud-platform
google-cloud-dataproc
What happens if a Spark broadcast join is too large?
Oct 16, 2025
apache-spark
PySpark 2.0 - IndexToString Error
Oct 16, 2025
apache-spark
pyspark
apache-spark-ml
How to row bind two Spark dataframes using sparklyr?
Oct 16, 2025
r
apache-spark
dplyr
sparklyr