New posts in apache-spark
How can I retrieve the alias for a DataFrame in Spark
Oct 31, 2025
apache-spark
apache-spark-sql
Logging in spark structured streaming
Oct 30, 2025
apache-spark
spark-structured-streaming
Join two RDDs on custom function - SPARK
Oct 30, 2025
python
join
apache-spark
pyspark
cluster-computing
Spark 2.3.1 AWS EMR not returning data for some columns yet works in Athena/Presto and Spectrum
Oct 31, 2025
apache-spark
amazon-emr
Is getNumPartitions an RDD action or transformation?
Oct 31, 2025
apache-spark
rdd
Why do I get null results from the date_format() PySpark function?
Oct 30, 2025
python
apache-spark
pyspark
Databricks - Failure starting repl. Try detaching and re-attaching the notebook
Oct 30, 2025
python
apache-spark
pyspark
databricks
cluster-computing
Broadcast join in spark not working for left outer
Oct 31, 2025
apache-spark
pyspark
apache-spark-sql
amazon-emr
How do I get data on spark jobs and stages from python [duplicate]
Oct 29, 2025
python-3.x
apache-spark
pyspark
Spark Kubernetes - FileNotFoundException when copying config files from driver to executors using --files or spark.files
Oct 30, 2025
java
scala
docker
apache-spark
kubernetes
Spark multiple dynamic aggregate functions, countDistinct not working
Oct 30, 2025
scala
apache-spark
count
apache-spark-sql
distinct
Apache Spark: saveAsTextFile not working correctly in Stand Alone Mode
Oct 30, 2025
apache-spark
TIMESTAMP not behaving as intended with parquet in hive
Oct 29, 2025
apache-spark
hadoop
hive
DESCRIBE TABLE see which columns are NOT NULL
Oct 31, 2025
apache-spark
apache-spark-sql
databricks
azure-databricks
Are built-in Spark transformations faster than Spark SQL queries?
Oct 30, 2025
apache-spark
pyspark
apache-spark-sql
aws-glue
Nested Json extract the value with unknown key in the middle
Oct 30, 2025
json
scala
apache-spark
apache-spark-sql
scala-collections
Sparklyr/Dplyr - How to apply a user-defined function to each row of a Spark data frame and write the output of each row to a new column?
Oct 30, 2025
r
apache-spark
dplyr
apache-spark-sql
sparklyr
How do I connect to a Kerberos-secured Kafka cluster with Spark Structured Streaming?
Oct 30, 2025
scala
apache-spark
apache-kafka
kerberos
How to select an exact number of random rows from DataFrame
Oct 30, 2025
apache-spark
random
apache-spark-sql
Pandas-on-spark throwing java.lang.StackOverFlowError
Oct 30, 2025
python
pandas
apache-spark
pyspark
pyspark-pandas