New posts in apache-spark
PySpark: Reading JSON data file with no separator between objects
Mar 07, 2023
json
apache-spark
pyspark
databricks
amazon-kinesis-firehose
PySpark DataFrame: Change cell value based on min/max condition in another column
Mar 07, 2023
python
apache-spark
dataframe
pyspark
apache-spark-sql
How to use array_contains with 2 columns in spark scala?
Mar 09, 2023
scala
apache-spark
dataframe
Spark structured streaming query always starts with auto.offset.reset=earliest even though auto.offset.reset=latest is set
Mar 08, 2023
scala
apache-spark
kafka-consumer-api
spark-structured-streaming
Creating Hive table on top of multiple parquet files in s3
Mar 09, 2023
hadoop
apache-spark
hive
amazon-emr
parquet
PySpark - Split all dataframe column strings to array
Mar 09, 2023
apache-spark
pyspark
PySpark: Invalid returnType with scalar Pandas UDFs
Mar 08, 2023
apache-spark
pyspark
apache-arrow
Spark parse string to timestamp with timezone
Mar 08, 2023
apache-spark
apache-spark-sql
timestamp
timezone
timezone-offset
Upsert to CosmosDB from Spark error
Mar 09, 2023
scala
apache-spark
pyspark
apache-spark-sql
azure-cosmosdb
Exception in thread "main" org.apache.spark.SparkException: Must specify the driver container image
Mar 09, 2023
apache-spark
docker
kubernetes
kubectl
minikube
How to create an Encoder for Scala collection (to implement custom Aggregator)?
Mar 09, 2023
scala
apache-spark
apache-spark-sql
apache-spark-encoders
Splitting list of JSON key/value pairs into columns of a row in a Dataset
Mar 08, 2023
scala
apache-spark
apache-spark-sql
Inconsistent results with KMeans between Apache Spark and scikit-learn
Mar 08, 2023
python
apache-spark
scikit-learn
pyspark
k-means
Spark - pass full row to a udf and then get column name inside udf
Mar 07, 2023
scala
apache-spark
How can I control the number of output files written from Spark DataFrame?
Mar 07, 2023
scala
apache-spark
apache-kafka
apache-spark-sql
spark-streaming
Spark: Create temporary table by executing sql query on temporary tables
Mar 08, 2023
scala
apache-spark
jenkins
jdbc
spark dataframe: explode list column
Mar 08, 2023
apache-spark
apache-spark-sql
PySpark - Show a count of column data types in a dataframe
Mar 08, 2023
python
apache-spark
pyspark
Iterate over elements of columns Scala
Mar 08, 2023
scala
apache-spark
apache-spark-sql
Spark Scala JAAS configuration
Mar 07, 2023
scala
apache-spark
apache-kafka
jaas