Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Partitions not being pruned in simple SparkSQL queries
Sep 13, 2022
amazon-s3
apache-spark
apache-spark-sql
pyspark
parquet
Multiple windows of different durations in Spark Streaming application
May 24, 2018
apache-spark
real-time
analytics
apache-kafka
spark-streaming
Failed to load class for data source: com.databricks.spark.csv
Apr 08, 2021
apache-spark
Spark JoinWithCassandraTable on TimeStamp partition key STUCK
Aug 31, 2022
mysql
scala
cassandra
apache-spark
datastax-enterprise
Using TestHiveContext/HiveContext in unit tests
Jun 29, 2021
apache-spark
hive
apache-spark-sql
hivecontext
Locally change the log level for the zookeeper C client
Aug 17, 2022
logging
apache-spark
apache-zookeeper
mesos
Spark mapWithState shuffles all data to one node
Nov 06, 2022
scala
apache-spark
spark-streaming
How to give predicted and label columns in BinaryClassificationMetrics evaluation for Naive Bayes model
Dec 20, 2019
scala
apache-spark
machine-learning
apache-spark-mllib
apache-spark-ml
Not able to fetch result from hive transaction enabled table through spark-sql
Oct 20, 2022
hadoop
apache-spark
hive
apache-spark-sql
How to write dataframe (obtained from hive table) into hadoop SequenceFile and RCFile?
Oct 16, 2022
apache-spark
apache-spark-sql
spark-dataframe
How to convert RDD to DataFrame in Spark Streaming, not just Spark
Oct 18, 2022
scala
apache-spark
spark-streaming
rdd
Apache Toree and Spark Scala Not Working in Jupyter
Nov 16, 2021
scala
apache-spark
jupyter-notebook
apache-toree
Spark never finishes jobs and stages, JobProgressListener crash
Aug 07, 2021
apache-spark
The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwx--------- (on Linux)
Jan 18, 2020
apache-spark
hive
apache-spark-sql
spark-dataframe
hiveql
How to implement a ScalaTest FunSuite to avoid boilerplate Spark code and import implicits
Jun 05, 2022
scala
apache-spark
scalatest
Accessing Spark Mllib Bisecting K-means tree data
Apr 15, 2019
apache-spark
apache-spark-mllib
Am I fully utilizing my EMR cluster?
Mar 08, 2022
amazon-web-services
apache-spark
pyspark
elastic-map-reduce
How to log malformed rows from Scala Spark DataFrameReader csv
Feb 05, 2020
scala
csv
logging
apache-spark
How to transform Dataset<Tuple2<String,DeviceData>> to Iterator<DeviceData>
Feb 16, 2021
java
apache-spark
apache-spark-2.0
apache-spark-dataset
Naive install of PySpark to also support S3 access
Oct 24, 2022
python
amazon-web-services
apache-spark
amazon-s3
pyspark
« Newer Entries
Older Entries »