New posts in apache-spark
Spark: Is there any rule of thumb about the optimal number of partitions of an RDD and its number of elements?
Oct 01, 2022
apache-spark
apache-spark-sql
partitioning
Spark SQL top n per group
Apr 22, 2022
apache-spark
group-by
apache-spark-sql
top-n
org.apache.thrift.transport.TTransportException error while reading a large JSON file in Zeppelin (Scala)
Aug 18, 2021
json
scala
apache-spark
apache-zeppelin
How to split column of vectors into two columns?
Mar 25, 2022
apache-spark
pyspark
apache-spark-ml
Running steps of EMR in parallel
Oct 15, 2022
web-services
amazon-web-services
apache-spark
amazon-emr
How does Spark handle data larger than cluster memory?
Mar 08, 2022
apache-spark
Dropping a nested column of a DataFrame with PySpark
Jul 13, 2022
apache-spark
dataframe
pyspark
struct
schema
Best practice to create a SparkSession object in Scala for use in both unit tests and spark-submit
Aug 31, 2022
scala
apache-spark
spark-submit
Add months to date column in Spark dataframe
Nov 06, 2022
python
apache-spark
pyspark
apache-spark-sql
What does "pre-built for Apache Hadoop 2.7 and later" mean?
Oct 29, 2022
apache-spark
How can I obtain the DAG of an Apache Spark job without running it?
Apr 14, 2022
scala
apache-spark
Why is there no map function for DataFrame in PySpark while the Spark equivalent has it?
Nov 06, 2022
apache-spark
pyspark
How to set spark.driver.memory for Spark/Zeppelin on EMR
Apr 20, 2019
apache-spark
emr
amazon-emr
apache-zeppelin
Is there a way to validate the syntax of a raw Spark SQL query?
May 21, 2022
scala
apache-spark
java.lang.UnsupportedOperationException: fieldIndex on a Row without schema is undefined: Exception on row.getAs[String]
Sep 05, 2022
scala
apache-spark
How to select multiple columns of a Dataset, given a list of column names?
May 08, 2022
java
apache-spark
apache-spark-sql
Spark decimal type precision loss
Jun 16, 2022
scala
apache-spark
apache-spark-sql
Comparison of a `float` to `np.nan` in a Spark DataFrame
Sep 07, 2022
python
numpy
apache-spark
pyspark
nan
How do I get a Spark DataFrame to print its explain plan to a string?
Nov 17, 2022
scala
apache-spark
dataframe
How to find the max string length of a column in Spark using a DataFrame?
Sep 15, 2022
scala
apache-spark
apache-spark-sql