
New posts in apache-spark-sql

What is the preferred way to avoid SQL injections in Spark-SQL (on Hive)

Integrating Spark SQL and Apache Drill through JDBC

How to load Tuple from Cassandra table?

Are the join types defined as constants somewhere accessible in Apache Spark?

Start kubernetes pod memory depending on size of data job

spark.table fails with java.io.Exception: No FileSystem for Scheme: abfs

Partitions not being pruned in simple SparkSQL queries

Using TestHiveContext/HiveContext in unit tests

Not able to fetch result from hive transaction enabled table through spark-sql

How to write dataframe (obtained from hive table) into hadoop SequenceFile and RCFile?

The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwx--------- (on Linux)

Using .where() on pyspark.sql.functions.max().over(window) on Spark 2.4 throws Java exception

one-hot encode of multiple string categorical features using Spark DataFrames

Aggregate while dropping duplicates in pyspark

How to extract complex JSON structures using Apache Spark 1.4.0 Data Frames

Apache Spark: In Spark SQL, are SQL queries vulnerable to SQL injection? [duplicate]

rank() function usage in Spark SQL

How to convert the group by function to data frame

How can you update values in a dataset?

How to add sparse vectors after group by, using Spark SQL?