Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in parquet
Importing parquet file in chunks and insert in DuckDB
Sep 10, 2025
python
pandas
parquet
pyarrow
duckdb
Reading single parquet-partition with single file results in DataFrame with more partitions
Sep 09, 2025
python
apache-spark
pyspark
parquet
How can I open a large parquet file with Keras?
Sep 10, 2025
tensorflow
keras
pyspark
parquet
Kafka - From JSON records to Parquet files in S3
Sep 08, 2025
json
apache-kafka
parquet
apache-kafka-connect
Combining 2 parquets that are too large for memory together
Sep 07, 2025
r
parquet
apache-arrow
Read schema information from a parquet format file stored in azure data lake gen2
Sep 07, 2025
parquet
azure-synapse
azure-data-lake-gen2
Pyarrow: TypeError: an integer is required (got type str)
Sep 06, 2025
python
pandas
parquet
Amazon AWS Athena HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split / Not valid Parquet file, parquet files compress to gzip with Athena
Sep 07, 2025
amazon-web-services
gzip
parquet
amazon-athena
Spark: what options can be passed with DataFrame.saveAsTable or DataFrameWriter.options?
Sep 06, 2025
scala
hadoop
apache-spark
hive
parquet
Efficiency in using pandas and parquet
Sep 04, 2025
pandas
dask
parquet
pyarrow
ibis
Spark job with large text file in gzip format
Mar 14, 2023
hadoop
apache-spark
amazon-s3
apache-spark-sql
parquet
read a parquet files from HDFS using PyArrow
Mar 09, 2023
hdfs
parquet
pyarrow
Creating Hive table on top of multiple parquet files in s3
Mar 09, 2023
hadoop
apache-spark
hive
amazon-emr
parquet
How to save spark dataframe to parquet without using INT96 format for timestamp columns?
Mar 04, 2023
apache-spark
avro
parquet
Refresh metadata for Dataframe while reading parquet file
Mar 05, 2023
apache-spark
apache-spark-sql
parquet
apache-spark-dataset
UPSERT in parquet Pyspark
Mar 05, 2023
amazon-s3
pyspark
etl
parquet
Spark2 Can't write dataframe to parquet hive table : HiveFileFormat`. It doesn't match the specified format `ParquetFileFormat`
Feb 06, 2023
apache-spark
hive
parquet
apache-spark-2.0
Read parquet data from ByteArrayOutputStream instead of file
Feb 05, 2023
java
parquet
bytearrayoutputstream
JOOQ generator for Apache Spark parquet dataframes?
Sep 03, 2025
apache-spark
apache-spark-sql
jooq
parquet
Reading data from s3 subdirectories in PySpark
Sep 03, 2025
apache-spark
parquet
aws-glue
pyspark
Older Entries »