Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in parquet
Conversion of JSON to parquet format using Apache Parquet in C#
Sep 13, 2025
c#
parquet
Total allocation exceeds 95.00% (960,285,889 bytes) of heap memory- pyspark error
Sep 14, 2025
python
csv
pyspark
heap-memory
parquet
Parquet predicate pushdown filtering with Dask
Sep 13, 2025
dask
parquet
Use of compaction for Parquet bulk format
Sep 14, 2025
apache-flink
parquet
flink-streaming
Is there a way to traverse through a dask dataframe backwards?
Sep 12, 2025
python
pandas
dask
parquet
dask-dataframe
How to show user schema in a Parquet file using DuckDB?
Sep 14, 2025
parquet
database-metadata
duckdb
Importing parquet file in chunks and insert in DuckDB
Sep 10, 2025
python
pandas
parquet
pyarrow
duckdb
Reading single parquet-partition with single file results in DataFrame with more partitions
Sep 09, 2025
python
apache-spark
pyspark
parquet
How can I open a large parquet file with Keras?
Sep 10, 2025
tensorflow
keras
pyspark
parquet
Kafka - From JSON records to Parquet files in S3
Sep 08, 2025
json
apache-kafka
parquet
apache-kafka-connect
Combining 2 parquets that are too large for memory together
Sep 07, 2025
r
parquet
apache-arrow
Read schema information from a parquet format file stored in azure data lake gen2
Sep 07, 2025
parquet
azure-synapse
azure-data-lake-gen2
Pyarrow: TypeError: an integer is required (got type str)
Sep 06, 2025
python
pandas
parquet
Amazon AWS Athena HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split / Not valid Parquet file, parquet files compress to gzip with Athena
Sep 07, 2025
amazon-web-services
gzip
parquet
amazon-athena
Spark: what options can be passed with DataFrame.saveAsTable or DataFrameWriter.options?
Sep 06, 2025
scala
hadoop
apache-spark
hive
parquet
Efficiency in using pandas and parquet
Sep 04, 2025
pandas
dask
parquet
pyarrow
ibis
Spark job with large text file in gzip format
Mar 14, 2023
hadoop
apache-spark
amazon-s3
apache-spark-sql
parquet
read a parquet files from HDFS using PyArrow
Mar 09, 2023
hdfs
parquet
pyarrow
JOOQ generator for Apache Spark parquet dataframes?
Sep 03, 2025
apache-spark
apache-spark-sql
jooq
parquet
Reading data from s3 subdirectories in PySpark
Sep 03, 2025
apache-spark
parquet
aws-glue
pyspark
Older Entries »