Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Parsing Parquet files with Perl

What would be the correct approach to parsing Parquet files with Perl?

CPAN seems devoid of any module to do this, aside from some logic inside S3 Amazon Web Services modules that don't offer any parsing code I can find.

like image 326
DVK Avatar asked Jan 17 '26 23:01

DVK


1 Answers

There's not a single module that would do this, but there is a path forward. First, note that the Parquet format has a Thrift specification, which is parseable by Thrift::IDL. That, coupled with either Convert::Binary::C or the link posted by@HåkonHægland should get you started.

like image 139
Diab Jerius Avatar answered Jan 19 '26 20:01

Diab Jerius