I would like to know what standalone options exist for parsing normal webpages and converting them to the easy-to-read format offered by services like Pocket, Readability, and Instapaper.
I am primarily looking for a Java parser library to include in my app, but in general, all kinds of standalone tools are appreciated.
Thanks.
Snacktory itself now states that it is outdated and recommends using Crux instead.
But there's also Readability4J. It's a Kotlin port of Mozilla's Readability.js, which is what Firefox uses for its Reader View, so Readability4J's output exactly matches Firefox's Reader View.
Disclaimer: I am the author of Readability4J. But I only ported it; kudos for the great work go to Mozilla: https://github.com/mozilla/readability.
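For a rough idea of what using it from Java looks like, here is a minimal sketch. It assumes the Readability4J artifact is on your classpath and that the classes live in the `net.dankito.readability4j` package, so check the project's README for the current coordinates. The URL, the sample HTML, and the `ReaderViewExample` class are made up for illustration; in a real app you would download the page yourself and pass in its HTML.

```java
import net.dankito.readability4j.Article;
import net.dankito.readability4j.Readability4J;

public class ReaderViewExample {

    // Runs Readability4J over HTML that has already been downloaded.
    // The URL is passed along so relative links in the page can be resolved.
    static Article extract(String url, String html) {
        Readability4J readability4J = new Readability4J(url, html);
        return readability4J.parse();
    }

    public static void main(String[] args) {
        // Hypothetical page; replace with the HTML you fetched yourself.
        String url = "https://example.com/posts/reader-view";
        String html =
            "<html><head><title>Reader view test</title></head><body>"
            + "<div id=\"nav\"><a href=\"/\">Home</a> <a href=\"/about\">About</a></div>"
            + "<article><h1>Reader view test</h1>"
            + "<p>This paragraph stands in for the main text of an article. "
            + "A real page would contain much more content than this.</p>"
            + "<p>A second paragraph, so the extractor has something to score.</p>"
            + "</article></body></html>";

        Article article = extract(url, html);

        // Any of these may be null or empty if nothing usable was found.
        System.out.println("Title: " + article.getTitle());
        System.out.println("Plain text: " + article.getTextContent());
        System.out.println("HTML: " + article.getContent());
    }
}
```

`getContent()` gives you the cleaned-up article as HTML, which you can show in a WebView or similar, while `getTextContent()` gives you plain text. Very short pages like the sample above may not extract well, so test with real articles.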