Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Solution for server side web scraping/navigation (with JavaScript support)

I need to do server side web scraping/navigation, including sites with JavaScript, and I need a solution that would work on a hosting plan - I dont have my own server. I came across python/pyside/pyqt4 - this would work perfectly/allow me to navigate sites like a headless browser. However I don't know if this would be possible to install on a remote server/host...

like image 712
James Avatar asked Jun 08 '26 22:06

James


1 Answers

If you need a headless browser, you should check out PhantomJS, and in particular PyPhantomJS, the Python implementation. These might work in a shared hosting context - it really depends on the host. See the build instructions for different platforms - you'd likely need to ask your hosting provider to install.

If you can get this running, you might be interested in checking out pjscrape (disclaimer: this is my project). It's a command-line tool using PhantomJS to allow scraping using JavaScript and jQuery in a full browser context.

like image 169
nrabinowitz Avatar answered Jun 10 '26 10:06

nrabinowitz