Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

MP3 link Crawler

I have been looking into a good way to implement this. I am working on a simple website crawler that will go around a specific set of websites and crawl all the mp3 links into the database. I don't want to download the files, just crawl the link, index them and be able to search them. So far for some of the sites i have been successful, but for some they use url redirects and stuff which confuses the crawler..

any ideas? how does beemp3.com index all these links?

thanks


1 Answers

You can do an http header request to the links and check their mime type. If it is audio/mpeg chances are you are fetching an mp3 link.

like image 66
Federico klez Culloca Avatar answered Dec 01 '25 06:12

Federico klez Culloca