
Do they actually do anything to circumvent paywalls or do websites just whitelist their crawlers?


Websites don’t whitelist their crawlers; the archive sites maintain custom bypasses for a wide variety of websites.

If the websites were inclined to whitelist these crawlers, they’d also whitelist archive.org, which is actually easy to whitelist. Archive.is is not.



