
To be fair, Reddit is actively trying to prevent bots. How do I know? I scrape Reddit threads directly via old.reddit.com URLs, and as of a few months ago even the most sophisticated scraping tools, like BrightData, Undetected Playwright (and Puppeteer), and others, just don't work on Reddit threads anymore.

I now have to use .json at the end of the URL to get the content, but I suspect that'll stop working at some point.
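
For anyone curious what the .json approach looks like, here is a rough Python sketch. It only builds the URL and walks an already-parsed response, with no network call. The helper names (to_json_url, comment_bodies) are made up for illustration, and the payload shape (a two-element list of listings, with comments as "t1" children carrying a "body" field) is an assumption based on what the endpoint has commonly returned, not a documented guarantee.

```python
from urllib.parse import urlsplit, urlunsplit


def to_json_url(thread_url):
    """Return the thread URL with '.json' appended to its path (hypothetical helper)."""
    parts = urlsplit(thread_url)
    path = parts.path.rstrip("/")
    if not path.endswith(".json"):
        path += ".json"
    # Keep any query string (e.g. ?sort=top) and fragment untouched.
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


def comment_bodies(payload):
    """Collect comment text from a parsed thread payload.

    Assumed shape: payload[0] is the post listing, payload[1] the comment
    listing; each comment is a child with kind == "t1" and a "body" field.
    """
    bodies = []

    def walk(listing):
        # "replies" is an empty string when a comment has no replies.
        if not isinstance(listing, dict):
            return
        for child in listing.get("data", {}).get("children", []):
            if child.get("kind") != "t1":
                continue  # skip "more" stubs and anything that is not a comment
            data = child.get("data", {})
            bodies.append(data.get("body", ""))
            walk(data.get("replies"))

    for listing in payload[1:]:
        walk(listing)
    return bodies
```

You would fetch the URL from to_json_url with whatever HTTP client you already use, parse the response as JSON, and pass the result to comment_bodies. If Reddit changes or blocks the endpoint, as suspected above, the shape assumptions here are the first thing to break.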


