HACKER Q&A
📣 jb_briant

Is HN used as AI dataset?


Someone, somewhere is scrapping, right?


  👤 dredmorbius Accepted Answer ✓
I've turned up several of my own HN comments using FastGPT, from Kagi Labs.

Whether that's training data or live search results I'm not entirely sure, but HN definitely contributes to results in that case.


👤 pvg
There's are a couple of different APIs and full datasets are downloadable so the data is readily accessible without scraping anything.

👤 zoezoezoezoe
if it's on the internet, it's in a dataset somewhere at this point.