Is this generally done through web scraping, archival information, web crawler data, or something else entirely?
I'm asking because I would hate to resort to those types of methods, as they are generally resource-heavy and unstable.
It might be wrong, but it's usually pretty dang close to right (presuming the RSS feed isn't brand-new to the reader).
[0] https://github.com/Ranchero-Software/NetNewsWire/blob/941342...
https://github.com/internetarchive/wayback/blob/master/wayba...