Computers are so beefy now that I could download the parts of the internet I use regularly, and it still wouldn't put a dent in my hard drive. I think the compressed text of Stack Overflow and Wikipedia is only a few tens of gigs.
Finding what I'm looking for is still a challenge, though. I tried taking those offline a few years ago using YaCy, which didn't work so well. Maybe Elasticsearch? People say nice things about it.