I could cook up some projects I suppose, and flood it with dummy data. Just curious if there are any better ideas?
Usenet (servers) are as much of a distributed system as a Hadoop cluster.
The same can be said for e.g., DNS vs. Spark.