HACKER Q&A
📣 cglong

How would you solve this one-off data processing problem?


My mom asked me how many World Series championships have been won by each team since 1970. Since I didn't have this data readily available, I had to improvise.

Curious to see how others would solve this problem. For instance, where did you find the data, how did you fetch it and then transform it?


  👤 cglong Accepted Answer ✓
I ended up using Wikipedia and Excel as a quick-and-dirty approach.

Unfortunately, the Wikipedia table has a lot of extraneous annotations, so after copying it into my spreadsheet, I had to do some filtering and manual data munging to get it into a usable format. I then created a PivotTable to do the calculations.


👤 pestatije
I'd go the Wikipedia way...50 data points doesn't look too many, so just a list would do it, maybe a spreadsheet...a few sums, voila...i wouldn't call it a data processing problem though

👤 xrosbeh
A one line SDQL query in the online sportsdatabase.

👤 warpspin
When data like that is available in text format, I often use a simple "sort | uniq -c | sort -g" for that kind of problem.