Best codebases to study to learn software design?

Question

I&rsquo;m working on improving my software design skills, and it was recommended that I study existing well designed codebases. What are some publicly accessible codebases you would consider gold standards for software design?

sprobertson · Accepted Answer

It's been asked a few times, here are some links to get you started:
* https://news.ycombinator.com/item?id=36370684
* https://news.ycombinator.com/item?id=30752540
* https://news.ycombinator.com/item?id=9896369 (Python specific)

rgovostes · Answer

These are several years old at this point, but many open source project leaders contributed to the series "The Architecture of Open Source Applications", which is free to read online: https://aosabook.org/en/index.html

userbinator · Answer

UNIXv6 and the BSDs.

riffraff · Answer

I am not qualified enough to answer, but some ~15 years ago I enjoyed going through the book Code Reading[0], which is about this exact topic.
I think there was another one with a similar name but I can't think of it's name.
0: https://www.spinellis.gr/codereading/, check the TOC https://www.spinellis.gr/codereading/toc.html

pfannkuchen · Answer

Maybe I&rsquo;m just not good enough at paying attention, but for me it seems like you have to actually run into problems over and over and figure out how to avoid the problems. Then you end up being able to mentally simulate what problems you will run into, and design is basically all about avoiding future problems of various kinds (and balancing tradeoffs about which future problems to avoid and how much effort to put into each, whether you can solve multiple with one design play, etc).

belZaah · Answer

My experience (30 years in software, 25 in practicing architecture, MIT system architecture masters) tells me there is no such thing as abstractly &ldquo;good&rdquo; design. There are designs with negative consequences for sure, but &ldquo;good&rdquo; depends on the context: what are you building, safety/security requirements etc. Probably most importantly on the implementation team and it&rsquo;s structure. A team of juniors will butcher your intricate design and Conway&rsquo;s Law makes your software reflect the team.

iphone_elegance · Answer

https://aosabook.org/en/

crystal_revenge · Answer

My immediate reaction to this question is: "your team's". Nothing will teach you more about how to design software then really understanding why good and bad solutions were adapted to solve a certain real problem.
Software exists precisely because there is still a messy layer connecting user requirements to actions on a computer. If there was not messiness then we could just automate it all. Approaching software from some sort of Platonic ideal of what software should be will frequently lead to bad decisions on it's own.
When you start to see how certain pressures lead to certain paths you learn to recognize the wrong decisions that are often good at the time, and avoid them. At the same time, you need to learn to develop methods that work quickly and effectively. By far the biggest real challenge in real world software is time constraints. This is almost never discussed in theoretical views of software, but the truth is you're always going to be writing code under pressure to ship. You will come across situations where you do not have time to do what you want to do or think is best.
Good software is software that runs and solves the user need, but you will come to realize that there are design solutions that will make successfully running happen more often. The best way to find these is to study the real software you're writing.

vogelke · Answer

I'd recommend the book "Beautiful Code: Leading Programmers Explain How They Think". Published by O'Reilly, ISBN-10 &rlm; : 0596510047

BlackFly · Answer

I can recommend reading about postfix architecture if you want to learn a bit about what would nowadays be called a microservice architecture:
https://www.postfix.org/OVERVIEW.html
You might need to know a bit about how email servers work to appreciate it though.

ben30 · Answer

While studying well-designed codebases is incredibly valuable, there's an important "tip of the iceberg" effect to consider: much of good software design lives in the "negative space" - what's deliberately not there.
The decisions to exclude complexity, avoid premature abstractions, or reject certain patterns are often just as valuable as the code you can see. But when you're studying a codebase, you're essentially seeing the final edit without the editor's notes - all the architectural reasoning that shaped those choices is invisible.
This is why I've started maintaining Architectural Decision Records (ADRs) in my projects. These document the "why" behind significant technical choices, including the alternatives we considered and rejected. They're like technical blog posts explaining the complex decisions that led to the clean, simple code you see.
ADRs serve as pointers not just for future human maintainers, but also for AI tools when you're using them to help with coding. They provide readable context about architectural constraints and compromises - "we've agreed not to do X because of Y, so please adhere to Z instead." This makes AI assistance much more effective at respecting your design decisions rather than suggesting patterns you've deliberately avoided.
When studying codebases for design patterns, I'd recommend looking for projects that also maintain ADRs, design docs, or similar decision artifacts. The combination of clean code plus the architectural reasoning behind it - especially the restraint decisions - provides a much richer learning experience.
Some projects with good documentation of their design decisions include Rust's RFCs, Python's PEPs, or any project following the ADR pattern. Often the reasoning about what not to build is more instructive than the implementation itself.

shahzaibmushtaq · Answer

My suggestion is to search for open-source codebases in your favorite languages, study them, and start practicing them.

ChrisArchitect · Answer

https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

tmoravec · Answer

There's a free book on this topic: The Architecture of Open Source Applications
https://aosabook.org/en/index.html
Maybe that would be a good start. You can then pick a project to dive in.
As a more specific tip, I've done some hacks in Nginx long time ago and found it quite nice.

rc_kas · Answer

For website/web server learning I recommend the laravel code-base. It's a beauty.https://www.instagram.com/reel/C2x4Ge5RtNC/

pydry · Answer

I dont think this is a good idea, actually. Good design is about making good trade offs at the right point in time.
A code base with excellent design will show you the end state but not how it got there but probably not the trade offs and decisions involved.
Practicing refactoring on subpar code bases and dealing with the consequences of your decisions is a better way to improve.

RossBencina · Answer

[delayed]

koliber · Answer

Trying to learn software design by looking at code is kind of like trying to learn about architecture looking at the bricks that make up a building.To learn about design you need a wider perspective. You can theoretically learn it from code but it won&rsquo;t be most effective. Look at great documentation and literature about design instead.

SonOfLilit · Answer

To answer your actual question:The codebases I learned from the most are Git, Postgres, CPython. Not saying they are perfect designs, but they are well maintained, solve hard problems, have seen many years of evolution, and are very easy to get your hands on.

chickenzzzzu · Answer

Your own. Everyone else's is trash. Never understand someone else's code-- the whole point is that there should be just a handful of functions of yours I can call, and shouldn't ever care to see how the sausage is made. Otherwise, your abstraction sucks

artpar · Answer

I made a small list long while ago, but it still holds todayhttps://medium.com/@012parth/what-source-code-is-worth-study...

sim7c00 · Answer

study modern code bases of language you want to learn, and look first at their history to see if they are mature, came from a company, academics etc.
you should not only study 'good' code... how will you know what is bad code?
study code that does similar things to what u want (client/server/game/ai/datacrunching etc.) and study lot of it..different qualities, ages, and sources

revskill · Answer

There's no. Most of production codebase outthere is result of hacks.

CraigJPerry · Answer

I've done this for years and swear by it.
Top 5 codebases for changing my mind about things:
Wietse Venema's Postfix mail server. Taught me tons about security posture, the architecture i'd describe as microservices before microservices was a thing, but contrary to the modern take on microservices (it's mostly a tool for decomposing work across large semi-isolated groups) this was primarily about security and simplicity.
Spring framework - this opened my eyes to ways of working that i hadn't really thought enough about before, the developers on that project have a culture of deeply considering the needs of their users (who are java developers often in an enterprise environment).
Git - the thing i like about the git code base is that once you've covered the objects database (e.g. blobs, trees and commits) and the implementation of refs, everything else just feels like additional incremental features. With those core concepts, everything else is kinda harmoniously built on top.
Varnish by Poul Henning-Kamp is another one - feels like he went to great lengths to make that code base a teaching tool despite the fact it's also a top tier reverse proxy.
Last one isn't a code base - but it will help with software design in the large; studying how the lieutenants model works in the linux kernel.
Thinking about my answers, i think i've highlighted something subtly different than "well designed codebases" it's more a list of codebases that left a notable long lasting impression on me because of design decisions they made.

drysine · Answer

>well designed codebasesOne thing to keep in mind is that what was well-designed 30, 20 or 10 years ago may be not considered such now. Hardware changes and so are the design decisions involving performance.For example, if you are looking at C++ networking libraries, learning from ACE or even Asio maybe not the best idea - better look at "thread per core, share nothing" seastar.

goodthink · Answer

https://github.com/newspeaklanguage https://newspeaklanguage.org https://blog.bracha.org

mamcx · Answer

This has 2 key things:* "well designed": What was the objectives and ideas...* "codebases": How well that was implementedThey are a lot of lofty claims saying how this or that is "fast, secure, etc" but don't end like that in the actual implementation.But most of the time, that could be seen in the "design claims" already! Good design is not just full of adjectives and nice sounding goals, but the concrete considerations, what was the trade-offs, false-starts, and reasons behind the decisions.You can see some examples reading about the design of Erlang, early pascal, most RDBMS, etc.So, you first mid/long term goal is to learn to distinguish what good design actual look like.then, in relation with codebases then to be kinda easier: It actually follow the design?A good example is the 'std' library of Rust. It has a lot of lofty claims about security and such things that could sound alarms, but then you dive in the code of it and see is there A LOT of care about it, and a lot of docs comments discussing this stuff and then the code match.P.D: The "std" or equivalent of the lang is one of the most important codebases you need to learn and study, and the MAJOR way to judge how truly good is it.

NoahZuniga · Answer

codemirror 6 is the product of a bunch of redesigns, and this most recent version seems to have had a lot of thought put into its structure.

ramesh31 · Answer

Doom 3: https://github.com/id-Software/DOOM-3

alphazard · Answer

The codebase is not where the design usually lives. It's where the implementation lives. You could imagine a rewrite into another programming language which would preserve the design but completely replace the implementation.
You should practice writing design docs. Don't worry about what the doc is supposed to look like, and definitely don't work off of a template. The most important thing about the doc is that another human could do the implementation if you gave it to them.
The doc can also function as a "proof of consideration". If you choose to do something one way, but there are other possible ways to do it, you can acknowledge the other possible ways, and say why they are worse. By preemptively acknowledging an alternative, you have proved to readers that you considered it.
All a "good" system designer is doing is considering a larger design space than most, and consistently finding good points in the space. Pick a problem, sample points in the design space, tell me why some points are better than others, and write it all down.

Best codebases to study to learn software design?

I’m working on improving my software design skills, and it was recommended that I study existing well designed codebases. What are some publicly accessible codebases you would consider gold standards for software design?

It's been asked a few times, here are some links to get you started:
* https://news.ycombinator.com/item?id=36370684
* https://news.ycombinator.com/item?id=30752540
* https://news.ycombinator.com/item?id=9896369 (Python specific)

These are several years old at this point, but many open source project leaders contributed to the series "The Architecture of Open Source Applications", which is free to read online: https://aosabook.org/en/index.html

UNIXv6 and the BSDs.

https://aosabook.org/en/

I'd recommend the book "Beautiful Code: Leading Programmers Explain How They Think". Published by O'Reilly, ISBN-10 ‏ : 0596510047

I can recommend reading about postfix architecture if you want to learn a bit about what would nowadays be called a microservice architecture:
https://www.postfix.org/OVERVIEW.html
You might need to know a bit about how email servers work to appreciate it though.

My suggestion is to search for open-source codebases in your favorite languages, study them, and start practicing them.

https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

There's a free book on this topic: The Architecture of Open Source Applications
https://aosabook.org/en/index.html
Maybe that would be a good start. You can then pick a project to dive in.
As a more specific tip, I've done some hacks in Nginx long time ago and found it quite nice.

For website/web server learning I recommend the laravel code-base. It's a beauty.
https://www.instagram.com/reel/C2x4Ge5RtNC/

[delayed]

To answer your actual question:
The codebases I learned from the most are Git, Postgres, CPython. Not saying they are perfect designs, but they are well maintained, solve hard problems, have seen many years of evolution, and are very easy to get your hands on.

Your own. Everyone else's is trash. Never understand someone else's code-- the whole point is that there should be just a handful of functions of yours I can call, and shouldn't ever care to see how the sausage is made. Otherwise, your abstraction sucks

I made a small list long while ago, but it still holds today
https://medium.com/@012parth/what-source-code-is-worth-study...

There's no. Most of production codebase outthere is result of hacks.

https://github.com/newspeaklanguage https://newspeaklanguage.org https://blog.bracha.org

codemirror 6 is the product of a bunch of redesigns, and this most recent version seems to have had a lot of thought put into its structure.

Doom 3: https://github.com/id-Software/DOOM-3

Best codebases to study to learn software design?

I’m working on improving my software design skills, and it was recommended that I study existing well designed codebases. What are some publicly accessible codebases you would consider gold standards for software design?

It's been asked a few times, here are some links to get you started:* https://news.ycombinator.com/item?id=36370684* https://news.ycombinator.com/item?id=30752540* https://news.ycombinator.com/item?id=9896369 (Python specific)

These are several years old at this point, but many open source project leaders contributed to the series "The Architecture of Open Source Applications", which is free to read online: https://aosabook.org/en/index.html

UNIXv6 and the BSDs.

https://aosabook.org/en/

I'd recommend the book "Beautiful Code: Leading Programmers Explain How They Think". Published by O'Reilly, ISBN-10 ‏ : 0596510047

I can recommend reading about postfix architecture if you want to learn a bit about what would nowadays be called a microservice architecture:https://www.postfix.org/OVERVIEW.htmlYou might need to know a bit about how email servers work to appreciate it though.

My suggestion is to search for open-source codebases in your favorite languages, study them, and start practicing them.

https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

There's a free book on this topic: The Architecture of Open Source Applicationshttps://aosabook.org/en/index.htmlMaybe that would be a good start. You can then pick a project to dive in.As a more specific tip, I've done some hacks in Nginx long time ago and found it quite nice.

For website/web server learning I recommend the laravel code-base. It's a beauty.https://www.instagram.com/reel/C2x4Ge5RtNC/

[delayed]

To answer your actual question:The codebases I learned from the most are Git, Postgres, CPython. Not saying they are perfect designs, but they are well maintained, solve hard problems, have seen many years of evolution, and are very easy to get your hands on.

Your own. Everyone else's is trash. Never understand someone else's code-- the whole point is that there should be just a handful of functions of yours I can call, and shouldn't ever care to see how the sausage is made. Otherwise, your abstraction sucks

I made a small list long while ago, but it still holds todayhttps://medium.com/@012parth/what-source-code-is-worth-study...

There's no. Most of production codebase outthere is result of hacks.

https://github.com/newspeaklanguage https://newspeaklanguage.org https://blog.bracha.org

codemirror 6 is the product of a bunch of redesigns, and this most recent version seems to have had a lot of thought put into its structure.

Doom 3: https://github.com/id-Software/DOOM-3

It's been asked a few times, here are some links to get you started:
* https://news.ycombinator.com/item?id=36370684
* https://news.ycombinator.com/item?id=30752540
* https://news.ycombinator.com/item?id=9896369 (Python specific)

I can recommend reading about postfix architecture if you want to learn a bit about what would nowadays be called a microservice architecture:
https://www.postfix.org/OVERVIEW.html
You might need to know a bit about how email servers work to appreciate it though.

There's a free book on this topic: The Architecture of Open Source Applications
https://aosabook.org/en/index.html
Maybe that would be a good start. You can then pick a project to dive in.
As a more specific tip, I've done some hacks in Nginx long time ago and found it quite nice.

For website/web server learning I recommend the laravel code-base. It's a beauty.
https://www.instagram.com/reel/C2x4Ge5RtNC/

To answer your actual question:
The codebases I learned from the most are Git, Postgres, CPython. Not saying they are perfect designs, but they are well maintained, solve hard problems, have seen many years of evolution, and are very easy to get your hands on.

I made a small list long while ago, but it still holds today
https://medium.com/@012parth/what-source-code-is-worth-study...