We have many at the company, but none of them really gave a good overview of the KPIs we wanted.
Components: a few Perl scripts, launched from a single sh script, plus nginx and letsencrypt.
Basically it collects metrics from remote APIs and from other internal monitoring systems, and writes static JSON files into /dev/shm/, which nginx (with a letsencrypt certificate) serves from there.
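A minimal sketch of the "write static JSON for nginx" step, in Python rather than the Perl the real scripts use. The function name, the file name, and the write-to-temp-then-rename approach are my assumptions for illustration, not the actual implementation. The point of the rename is that a browser polling the file never sees a half-written JSON document.

```python
import json
import os
import tempfile


def write_json_atomically(data, dest_path):
    """Write `data` as JSON to `dest_path` without exposing a partial file.

    The temp file is created in the same directory as the destination so
    that os.replace() is a rename within one filesystem, which is atomic.
    """
    dest_dir = os.path.dirname(dest_path) or "."
    fd, tmp_path = tempfile.mkstemp(dir=dest_dir, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as tmp:
            json.dump(data, tmp)
        # mkstemp creates the file with mode 0600; relax it so the web
        # server user can read the result.
        os.chmod(tmp_path, 0o644)
        os.replace(tmp_path, dest_path)
    except BaseException:
        # Do not leave stray temp files behind if anything failed.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise


if __name__ == "__main__":
    # Hypothetical metric names and output file, for illustration only.
    kpis = {"open_tickets": 12, "signups_today": 34}
    write_json_atomically(kpis, "/dev/shm/kpis.json")
```

Since /dev/shm/ is RAM-backed, the files disappear on reboot, which is fine here because the collector regenerates them on its next run.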
The frontend is just a few vanilla HTML files with vanilla JavaScript, a dark theme, and a mobile-first responsive design (it's mostly tables to show KPIs). I'm a sysadmin, not a frontend designer, but everyone in the company (even the frontend team) likes it.
Built between 3 and 4 years ago, in two days. One week later I adjusted the logrotate rules for the vhost, and I have never touched it since.
The whole company looks at it daily (developers, support, infrastructure, even the CEO), around 80 users during office hours, and it's the first place I check on my phone, wherever I am, when we get any alert.
Another one that I had completely forgotten about, but remembered while thinking about this topic: a service to adjust Kafka retention.
It collects the disk usage of all the kafka-logs volumes on our Kafka servers. We use a replication factor of 3, so it has some logic to decide whether the retention needs to be reduced or can be increased. It has upper and lower limits for the retention of each topic, and a default for undefined topics...
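Roughly, the decision could look like the Python sketch below. Every number and name in it is an assumption made up for illustration (the 80%/60% thresholds, the 6-hour step, the topic names, the limits); the real service may well decide differently. With replicated topics the data sits on several brokers, so this sketch simply takes the fullest volume as the one that matters.

```python
# Hypothetical per-topic (min_hours, max_hours) limits, plus a default
# for topics that are not listed.
DEFAULT_LIMITS = (24, 168)
TOPIC_LIMITS = {
    "orders": (48, 336),
    "debug-events": (6, 48),
}


def limits_for(topic):
    """Return (min_hours, max_hours) for a topic, falling back to the default."""
    return TOPIC_LIMITS.get(topic, DEFAULT_LIMITS)


def decide_retention_hours(current_hours, disk_used_fraction,
                           min_hours, max_hours,
                           high_water=0.80, low_water=0.60, step_hours=6):
    """Shrink retention when disks are full, grow it when there is room.

    Between the two thresholds nothing changes, which avoids flapping.
    The result is always clamped to [min_hours, max_hours].
    """
    if disk_used_fraction >= high_water:
        proposed = current_hours - step_hours
    elif disk_used_fraction <= low_water:
        proposed = current_hours + step_hours
    else:
        proposed = current_hours
    return max(min_hours, min(max_hours, proposed))


def plan_retention(topic, current_hours, volume_usage):
    """Pick the new retention for a topic from per-volume usage fractions."""
    min_hours, max_hours = limits_for(topic)
    worst = max(volume_usage)  # the fullest kafka-logs volume drives the decision
    return decide_retention_hours(current_hours, worst, min_hours, max_hours)


if __name__ == "__main__":
    # One volume is at 85%, so the retention of this unlisted topic is reduced.
    print(plan_retention("some-topic", 72, [0.55, 0.85, 0.62]))
```

The resulting hours would then be converted to milliseconds and applied as the topic's `retention.ms` setting.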
Before we had it, we needed to adjust/restore the Kafka retention almost daily, and check everything by hand.
Developed in one day, and I forgot about it until today.
So I block all foreign country IPs, have a pretty good lockdown on open ports, and check for updates every now and then.