Looking for some good recommendations.
For example:
1/ https://rootly.com/blog/monitoring-your-platform-from-multip... 2/ https://rootly.com/blog/the-role-of-sres-in-observability
Once you're familiar with the basics, you can pickup some docs/videos around OpenTelemetry to read about how it is done in the real world. Pretty much all obs systems today have otel support.
This sets you up to play around and learn PromQL.