How to automate the Kubernetes application debugging process?

Question

No-brainer things I find myself doing again and again while debugging Kubernetes issues:

1. Debugging pods in CrashLoopBackOff
2. Checking events, logs, labels, pods, services
3. Looking for older events which are gone :(
4. Frequently logging into the cloud provider's dashboard to figure out if there is any issue with the cloud provider
5. Traffic is not being received by downstream applications
6. Ensuring services are selecting the right pods
7. Launching a pod to execute curl/dnsutils/awscli
8. For an externally exposed service, figuring out whether the ingress is routing traffic correctly and that no other config is superseding it
9. Doing exec into a pod to check whether configmap/secret changes are reflected, or killing the pod if feeling too lazy to check
10. Figuring out why a node is not ready
11. Checking RAM and CPU utilization
12. Figuring out how this application is deployed: Helm, Argo CD, Flux, Tekton, wf
13. Checking if the manifest has changed recently and comparing for manifest misconfiguration
14. Comparing the manifest with another env's manifest to be sure a new config parameter has not been missed
15. Building a mental model of the application's context boundary

Have you felt the same? I want to automate this. Which feature should I implement first?
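As a possible starting point for items 1 and 6, here is a minimal sketch that works on the dicts you get from parsing `kubectl get ... -o json` output. The function names `service_selects_pod` and `crashlooping_containers` are made up for illustration, and the sketch only reads the standard `metadata.labels`, `spec.selector`, and `status.containerStatuses` fields. It is one way to approach it, not a complete checker.

```python
from typing import Any


def service_selects_pod(service: dict[str, Any], pod: dict[str, Any]) -> bool:
    """Return True if the Service's label selector matches the Pod's labels.

    Both arguments are dicts shaped like `kubectl get <kind> <name> -o json`.
    """
    selector = service.get("spec", {}).get("selector") or {}
    labels = pod.get("metadata", {}).get("labels") or {}
    # A Service without a selector does not pick pods on its own.
    if not selector:
        return False
    # Services only select pods in their own namespace.
    svc_ns = service.get("metadata", {}).get("namespace", "default")
    pod_ns = pod.get("metadata", {}).get("namespace", "default")
    if svc_ns != pod_ns:
        return False
    # Every selector key must be present on the pod with the same value.
    return all(labels.get(key) == value for key, value in selector.items())


def crashlooping_containers(pod: dict[str, Any]) -> list[str]:
    """Return names of containers currently waiting in CrashLoopBackOff."""
    names = []
    for status in pod.get("status", {}).get("containerStatuses") or []:
        waiting = (status.get("state") or {}).get("waiting") or {}
        if waiting.get("reason") == "CrashLoopBackOff":
            names.append(status["name"])
    return names
```

To use it, parse the command output with `json.loads` and pass in the individual objects; for a list command such as `kubectl get pods -o json`, the pods are under the `items` key.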

bg24 · Accepted Answer

You may have captured these as part of some bullet points already:

16. Network connectivity. Make sure that the resource is accessible. It could be the API server, a controller, or an application pod.
18. What changed in the software versions: install/remove/update
19. Is your DNS working correctly?
20. Does the pod have the right permissions (RBAC)?
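The connectivity and DNS checks above could be scripted roughly like this, for example from a debug pod inside the cluster. This is a minimal sketch using only the Python standard library; the helper names `dns_resolves` and `tcp_reachable` are hypothetical, and the 2-second default timeout is an arbitrary assumption. The RBAC question is not covered here; `kubectl auth can-i` is the usual tool for that.

```python
import socket


def dns_resolves(hostname: str) -> bool:
    """Return True if the hostname resolves to at least one address."""
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except socket.gaierror:
        return False


def tcp_reachable(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False
```

Checking DNS first and TCP second helps separate "the name does not resolve" from "the name resolves but nothing answers on that port", which usually point to different root causes.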

streetcat1 · Answer

You should write an operator for inner checks.
This might be helpful:
https://learnk8s.io/troubleshooting-deployments
I am planning to do the same for a platform that I am building, which is deployed on-prem. Let me know if this is an open source project.

prakarsh · Answer

Following this thread, will post some Kubernetes debugging issues.
