Did anyone implement something similar? How did it go? How much time did it save? What was the cost improvement? I recently found this tool in the AWS samples: https://github.com/aws-samples/scalable-hw-agnostic-inference
I'm wondering if anyone used/tried it or other approaches?
[edit: sorry, not inference, but a great cost-saver]