I recently started looking at CMUSphinx, Kaldi and DeepSpeech.
Do you have any tips for me?
It is a python wrapper for a library for voice activity detection. It acts as a starting point while working on speech recognition problems. Helped me understand and discover a lot of concepts related to audio signal and data when I was in your shoes.