Not sure if this has been mentioned before, but would Jasper (http://jasperproject.github.io/) be of any use here?
Jasper uses Pocketsphinx for voice recognition.