Want to understand the topic. A good idea of how the decoding of phonemes to words and gammatech. But it is absolutely not guided in the first stages of signal processing: signal -> vector feature -> phonemes.
May be someone doing it. It would make links to articles and tools for decoding the signal into a vector of features and phonemes, and the corresponding dictionaries. Looked diagonally cmu-sphinx, flying is not understood.