How to improve SDK accuracy?

Message Topic Search Topic Options Post Reply Create New Topic Printable Version Translate Topic

   Hi, I'm not sure its the right place to ask a question about the MS Speech Engine SDk. Hopefully some one could help me with my problem.
I've been trapped in this problem for a very long time. I'm developing an application that take audio files as input and generate transcript from it using SAPI 5.1. However the accuracy is too disappointing, the accuracy is almost below 30%, most of the time the engine just guess what uttered in the audio file, even with a good quality audio file without any back ground noise and has standard pronunciation. I use dictation grammar and the wav file format is 16 bit,44100 hz and mono. Could anyone told me what should I do to improve the accuracy or it's the nature of MS SAPI that could only recognize voice correctly after trained? Is there any way to train the speech engine with the audio file which might including multiple speekers?