Sphinx-4 Transcriber Demo
A simple Sphinx-4 application that transcribes a continuous audio file
that has multiple utterances. The audio file should contain connected
digits data. The default file, called "10001-90210-01803.wav", contains
three utterances, separated by silences.
To transcribe non-digits data, modify the config.xml file
to use the appropriate grammar, language model, and linguist.
Refer to the Programmer's Guide
for how to modify the configuration file for your purposes.
Check whether the bin directory already contains the
Transcriber.jar file. If not, type the following in the top
level directory:
ant -find demo.xml
First make sure that you have JSAPI set up correctly. Then, to run the demo, type:
sphinx4> java -jar bin/Transcriber.jar
You will see the following result, with each utterance on its own line:
one zero zero zero one
nine oh two one oh
zero one eight zero three
NOTE:
Make sure that lib/sphinx4.jar is built. If not, go to the top level
directory and type: ant
To transcribe a different file, for example test.wav, type:
java -jar bin/Transcriber.jar test.wav
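
Before transcribing a file other than the default, it can help to check its audio format. The following is a minimal sketch using only the standard javax.sound.sampled API; it is not part of the demo. The class name WavCheck and the choice of properties to report are assumptions made for illustration. This document does not state which sample rate or sample size the demo expects, so compare the output against the default file and against your config.xml.

```java
import java.io.File;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;

// Hypothetical helper, not part of Sphinx-4: reports basic WAV properties.
public class WavCheck {

    // Summarizes sample rate, sample size, and channel count of a format.
    static String describe(AudioFormat format) {
        return (int) format.getSampleRate() + " Hz, "
                + format.getSampleSizeInBits() + "-bit, "
                + format.getChannels() + " channel(s)";
    }

    // Opens the audio file just long enough to read its format header.
    static String describe(File wav) throws Exception {
        try (AudioInputStream in = AudioSystem.getAudioInputStream(wav)) {
            return describe(in.getFormat());
        }
    }

    public static void main(String[] args) throws Exception {
        // Falls back to test.wav, the example file name used in the note above.
        File wav = new File(args.length > 0 ? args[0] : "test.wav");
        System.out.println(wav + ": " + describe(wav));
    }
}
```

Running it once on the default file and once on your own file shows whether the two formats match before you start the transcriber.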