...

-fwdflat 
-bestpath 
-lm [the language model file to be used]
-dict [the dictionary to be used]
-hmm [the acoustic model. By default, the Virtual Human Toolkit uses the Wall Street Journal acoustic model that comes with pocketsphinx, together with the CMU pronunciation dictionary. You can change this to use your own.]
-samprate [the sampling rate]
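
Because the language model and the dictionary have to agree on vocabulary, it can help to check a word list against the dictionary before pointing -lm and -dict at new files. The following is a minimal sketch, not part of the toolkit: the function names are hypothetical, and it assumes a CMU-style dictionary in which each line is a word followed by its phonemes, with alternate pronunciations written as word(2).

```python
import re


def load_dictionary_words(dict_lines):
    """Collect headwords from CMU-style dictionary lines.

    Assumption: each line looks like 'word PH1 PH2 ...'. Alternate
    pronunciations such as 'word(2)' are folded into the base word,
    and lines starting with ';;;' are treated as comments.
    """
    words = set()
    for line in dict_lines:
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue
        head = line.split()[0]
        words.add(re.sub(r"\(\d+\)$", "", head).lower())
    return words


def missing_words(text_lines, dictionary_words):
    """Return the sorted words in text_lines that have no dictionary entry."""
    missing = set()
    for line in text_lines:
        for token in line.lower().split():
            if token not in dictionary_words:
                missing.add(token)
    return sorted(missing)
```

Words reported by missing_words would need a pronunciation entry added to the dictionary, or would need to be removed from the text, before they could be recognized.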

 

Creating a language model for pocketsphinx-sonic-server

Follow the steps below to create your own language model for use with pocketsphinx-sonic-server.

...

First, you will need the "jasr" tool, which is located under lib/jasr in the Virtual Human Toolkit folder. To build the language models themselves, you can use 'cmuslm' or 'srilm', both of which are included in the jasr folder. However, 'cmuslm' currently works only on Linux, while 'srilm' works on Linux and Windows XP.

Do the following:

  • Go to the "trunk\lib\jasr\bin\srilm" folder within the toolkit install.
  • Run the following command:

     create-lm.bat <corpus.txt> <output.arpa>

     where,
    - corpus.txt is a text file whose lines contain the words that make up the vocabulary for your domain
    - output.arpa is the name of the output language model file
Once you have created the language model, point to it in the config file that you pass to pocketsphinx-sonic-server.
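
The corpus file passed to create-lm.bat is plain text with one utterance per line, so raw domain text usually needs some cleanup first. The following is a minimal sketch of one way to do that, not a toolkit script: the function names are hypothetical, and lowercasing and stripping punctuation are assumptions that should be adjusted to match the casing and tokens used in your dictionary.

```python
import re


def normalize_line(line):
    """Lowercase a line, drop punctuation except apostrophes, collapse spaces."""
    line = line.lower()
    # Anything other than letters, digits, apostrophes and spaces becomes a space.
    line = re.sub(r"[^a-z0-9' ]+", " ", line)
    return " ".join(line.split())


def build_corpus(raw_lines):
    """Return the normalized, non-empty lines in their original order.

    Repeated lines are kept on purpose, since repetition affects the
    word counts that the language model is estimated from.
    """
    corpus = []
    for raw in raw_lines:
        cleaned = normalize_line(raw)
        if cleaned:
            corpus.append(cleaned)
    return corpus


if __name__ == "__main__":
    # Hypothetical example sentences; replace with text from your own domain.
    sentences = ["Hello, how are you?", "What's your name?"]
    with open("corpus.txt", "w", encoding="utf-8") as handle:
        handle.write("\n".join(build_corpus(sentences)) + "\n")
```

The resulting corpus.txt can then be given to create-lm.bat as its first argument.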

Developers

How can programmers modify or add functionality? 

...