Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: This has multiple issues, but cleaned up the formatting

...

  • -c [file-name]
    This config file should contain the following information specified as below
  • -fwdflat
  • -bestpath
  • -lm [the language model file to be used]
  • -dict [the dictionary to be used]
  • -hmm [the acoustic mode]
    By default, the Virtual Human Toolkit uses the wall street journal acoustic model that comes with pocketsphinx and the CMU pronunciation dictionary. You can change this to use your own.
  • -samprate [the sampling rate]

Creating a language model for

...

PocketSphinx Wrapper

You will need to follow the below steps for creating your own language model for use with pocketsphinx-sonic-serverPocketSphinx Wrapper. First of all, we will need the "jasr" tool. This tool is present under lib/jasr in the virtual human toolkit folderToolkit. To make the actual Language-Modelslanguage models, you can use 'cmuslm' or 'srilm', which are both included within the jasr folder. However 'cmuslm' currently only works for on Linux and , while 'srilm' works for linux and Windows XPis compatible with both Windows and Linux.

You should do the following:

  • Go to the "\lib\jasr\bin\srilm" folder within the toolkit installToolkit
  •  run
    Run the following command
     create
     
    create-lm.bat <corpus.txt> <output.arpa>

     where,

    - corpus.txt is a  file containing individual lines(text) that comprise the words that make up the vocabulary for your domain

    - output.arpa is the name of the output language model file

Once you create the language model, you just need to point to it in your config file that is input to pocketsphinx-sonic-server and it's good to go.

...