CMU Sphinx Open Source Models

Table of Contents

  1. Language Model Resources
  2. Language models for HUB4 (broadcast news)
  3. General Notes on Acoustic Models
    1. PocketSphinx Specific Notes
    2. Sphinx-3 Specific notes
  4. Acoustic Models for Sphinx-3 and PocketSphinx
  5. Models for Sphinx-2 (obsolete)

Language Model Resources

Language models for HUB4 (broadcast news)

General Notes on Acoustic Models

All models use the new "40 phone" phoneset and are compatible with CMUDict 0.6d and 0.7, as well as the Sphinx Knowledge Base Tool. These models are released under the same permissive license as Sphinx-3. Although the databases they are trained from are not freely redistributable, as far as we know, this does not affect models trained on them.

These models are packaged for use with Sphinx-3 and PocketSphinx. Unpack them using tar and take note of the directory that is created by doing so. To use them, you will specify this directory in your configuration file, or on the command line, using the -hmm argument. This will ensure that the proper feature extraction parameters are picked up by the decoder.

PocketSphinx Specific Notes

The mixture_weights file is not necessary in order to use the semi-continuous models with PocketSphinx. Since it is quite large, you may choose to omit it in cases where storage is limited. We have included it because it is (currently) necessary in order to do acoustic model adaptation.

If you use these continuous density models with PocketSphinx you may encounter segfaults. This is a known bug which will be fixed in the next release. To solve it, you need to edit the noisedict file and remove all lines where the pronunciation of a word contains phones other than SIL or noise phones (ones starting and ending with +). Alternately, here is a corrected noise dictionary. Copy it over the noisedict file.

Sphinx-3 Specific Notes

While Sphinx-3 supports MLLT feature transformations, version 0.7 does not find them automatically in the acoustic model directory. Therefore, for this version, in order to use the wideband WSJ models below, you will need to add this argument to the configuration or command line:

-lda wsj_all_cd30.mllt_cd_cont_4000/feature_transform

If you aren't running the directory from the same directory as the model, you will need to add the full path to it.

Note also that if you intend to use the semi-continuous models with Sphinx-3, you must add these arguments to your configuration:

-feat s2_4x -senmgau .s2semi.

Acoustic Models for Sphinx-3 and PocketSphinx

Models for Sphinx-2 (obsolete)

Important note: These models use an older (44-phone) phoneset, so you must use the Advanced Sphinx Knowledge Base Tool instead (select "Reduced (Sphinx_44)" in the dictionary and language model parameters).

Maintained by David Huggins-Daines
Last modified: Wed Mar 19 15:36:12 EDT 2008