LexTool

LOGIOS Lexicon Tool


This tool generates a pronunciation dictionary suitable for speech recognition, in particular for the Sphinx system.
The tool currently uses cmudict.0.7b and the (currently standard) 40-item phone inventory.
Please note that the dictionary may be updated from time to time and that your results may vary as a consequence.

If you notice any errors in the output (such as a seemingly incorrect pronunciation), please report them and we will look into them. You can send reports to air:cs'cmu,edu.


word file:
hand file:



An example

If your input file looks something like this:

Hello
world
compound_word
hyphen-ated
ONE23
2008
boom!
kweezlebotter

Your output file will look something like this:

HELLO	HH EH L OW
HELLO(1)	HH AH L OW
WORLD	W ER L D
COMPOUND_WORD	K AA M P AW N D W ER D
HYPHEN-ATED	HH AY F AH N EY T IH D
ONE23	OW EH N IY T UW TH R IY
2008	T UW Z IY R OW Z IY R OW EY T
BOOM!	B UW M
KWEEZLEBOTTER	K W IY Z L AH B AA T AH R
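
For readers who want to consume the output file in their own scripts, here is a minimal Python sketch of one way to read it. It is not part of the tool. It assumes each line holds a word followed by whitespace-separated phones, and that a trailing "(N)" on the word marks an alternate pronunciation, as with HELLO(1) in the example. The function name parse_dictionary is hypothetical.

```python
import re
from collections import defaultdict

# Assumption: an alternate pronunciation is marked by "(N)" at the end of the word field.
_VARIANT = re.compile(r"^(?P<word>.+?)(?:\((?P<n>\d+)\))?$")


def parse_dictionary(text):
    """Map each word to a list of pronunciations, each a list of phones."""
    entries = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # The first field is the word; everything after it is the phone sequence.
        head, *phones = line.split()
        word = _VARIANT.match(head).group("word")
        entries[word].append(phones)
    return dict(entries)


sample = "HELLO\tHH EH L OW\nHELLO(1)\tHH AH L OW\nWORLD\tW ER L D\n"
lexicon = parse_dictionary(sample)
```

Grouping the variants under one key keeps every pronunciation of a word together, in the order the file lists them.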

Please note the following: