Main Page | Class List | Directories | File List | Class Members | File Members

fast_algo_struct.h File Reference

File that implement various structure for fast algorithms. fast_algo_struct implement beam_t, histprune_t, pl_t and fast_algo_struct_t. More...

#include <s3types.h>
#include <mdef.h>
#include <ascr.h>
#include <logmath.h>

Go to the source code of this file.

Classes

struct  beam_t
 Structure that contains all beam parameters for beam pruning in Viterbi algorithm. More...
struct  pl_t
 Structure that contains all parameters for phoneme lookahead. More...
struct  histprune_t
 Structure containing various histogram pruning parameters and internal storage All in integers. More...
struct  downsampling_t
 Structure containing various parameters for downsampling. More...
struct  gmm_select_t
 Structure that contains all parameters for CI-based GMM selection. More...
struct  gau_select_t
 Structure that contains all parameters related to Gaussian selection. More...
struct  fast_gmm_t
 Structure that contains all parameter related to 4-Level Fast GMM computation. More...

Functions

beam_tbeam_init (float64 hmm, float64 ptr, float64 wd, float64 wdend, int32 ptranskip, int32 n_ciphone, logmath_t *logmath)
void beam_report (beam_t *b)
void beam_free (beam_t *b)
histprune_thistprune_init (int32 maxhmm, int32 maxhist, int32 maxword, int32 hmmhistbinsize, int32 numNodes)
void histprune_zero_histbin (histprune_t *h)
void histprune_update_histbinsize (histprune_t *h, int32 hmmhistbinsize, int32 numNodes)
void histprune_report (histprune_t *h)
void histprune_showhistbin (histprune_t *hp, int32 nfr, char *uttid)
void histprune_free (histprune_t *h)
S3DECODER_EXPORT fast_gmm_tfast_gmm_init (int32 down_sampling_ratio, int32 mode_cond_ds, int32 mode_dist_ds, int32 isGS4GS, int32 isSVQ4SVQ, float64 subvqbeam, float64 cibeam, float32 tighten_factor, int32 max_cd, int32 n_ci_sen, logmath_t *logmath)
void fast_gmm_report (fast_gmm_t *f)
S3DECODER_EXPORT void fast_gmm_free (fast_gmm_t *fg)
pl_tpl_init (int32 pheurtype, float64 pl_beam, int32 n_ciphone, logmath_t *logmath)
void pl_report (pl_t *pl)
void pl_free (pl_t *pl)
void pl_computePhnHeur (mdef_t *md, ascr_t *a, pl_t *pl, int32 heutype, int32 win_strt, int32 win_efv)


Detailed Description

File that implement various structure for fast algorithms. fast_algo_struct implement beam_t, histprune_t, pl_t and fast_algo_struct_t.


Function Documentation

void beam_free beam_t b  ) 
 

free the beam_t data structure

Parameters:
b  beam data structure

beam_t* beam_init float64  hmm,
float64  ptr,
float64  wd,
float64  wdend,
int32  ptranskip,
int32  n_ciphone,
logmath_t *  logmath
 

Create and initialize a beam_t structure, with the given parameters, converting them from prob space to logs3 space.

Note the last parameter is used in controling when the word beam is applied.

Returns:
a pointer to created structure if successful, NULL otherwise.
Parameters:
hmm  Input: hmm beam
ptr  Input: phone transition beam
wd  Input: word beam
wdend  Input: word end beam.
ptranskip  Input: whether to apply phoneme transition beam rather than word beam.
n_ciphone  Input: number of ciphone to initialized arrays used in word end pruning.

void beam_report beam_t b  ) 
 

report the content of the beam_t data structure

Parameters:
b  beam data structure

S3DECODER_EXPORT void fast_gmm_free fast_gmm_t fg  ) 
 

Free the fast_gmm_t structure

Parameters:
fg  Input: structure to free

S3DECODER_EXPORT fast_gmm_t* fast_gmm_init int32  down_sampling_ratio,
int32  mode_cond_ds,
int32  mode_dist_ds,
int32  isGS4GS,
int32  isSVQ4SVQ,
float64  subvqbeam,
float64  cibeam,
float32  tighten_factor,
int32  max_cd,
int32  n_ci_sen,
logmath_t *  logmath
 

Create and initialize a fast_gmm_t structure, withe the given parameters

Parameters:
down_sampling_ratio  Input: The frame down-sampling ratio
mode_cond_ds  Input: Whether conditional down sampling is used
mode_dist_ds  Input: (NOT USED) Whether distance-based down sampling is used
isGS4GS  Input: Whether Gaussian selection map is only used for Gaussian Selection. Not for down sampling
isSVQ4SVQ  Input: Whether SVQ is used in acoustic model computation.
subvqbeam  Input: Subvq beam
cibeam  Input: CI phone beam
tighten_factor  Input : A tightening factor used in down sampling
max_cd  Input: Max CD senone to be computed
n_ci_sen  Input: no. of ci senone, use to initialize the ci_occ array

void fast_gmm_report fast_gmm_t f  ) 
 

report the content of the fast_gmm_t data structure

Parameters:
f  Input: the fast GMM computation structure

void histprune_free histprune_t h  ) 
 

free the histprune data structure

Parameters:
h  Input/Output :histprune structure

histprune_t* histprune_init int32  maxhmm,
int32  maxhist,
int32  maxword,
int32  hmmhistbinsize,
int32  numNodes
 

Create and initialize a histprune_t structure, with the given parameters.

Parameters:
maxhmm  Input: Maximum HMM per frame
maxhist  Input: Maximum history per frame
hmmhistbinsize  Input: Maximum no. words per frame Input: The size of each histogram
numNodes  Input: The number of nodes in the tree (or graph?)

void histprune_report histprune_t h  ) 
 

report the content of the histprune_t data structure

Parameters:
h  Input: histprune structure

void histprune_showhistbin histprune_t hp,
int32  nfr,
char *  uttid
 

report the histogram bins

Parameters:
hp  Input: histprune structure
nfr  Number of frames in an utterance
uttid  Utterance ID

void histprune_update_histbinsize histprune_t h,
int32  hmmhistbinsize,
int32  numNodes
 

Parameters:
h  Input: histprune structure
hmmhistbinsize  Input: The size of each histogram
numNodes  Input: The number of nodes in the tree (or graph?)

void histprune_zero_histbin histprune_t h  ) 
 

Parameters:
h  Input: histpurne structure

void pl_computePhnHeur mdef_t md,
ascr_t a,
pl_t pl,
int32  heutype,
int32  win_strt,
int32  win_efv
 

Phoneme look-ahead and compute phoneme heuristic. This routine will compute a phone heuristic scores with lookahead win_efv frame and start at frame win_strt.

Parameters:
md  Input: a model definition file
a  Input: an acoustic score data structure
pl  Input: a phoneme-lookahead data structure
heutype  Input: type of heuristic used
win_strt  Input: Starting frame of a window
win_efv  Input: The effective window size

void pl_free pl_t pl  ) 
 

Free the phoneme lookahead data structure

Parameters:
pl  Input/Output: a phoneme looahead data structure

pl_t* pl_init int32  pheurtype,
float64  pl_beam,
int32  n_ciphone,
logmath_t *  logmath
 

Create and initialize the pl_t data structure

Returns:
an initialized phoneme lookahead data structure
Parameters:
pheurtype  Input: Phoneme lookahead heuristic types TBD: (NOT DETAIL ENOUGH)
pl_beam  Input: Phoneme lookahead beam
n_ciphone  Input: Number of CI phones used in the cache size.

void pl_report pl_t pl  ) 
 

report the content of the phoneme lookahead data structure pl_t

Parameters:
pl  Input: a phoneme lookahead data structure


Generated on Sat Apr 11 00:02:29 2009 by  doxygen 1.3.9.1