Hidden Markov Models
A Hidden Markov Model $(N, \Sigma, \Theta)$ consists of the following elements:
1. $N$ is a positive integer specifying the number of states in the model. Without loss of generality, we will take the $N$th state to be a special state, the final or stop state.
2. $\Sigma$ is a set of output symbols, for example $\Sigma = \{a, b\}$.
3. $\Theta$ is a vector of parameters. It contains three types of parameters:
$\pi_j$ for $j = 1 \ldots N$ is the probability of choosing state $j$ as an initial state. Note that $\sum_{j=1}^{N} \pi_j = 1$.
$a_{j,k}$ for $j = 1 \ldots (N-1)$, $k = 1 \ldots N$ is the probability of transitioning from state $j$ to state $k$. Note that for all $j$, $\sum_{k=1}^{N} a_{j,k} = 1$.
$b_j(o)$ for $j = 1 \ldots (N-1)$ and $o \in \Sigma$ is the probability of emitting symbol $o$ from state $j$. Note that for all $j$, $\sum_{o \in \Sigma} b_j(o) = 1$.
Thus it can be seen that $\Theta$ is a vector of $N + (N-1)N + (N-1)|\Sigma|$ parameters.
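As a concrete illustration, the parameters of a small HMM with $N = 3$ and $\Sigma = \{a, b\}$ might be stored as follows. This is a minimal sketch: the variable names (pi, a, b) and the particular probability values are assumptions chosen for illustration, not taken from the text.

    import numpy as np

    N = 3                  # number of states; state N is the stop state
    alphabet = ["a", "b"]  # the output alphabet Sigma

    # pi[j-1]: probability of choosing state j as the initial state
    pi = np.array([0.5, 0.5, 0.0])

    # a[j-1, k-1]: probability of transitioning from state j to state k,
    # defined for j = 1 ... N-1 (rows) and k = 1 ... N (columns)
    a = np.array([[0.2, 0.5, 0.3],
                  [0.4, 0.3, 0.3]])

    # b[j-1][o]: probability of emitting symbol o from state j,
    # defined for j = 1 ... N-1
    b = [{"a": 0.7, "b": 0.3},
         {"a": 0.1, "b": 0.9}]

    # sanity checks: each distribution sums to one
    assert np.isclose(pi.sum(), 1.0)
    assert np.allclose(a.sum(axis=1), 1.0)
    assert all(np.isclose(sum(dist.values()), 1.0) for dist in b)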
An HMM specifies a probability for each possible $(x, y)$ pair, where $x$ is a sequence of symbols drawn from $\Sigma$, and $y$ is a sequence of states drawn from the integers $1 \ldots (N-1)$. The sequences $x$ and $y$ are restricted to have the same length. As an example, say we have an HMM with $N = 3$, $\Sigma = \{a, b\}$, and with some choice of the parameters $\Theta$. Take $x = a, a, b, b$ and $y = 1, 2, 2, 1$. Then in this case,
$P(x, y \mid \Theta) = \pi_1 \, a_{1,2} \, a_{2,2} \, a_{2,1} \, a_{1,3} \, b_1(a) \, b_2(a) \, b_2(b) \, b_1(b)$
Thus we have a product of terms: the probability $\pi_1$ of choosing the initial state, the probability of each state transition (including the final transition $a_{1,3}$ into the stop state), and the probability of emitting each symbol from its associated state.
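To make this concrete, plugging in the illustrative parameter values from the sketch above (again, assumed values, not values from the text) gives
$P(x, y \mid \Theta) = 0.5 \times 0.5 \times 0.3 \times 0.4 \times 0.3 \times 0.7 \times 0.1 \times 0.9 \times 0.3 \approx 1.7 \times 10^{-4}$.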
In general, if we have a sequence $x = x_1 \ldots x_n$ and a state sequence $y = y_1 \ldots y_n$,
$P(x, y \mid \Theta) = \pi_{y_1} \left( \prod_{j=2}^{n} a_{y_{j-1}, y_j} \right) a_{y_n, N} \prod_{j=1}^{n} b_{y_j}(x_j)$
Thus we see that the probability is a simple function of the parameters $\Theta$.
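The formula translates directly into code. Here is a short sketch, assuming the parameter layout from the earlier block (the function name joint_probability is a hypothetical choice of ours, not from the text), that computes $P(x, y \mid \Theta)$ as the product of the initial, transition, stop, and emission terms.

    def joint_probability(x, y, pi, a, b, N):
        """Compute P(x, y | Theta) for an HMM whose Nth state is the stop state.

        x is a list of output symbols; y is a list of states in 1 ... N-1.
        States are 1-indexed as in the text; the arrays are 0-indexed.
        """
        n = len(x)
        assert len(y) == n
        p = pi[y[0] - 1]                    # initial state probability pi_{y_1}
        for j in range(1, n):
            p *= a[y[j - 1] - 1, y[j] - 1]  # transition y_{j-1} -> y_j
        p *= a[y[n - 1] - 1, N - 1]         # final transition y_n -> stop state N
        for j in range(n):
            p *= b[y[j] - 1][x[j]]          # emission of x_j from state y_j
        return p

    # The worked example from the text: x = a,a,b,b and y = 1,2,2,1
    print(joint_probability(["a", "a", "b", "b"], [1, 2, 2, 1], pi, a, b, N))
    # prints approximately 1.7e-4, matching the hand computation above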