0% found this document useful (0 votes)
130 views3 pages

Graphical Models and HMMs Overview

Uploaded by

Neetha Das
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
130 views3 pages

Graphical Models and HMMs Overview

Uploaded by

Neetha Das
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Introduction to Machine Learning

Week 9
Prof. B. Ravindran, IIT Madras

1. (2 marks) In the undirected graph given below, how many terms will be there in its potential
function factorization?

(a) 7
(b) 3
(c) 5
(d) 9
(e) None of the above

Soln. B - Three Cliques {A, B, C, D}, {A, D, E}, {B, C, F}


2. (2 Mark) Consider the following directed graph:

A B C

D E

Based on the d-separation rules, which of the following statements is true?


(a) A and C are conditionally independent given B
(b) A and E are conditionally independent given D
(c) B and E are conditionally dependent given C
(d) A and C are conditionally dependent given D and E

1
Soln. A - Conditioning on B blocks the only path A-B-C between A and C, making A and C
conditionally independent given B.

3. (1 Mark) Consider the following undirected graph:

A B E

C D

In the undirected graph given above, which nodes are conditionally independent of each other
given C? Select all that apply.

(a) A, E
(b) B, F
(c) A, D
(d) B, D
(e) None of the above

Soln. E - None of the paths between the given pairs are blocked when C is conditioned on.

4. (1 Marks) Consider the following statements about Hidden Markov Models (HMMs):

I. The ”Hidden” in HMM refers to the fact that the state transition probabilities are un-
known.
II. The ”Markov” property means that the current state depends only on the previous state.
III. The ”Hidden” aspect relates to the underlying state sequence that is not directly observ-
able.
IV. The ”Markov” in HMM indicates that the model uses matrix operations for calculations.

Which of the statements correctly describe the ”Hidden” and ”Markov” aspects of Hidden
Markov Models?

(a) I and II
(b) I and IV
(c) II and III
(d) III and IV

Soln. C - Refer to the lectures

2
5. (2 marks) For the given graphical model, what is the optimal variable elimination order when
trying to calculate P(E=e)?

A B C D E

(a) A, B, C, D
(b) D, C, B, A
(c) A, D, B, C
(d) D, A, C, A

Soln. A

XXXX
P (E = e) = P (a, b, c, d, e)
d c b a
XXXX
P (E = e) = P (a)P (b|a)P (c|b)P (d|c)P (e|d)
d c b a
X X X X
P (E = e) = P (e|d) P (d|c) P (c|b) P (a)P (b|a)
d c b a

6. (1 Marks) Consider the following statements regarding belief propagation:

I. Belief propagation is used to compute marginal probabilities in graphical models.


II. Belief propagation can be applied to both directed and undirected graphical models.
III. Belief propagation guarantees an exact solution when applied to loopy graphs.
IV. Belief propagation works by passing messages between nodes in a graph.

Which of the statements correctly describe the use of belief propagation?

(a) I and II
(b) II and III
(c) I, II, and IV
(d) I, III, and IV
(e) II, III, and IV

Soln. C - Refer to the lectures.

7. (1 Mark) HMMs are used for finding these. Select all that apply.
(a) Probability of a given observation sequence
(b) All possible hidden state sequences given an observation sequence
(c) Most probable observation sequence given the hidden states
(d) Most probable hidden states given the observation sequence
Soln. A, D - Refer to the lectures.

Common questions

Powered by AI

In undirected graphical models, belief propagation can be efficient due to the absence of a directed path constraint, allowing for straightforward message passing. However, its application in loopy or complex graphs may lead to inexact results. In directed models, belief propagation is more directly aligned with conditional independencies present in the model, but the directionality can limit the flexibility of message passing, thereby complicating the computation in certain arrangements .

The failure of certain statements to describe belief propagation correctly often stems from misunderstandings about its applicability across different types of graphical models and its limitations. Belief propagation is often misapplied when it is assumed to guarantee exact solutions, despite its limitations in loopy graphs where it might only provide approximate results. Such misconceptions need correcting to match the actual capabilities and appropriate applications of belief propagation .

Belief propagation can fail to guarantee exact solutions when applied to loopy graphs due to cycles that can cause the propagation of messages to be inconsistent, leading to approximate rather than exact inference. In such cases, alternative approaches like Loopy Belief Propagation (an iterative method), approximation algorithms, or exact inference algorithms like the Junction Tree Algorithm might be considered to obtain more reliable results .

In sparse graphical models, belief propagation offers strategic benefits by leveraging fewer connections to efficiently propagate messages between nodes, maintaining computational simplicity and accuracy. In these contexts, it can lead to exact solutions. Conversely, in dense graphs, where numerous nodes are interconnected, the method may become computationally intensive and might only yield approximate results due to the complexity and potential cycles, necessitating alternative strategies for more precise inference .

Hidden Markov Models (HMMs) are beneficial in sequence prediction and alignment by modeling systems where observations are influenced by underlying hidden processes. They are used to find the probability of a given observation sequence and determine the most probable hidden states given the observation sequence, among other applications. By utilizing the Markov property, HMMs predict the sequences of hidden states that best explain the observed data .

D-separation uses the criterion of blocking paths to determine conditional independence between nodes in a directed graph. When nodes A and C in a directed graph are considered, d-separation implies that conditioning on a node B that lies on the path between A and C can block the path and make A and C conditionally independent. For instance, in the graph with nodes A, B, and C, if A and C are connected indirectly via B (i.e., A-B-C), then conditioning on B blocks the only path between A and C, resulting in A and C being conditionally independent given B .

Graphical models use cliques in factorization to break down a joint probability distribution into a product of smaller, potentially overlapping components, each associated with a clique. This process utilizes the conditional independence properties encoded in the graph to simplify computation. The primary implication for computational efficiency is that the complexity of computations is often reduced, avoiding the need to work with the full distribution directly and instead dealing only with localized dependencies within cliques .

In undirected graphs, having nodes be conditionally independent given a certain node implies that the node in question, when conditioned upon, blocks all paths through it. This breaks connections between otherwise linked nodes, rendering them conditionally independent. Therefore, no information about one node in the pair can provide any knowledge about the other node's state. For example, in a graph, if C is conditioned upon, and C blocks paths between certain nodes, those nodes become conditionally independent given C .

A common misconception about the 'Hidden' aspect in Hidden Markov Models (HMMs) is that it refers to the state transition probabilities being unknown. However, it actually pertains to the underlying state sequence not being directly observable. Additionally, there is a misconception regarding the 'Markov' property, with some believing it relates to the use of matrix operations for calculations, whereas it actually refers to the property that the current state depends only on the previous state .

Selecting an optimal variable elimination order involves strategic considerations such as minimizing computational complexity and maintaining tractability by ensuring that the intermediate factors generated during the process remain small. For example, in calculating P(E=e) in a given model, eliminating variables in an order that reduces the creation of large cliques can significantly ease the computational burden and lead to more efficient calculations .

You might also like