Practice Final Solution for Introduction to Artificial Intelligence | COMPSCI 188 | Exams Computer Science

NAME: SID#: Login: Sec:

CS 188, Spring 2006: Introduction to Artificial Intelligence, Practice Final Sol’ns

1. (20 points.) True/False

Each problem is worth 2 points. Incorrect answers are worth 0 points. Skipped questions are worth 1 point.

(a) True/False: All MDPs can be solved using expectimax search.

False. MDPs with self loops lead to infinite expectimax trees. Unlike search problems, this issue cannot

be addressed with a graph-search variant.
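The self-loop problem can be illustrated with a minimal sketch. The two-state MDP below, its discount factor, and all names in it are hypothetical and chosen only for illustration. Unbounded expectimax never bottoms out on the loop, while a depth cutoff yields only an approximation that approaches the true value as the depth grows.

```python
# Hypothetical MDP: from "s", the only action "go" returns to "s" with
# probability 0.5 (a self loop) or reaches the terminal state "done".
GAMMA = 0.9
T = {("s", "go"): [(0.5, "s", 0.0), (0.5, "done", 1.0)]}  # (prob, next_state, reward)

def expectimax(state, depth):
    """Expectimax value of `state`; depth=None means no depth cutoff."""
    if state == "done" or depth == 0:
        return 0.0
    next_depth = None if depth is None else depth - 1
    actions = [a for (s, a) in T if s == state]
    return max(
        sum(p * (r + GAMMA * expectimax(s2, next_depth)) for p, s2, r in T[(state, a)])
        for a in actions
    )

def unbounded_expansion_fails():
    """Return True if expanding the full tree exhausts the recursion limit."""
    try:
        expectimax("s", None)
    except RecursionError:
        return True
    return False

# Fixed point of V = 0.5 * (0 + GAMMA * V) + 0.5 * 1, solved by hand.
exact_value = 0.5 / (1 - 0.5 * GAMMA)
approx_value = expectimax("s", 50)
```

In this sketch `approx_value` gets close to `exact_value` only because the discount shrinks the contribution of deep levels; the tree itself is still infinite.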

(b) True/False: There is some single Bayes’ net structure over three variables which can represent any

probability distribution over those variables.

True. A fully connected Bayes’ net can represent any joint distribution.
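To see why, assume the three variables are named A, B, C (labels introduced here, not in the exam) and ordered so that each node has all earlier nodes as parents. The net then encodes the chain-rule factorization, which is an identity for every joint distribution:

```latex
P(A, B, C) = P(A)\, P(B \mid A)\, P(C \mid A, B)
```

Because no conditional independence is assumed on the right-hand side, any joint distribution can be matched by choosing the three conditional probability tables accordingly.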

utility function over those outcomes.

True. Any set of preferences which conform to the six constraints on rational preferences (orderability,

transitivity, continuity, substitutability, monotonicity, decomposability) can be summarized by a single,

real-valued function.

(d) True/False: Temporal difference learning of optimal utility values (U) requires knowledge of the transition

probability tables (T).

Mostly True. Temporal difference learning is a model-free learning technique that requires only example

state sequences to learn the utilities for a fixed policy. However, to derive the best policy from those

utilities, which would be required to find the optimal utility values, we would need to compute

$$\pi(s) = \arg\max_a \sum_{s'} T(s, a, s') \, U(s')$$

which of course includes a transition probability. The solution reads “mostly” true because the optimal

utility values could be found without the transition probabilities if the agent were also supplied with the

optimal policy. In practice, we could also estimate the transition probabilities from the training data

(using maximum-likelihood estimates, for example), so they need not necessarily be known in advance.

NOTE: This solution was updated since the review session.
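The distinction drawn above can be sketched in code. This is a minimal illustration, not the course's reference implementation: the function names, the episode format, and the default learning rate and discount are all assumptions. The TD(0) update uses only observed transitions, while the policy-extraction step needs the table T.

```python
from collections import defaultdict

def td0_policy_evaluation(episodes, alpha=0.1, gamma=0.9):
    """Estimate U for the fixed policy that generated `episodes`.

    Each episode is a list of (state, reward, next_state) steps.
    No transition model is consulted.
    """
    U = defaultdict(float)
    for episode in episodes:
        for s, r, s_next in episode:
            # Move U[s] toward the one-step sample r + gamma * U[s_next].
            U[s] += alpha * (r + gamma * U[s_next] - U[s])
    return U

def greedy_policy(U, T, states, actions):
    """Compute argmax_a sum_{s'} T(s, a, s') U(s'); this step does need T.

    T maps (state, action) to a dict {next_state: probability}.
    """
    return {
        s: max(actions, key=lambda a: sum(p * U[s2] for s2, p in T[(s, a)].items()))
        for s in states
    }
```

If T is not given, the same `greedy_policy` step could be run on maximum-likelihood estimates of T built from the observed transitions, as the solution suggests.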

(e) True/False: Pruning nodes from a decision tree may have no effect on the resulting classifier.

True. Trivially, a decision tree may have branches that are unreachable. Furthermore, splits in the

decision tree may also refine P(class), but have no effect in practice because of rounding. Imagine a leaf

has 10 true, 3 false, and splits to 5/2 and 5/1 – you’ll still guess true on each branch, but the split is

refining the conditional probabilities.

NOTE: This solution was updated since the review session.
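The 10/3 example can be checked with a short sketch. The helper `predict` is a hypothetical name used only here; it returns the majority-class guess and the estimated P(true) at a leaf.

```python
def predict(true_count, false_count):
    """Majority-class prediction and estimated P(true) at a leaf."""
    total = true_count + false_count
    return true_count >= false_count, true_count / total

# Counts taken from the example above: a 10/3 leaf split into 5/2 and 5/1.
parent = predict(10, 3)
children = [predict(5, 2), predict(5, 1)]

# The split changes the probability estimates but not the predicted label.
same_label = all(label == parent[0] for label, _ in children)
probabilities_differ = all(prob != parent[1] for _, prob in children)
```

Pruning the split therefore leaves every prediction unchanged, even though the unpruned tree stores finer conditional probabilities.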
