Decision Making and Expected Utility: Maximizing Rewards and Minimizing Risks, Slides of Artificial Intelligence

The concept of decision making under uncertainty, focusing on the maximum expected utility (meu) principle. Students will learn about deterministic and probabilistic problems, decision-making examples, and the utility principle. The document also covers the expected monetary value (emv) and the concept of risk-neutral and risk-averse agents.

Typology: Slides

2012/2013

Uploaded on 04/24/2013

banani
banani 🇮🇳

4.3

(3)

91 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
CSCI 100
Think Like Computers
Lecture 5
Fall 2008
Decisions
Some problems are deterministic: actions
lead to known, predictable outcomes
But others involve chances
Lottery
Or, they are so complicated that outcomes
is hard to predict
So we assume there are some randomness
Decision-making example
You are deciding whether to hold a party
inside or outside
Probability of rain is 1/3
Indoor party gives reward 2
Outdoor party with no rain gives reward 3
Outdoor party with rain gives reward 1
Maximum Expected Utility (MEU)
The MEU principle says that a rational
agent should choose an action that
maximizes its expected utility.
The utility principle
Theorem 1: If an agent's preferences obey the
axioms of utility theory, then there exists a real-
valued function U that operates on states such that:
U(A) > U(B) A >B; and
U(A) = U(B) A B
Theorem 2: The utility of a lottery is the sum of
probabilities of each outcome times the utility of that
outcome: U([p1,S1; p2,S2; ...; pn,Sn] = ipiU(Si)
- expected utility
Expected monetary value
Example: You can take a $1,000,000 prize (A) or
gamble on it by flipping a coin (B). If you
gamble, you will either triple the prize or loose
it.
EMV (expected monetary value) of the lottery is
$1,500,000, but does it have higher utility?
EU(Gamble)=0.5*U(x)+0.5*U(x+3,000,000)
EU(Take)=U(x+1,000,000)
Docsity.com
pf3
pf4

Partial preview of the text

Download Decision Making and Expected Utility: Maximizing Rewards and Minimizing Risks and more Slides Artificial Intelligence in PDF only on Docsity!

CSCI 100

Think Like Computers

Lecture 5

Fall 2008

Decisions

  • Some problems are deterministic: actions lead to known, predictable outcomes
  • But others involve chances Š Lottery
  • Or, they are so complicated that outcomes is hard to predict Š So we assume there are some randomness

Decision-making example

  • You are deciding whether to hold a party inside or outside
  • Probability of rain is 1/
  • Indoor party gives reward 2
  • Outdoor party with no rain gives reward 3
  • Outdoor party with rain gives reward 1

Maximum Expected Utility (MEU)

  • The MEU principle says that a rational agent should choose an action that maximizes its expected utility.

The utility principle

Theorem 1 : If an agent's preferences obey the axioms of utility theory, then there exists a real- valued function U that operates on states such that: U(A) > U(B) ⇔ A > B; and U(A) = U(B) ⇔ A ≡ B

Theorem 2 : The utility of a lottery is the sum of probabilities of each outcome times the utility of that outcome: U([p 1 ,S 1 ; p 2 ,S 2 ; ...; pn ,Sn ] = ∑i pi U(Si )

  • expected utility

Expected monetary value

Example: You can take a $1,000,000 prize (A) or gamble on it by flipping a coin (B). If you gamble, you will either triple the prize or loose it. EMV (expected monetary value) of the lottery is $1,500,000, but does it have higher utility? EU(Gamble)=0.5U(x)+0.5U(x+3,000,000) EU(Take)=U(x+1,000,000)

Expected Monetary Value

What if prize = 100? 10?

Does it matter whether or not you can play many times?

Expected monetary value

Bernoulli's 1738 St. Petersburg Paradox: Toss a coin until it comes up heads. If it happens after n times, you receive 2 n^ dollars.

How much should you pay to participate in this game?

Risk-averter’s curve

Decreasing marginal utility for money. Will buy affordable insurance. Will only take gambles with substantial positive expected monetary payoff.

m

U(m)

Risk-seeker’s curve

Increasing marginal utility for money. Will not buy insurance. Will sometimes participate in unfavorable gamble having negative expected monetary payoff.

m

U(m)

Utility curves

  • Risk-neutral agents (linear curve).
  • Regardless of the attitude towards risk, the utility function can always be approximated by a straight line over a small range of monetary outcome.

Are you rational?

  • Which lottery would you prefer?
  • A: 4% chance of winning $
  • B: 5% chance of winning $

Some Statistics

  • Death due to lightning: 1/6 million in 1Year, 1/80,000 lifetime
  • Air plane: 1/440,000 a year, 1/5, lifetime
  • Motor-Vehicle accidents: 1/6,535 a year, 1/84 lifetime

Gamble

  • Do you gamble?
  • Why or why not?

Is there a winning strategy?

  • Bet $1 in the beginning.
  • If lose, double the amount next time.
  • Does this work?

Homework 1 (due Oct 2)

    1. Write a short essay (1 page) about your opinion on whether gambling is rational or irrational, and reasons for you to gamble/no to gamble.
    1. Do a research on playing Black Jack. Design your strategy. Is the strategy suitable for a computer to adopt? What representation would you use?