Podcast
Questions and Answers
What is the expected monetary value (EMV) of the gamble if the coin is fair in the given scenario?
What is the expected monetary value (EMV) of the gamble if the coin is fair in the given scenario?
$1,250,000
What are the expected utilities of accepting and declining the gamble based on the provided utilities?
What are the expected utilities of accepting and declining the gamble based on the provided utilities?
EU (Accept) = 1/2 * U(Sk) + 1/2 * U(Sk+2,500,000), EU (Decline) = U(Sk+1,000,000)
What is the utility function for Mr. Beard as shown in Figure 16.2(a)?
What is the utility function for Mr. Beard as shown in Figure 16.2(a)?
U(Sk+n) = −263.31 + 22.09 log(n + 150,000)
What is the existence of a utility function based on?
What is the existence of a utility function based on?
Signup and view all the answers
What is the rational action to take if faced with the gamble scenario?
What is the rational action to take if faced with the gamble scenario?
Signup and view all the answers
Risk-averse agents prefer a sure thing with a payoff less than the expected monetary value of a gamble.
Risk-averse agents prefer a sure thing with a payoff less than the expected monetary value of a gamble.
Signup and view all the answers
Utility theory states that all rational agents must obey specific axioms.
Utility theory states that all rational agents must obey specific axioms.
Signup and view all the answers
What is the utility of a lottery according to utility theory?
What is the utility of a lottery according to utility theory?
Signup and view all the answers
In utility theory, a utility function maps from lotteries to ___.
In utility theory, a utility function maps from lotteries to ___.
Signup and view all the answers
What is the basic principle of decision theory discussed in Section 16.1?
What is the basic principle of decision theory discussed in Section 16.1?
Signup and view all the answers
Which function captures the agent's preferences by assigning a single number to express the desirability of a state?
Which function captures the agent's preferences by assigning a single number to express the desirability of a state?
Signup and view all the answers
The Maximum Expected Utility (MEU) principle states that an agent should choose the action with the lowest expected utility.
The Maximum Expected Utility (MEU) principle states that an agent should choose the action with the lowest expected utility.
Signup and view all the answers
_______ is a random variable in decision theory that represents the possible outcome states given evidence observations.
_______ is a random variable in decision theory that represents the possible outcome states given evidence observations.
Signup and view all the answers
Match the following constraints with their descriptions in utility theory:
Match the following constraints with their descriptions in utility theory:
Signup and view all the answers
What does the optimizer's curse result from?
What does the optimizer's curse result from?
Signup and view all the answers
Decision theory is a descriptive theory that describes how actual agents act.
Decision theory is a descriptive theory that describes how actual agents act.
Signup and view all the answers
What is the term used for the phenomenon where people are strongly attracted to gains that are certain?
What is the term used for the phenomenon where people are strongly attracted to gains that are certain?
Signup and view all the answers
The Allais paradox involves people consistently preferring ______ over A and C over D.
The Allais paradox involves people consistently preferring ______ over A and C over D.
Signup and view all the answers
Match the following phenomenon with its description:
Match the following phenomenon with its description:
Signup and view all the answers
What is the term used to describe a situation where one option is of lower value on all attributes than some other option?
What is the term used to describe a situation where one option is of lower value on all attributes than some other option?
Signup and view all the answers
What is the term for a situation where despite uncertainty, all possible concrete outcomes for S1 strictly dominate all possible outcomes for S2?
What is the term for a situation where despite uncertainty, all possible concrete outcomes for S1 strictly dominate all possible outcomes for S2?
Signup and view all the answers
Stochastic dominance can always be determined by comparing the expected costs of two options.
Stochastic dominance can always be determined by comparing the expected costs of two options.
Signup and view all the answers
In deterministic preference structures, the basic regularity that arises is called preference __________.
In deterministic preference structures, the basic regularity that arises is called preference __________.
Signup and view all the answers
Match the term with its description:
Match the term with its description:
Signup and view all the answers
What is postcoarctectomy syndrome?
What is postcoarctectomy syndrome?
Signup and view all the answers
What is tachypnea?
What is tachypnea?
Signup and view all the answers
What is tachycardia?
What is tachycardia?
Signup and view all the answers
What is paradoxical hypertension?
What is paradoxical hypertension?
Signup and view all the answers
What is failure to thrive?
What is failure to thrive?
Signup and view all the answers
What is aortic hypertension?
What is aortic hypertension?
Signup and view all the answers
What does the value of information derive from?
What does the value of information derive from?
Signup and view all the answers
What are the two factors that determine the importance of tests according to the text?
What are the two factors that determine the importance of tests according to the text?
Signup and view all the answers
The expected profit for the company, given survey information, is $C/n(n-1)$. Therefore, the company should be willing to pay the seismologist up to ____ dollars for the information.
The expected profit for the company, given survey information, is $C/n(n-1)$. Therefore, the company should be willing to pay the seismologist up to ____ dollars for the information.
Signup and view all the answers
The expected value of information can be negative.
The expected value of information can be negative.
Signup and view all the answers
What distinguishes sensing actions from ordinary actions in terms of order independence?
What distinguishes sensing actions from ordinary actions in terms of order independence?
Signup and view all the answers
What is the design principle mentioned when discussing the implementation of an information-gathering agent?
What is the design principle mentioned when discussing the implementation of an information-gathering agent?
Signup and view all the answers
Myopic control uses the VPI formula shortsightedly.
Myopic control uses the VPI formula shortsightedly.
Signup and view all the answers
DECISION ANALYSIS studies the application of decision theory to actual __________ problems.
DECISION ANALYSIS studies the application of decision theory to actual __________ problems.
Signup and view all the answers
What is a value function of the type V (noise, cost, deaths) called?
What is a value function of the type V (noise, cost, deaths) called?
Signup and view all the answers
Which type of function represents an additive value function?
Which type of function represents an additive value function?
Signup and view all the answers
Utility independence extends preference independence to cover lotteries.
Utility independence extends preference independence to cover lotteries.
Signup and view all the answers
For conciseness, in a multiplicative utility function, U = $k_1 U_1 + k_2 U_2 + k_3 U_3 + k_1 k_2 U_1 U_2 + k_2 k_3 U_2 U_3 + k_3 k_1 U_3 U_1$, an n-attribute problem exhibiting MUI can be modeled using ___ single-attribute utilities and ___ constants.
For conciseness, in a multiplicative utility function, U = $k_1 U_1 + k_2 U_2 + k_3 U_3 + k_1 k_2 U_1 U_2 + k_2 k_3 U_2 U_3 + k_3 k_1 U_3 U_1$, an n-attribute problem exhibiting MUI can be modeled using ___ single-attribute utilities and ___ constants.
Signup and view all the answers
Match the following node types with their representations in decision networks:
Match the following node types with their representations in decision networks:
Signup and view all the answers
Study Notes
Making Simple Decisions
- Decision theory combines with probability theory to yield a decision-theoretic agent that can make rational decisions based on what the agent believes and wants.
- The principle of maximum expected utility (MEU) says that a rational agent should choose the action that maximizes the agent's expected utility.
Combining Beliefs and Desires Under Uncertainty
- Decision theory deals with choosing among actions based on the desirability of their immediate outcomes.
- The agent's preferences are captured by a utility function, U(s), which assigns a single number to express the desirability of a state.
- The expected utility of an action given the evidence, EU(a|e), is the average utility value of the outcomes, weighted by the probability that the outcome occurs.
- The principle of maximum expected utility (MEU) says that a rational agent should choose the action that maximizes the agent's expected utility.
The Basis of Utility Theory
- The principle of Maximum Expected Utility (MEU) seems like a reasonable way to make decisions, but it is by no means obvious that it is the only rational way.
- There are six constraints that a rational preference relation should obey:
- Orderability: Given any two lotteries, a rational agent must either prefer one to the other or rate the two as equally preferable.
- Transitivity: Given any three lotteries, if an agent prefers A to B and prefers B to C, then the agent must prefer A to C.
- Continuity: If some lottery B is between A and C in preference, then there is some probability p for which the rational agent will be indifferent between getting B for sure and the lottery that yields A with probability p and C with probability 1 - p.
- Substitutability: If an agent is indifferent between two lotteries A and B, then the agent is indifferent between two more complex lotteries that are the same except that B is substituted for A in one of them.
- Monotonicity: Suppose two lotteries have the same two possible outcomes, A and B. If an agent prefers A to B, then the agent must prefer the lottery that has a higher probability for A (and vice versa).
- Decomposability: Compound lotteries can be reduced to simpler ones using the laws of probability.
Preferences Lead to Utility
- From the axioms of utility, we can derive the following consequences:
- Existence of Utility Function: If an agent's preferences obey the axioms of utility, then there exists a function U such that U(A) > U(B) if and only if A is preferred to B, and U(A) = U(B) if and only if the agent is indifferent between A and B.
- Expected Utility of a Lottery: The utility of a lottery is the sum of the probability of each outcome times the utility of that outcome.
Utility Functions
- A utility function is a function that maps from lotteries to real numbers.
- An agent's preferences can be unusual, but we can't call it irrational.
- Utility assessment involves presenting choices to the agent and using the observed preferences to pin down the underlying utility function.
- A scale can be established by fixing the utilities of any two particular outcomes, such as a "best possible prize" and a "worst possible catastrophe".### Probability and Utility
- The probability p is adjusted until the agent is indifferent between a standard lottery and a lottery involving a prize, thus determining the utility of the prize.
- In medical, transportation, and environmental decision problems, people's lives are at stake, and a value is assigned to immediate death (u⊥).
Micromort and QALY
- Micromort: a one in a million chance of death, used as a "currency" in medical and safety analysis.
- People are willing to pay about $50 per micromort, based on studies.
Utility of Money
- Utility theory has its roots in economics, and economics provides a candidate for a utility measure: money.
- Money plays a significant role in human utility functions, and agents usually prefer more money to less.
- The utility of money is not directly proportional to monetary value, but rather concave for positive wealth and convex for negative wealth.
Risk Aversion and Seeking
- Agents with concave utility curves are risk-averse, preferring a sure thing over a gamble.
- Agents with convex utility curves are risk-seeking, preferring a gamble over a sure thing.
- The certainty equivalent of a lottery is the value an agent will accept in lieu of the lottery.
Insurance Premium and Risk Neutrality
- The difference between the EMV of a lottery and its certainty equivalent is called the insurance premium.
- Insurance companies take advantage of people's risk aversion by offering insurance at a premium.
- For small changes in wealth relative to current wealth, almost any curve will be approximately linear, making agents risk-neutral.
Expected Utility and Post-Decision Disappointment
- The rational way to choose the best action is to maximize expected utility.
- Estimates of expected utility may be unbiased, but still lead to disappointment due to optimistic estimates.
- The optimizer's curse: the tendency for the estimated expected utility of the best choice to be too high, leading to disappointment.
Human Judgment and Irrationality
- Decision theory is a normative theory, describing how a rational agent should act, but humans may not act rationally.
- The Allais paradox: people consistently prefer B over A (taking the sure thing) and C over D (taking the higher EMV), contradicting normative analysis.
- The certainty effect: people are strongly attracted to gains that are certain, which may be due to reducing computational burden, distrust of probabilities, or accounting for emotional state.
- The Ellsberg paradox: people's choices are influenced by underconstrained probabilities, leading to apparent irrationality.### Ambiguity Aversion
- Ambiguity aversion is the tendency for people to prefer known probabilities over unknown ones.
- Example: people prefer lottery A (100 dollars for a red ball) over B (100 dollars for a black ball) and also prefer D (100 dollars for a black or yellow ball) over C (100 dollars for a red or yellow ball), even though there is no state of the world where this is rational.
Framing Effect
- The framing effect is the tendency for people to be influenced by the way a decision problem is framed.
- Example: people prefer a medical procedure with a "90% survival rate" over one with a "10% death rate", even though they mean the same thing.
Anchoring Effect
- The anchoring effect is the tendency for people to rely too heavily on the first piece of information they receive when making decisions.
- Example: a restaurant offers a $200 bottle of wine to make the $55 bottle seem like a bargain.
Multiattribute Utility Theory
- Multiattribute utility theory is used to handle decision problems with multiple attributes.
- Attributes are characterized by multiple values, and a complete vector of assignments is a combination of these values.
- Higher values of an attribute correspond to higher utilities.
- Strict dominance occurs when one option is better than another on all attributes.
- Stochastic dominance occurs when one option is better than another on all attributes, even when the outcomes are uncertain.
Stochastic Dominance
- Stochastic dominance is a more general and useful concept than strict dominance.
- It occurs when the cumulative distribution of one option is always to the right of the cumulative distribution of another option.
- If an option A stochastically dominates an option B, then the expected utility of A is at least as high as the expected utility of B.
- Qualitative probabilistic networks can be used to propagate stochastic dominance and make rational decisions without using numeric values.
Preference Structure and Multiattribute Utility
- Multiattribute utility theory is based on the idea that preferences have regularity and structure.
- Representation theorems are used to show that an agent with a certain preference structure has a utility function that can be decomposed into simpler functions.
- Preference independence is a regularity that arises in deterministic preference structures.
- Mutual preferential independence occurs when a set of attributes exhibits preference independence with respect to each other.
- Mutual preferential independence leads to a simple form for the agent's value function, which can be derived using a theorem by Gérard Debreu.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
This chapter covers how an agent makes decisions based on utility theory and probability theory to get what it wants.