0% found this document useful (0 votes)
30 views

Uncertainty and Utility

Depth-Limited Expectimax is a search algorithm that combines elements of depth-limited search and the Expectimax algorithm. It efficiently searches through game trees with uncertain outcomes by calculating the expected value of states based on the probability of different events occurring down to a limited search depth. The '11/36' and '11 = 36' terms seen in the context of Depth-Limited Expectimax likely represent specific probabilities or values being compared at a particular game state or decision point.

Uploaded by

mdsoyeb
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views

Uncertainty and Utility

Depth-Limited Expectimax is a search algorithm that combines elements of depth-limited search and the Expectimax algorithm. It efficiently searches through game trees with uncertain outcomes by calculating the expected value of states based on the probability of different events occurring down to a limited search depth. The '11/36' and '11 = 36' terms seen in the context of Depth-Limited Expectimax likely represent specific probabilities or values being compared at a particular game state or decision point.

Uploaded by

mdsoyeb
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Question Answer

What is the probability of rolling a 2 on a standard six-sided die? 2/36 or 1/18.

Express the probability of rolling a 2 as a fraction. 2/36.

Express the probability of rolling a 2 as a simplified fraction. 1/18.

What is a random variable? An event whose outcome is unknown.

What is a probability distribution? An assignment of weights to outcomes.

What is an example of a random variable? Traffic on a freeway.

What are the possible outcomes for the random variable T? None, light, heavy.

What is the sum of probabilities over all possible outcomes? 1.

How do probabilities change as we get more evidence? Probabilities may change.

What are some laws of probability mentioned? Non-negative, sum of probabilities over all possible outcomes equals 1.

How does the probability of heavy traffic change with more evidence at 8 P(T = heavy) = 0.25, P(T = heavy | H = 8 a.m.) = 0.60.
a.m.?

What is the definition of expected value in the context of random variables? The expected value of a function of a random variable is the average,
weighted by the probability distribution over outcomes.

Can you provide an example of expected value calculation? For example, the question 'How long to get to the airport?' with a
probability of 4/36, the expected value is 4.

What is the concept of worst-case scenario? It involves uncertain outcomes controlled by an adversary.

What is the concept of average case scenario? It involves uncertain outcomes controlled by chance, not an adversary.

What does the notation '5/36' represent? It represents a probability or likelihood.

What does '5 = 36' signify in this context? It may represent an equation or comparison related to probability.

What is the concept of worst-case scenario? It involves uncertain outcomes controlled by an adversary.

What is the concept of average case scenario? It involves uncertain outcomes controlled by chance, not an adversary.

What does the expression '5/36' represent? It represents a probability or likelihood.

In the expression '5 = 36', what does the '=' symbol indicate? It indicates equality or equivalence.

What causes us to not know the result of an action in Expectimax search? Explicit randomness, unpredictable opponents, and failed actions.

Give an example of explicit randomness in Expectimax search. Rolling dice.

How do unpredictable opponents affect Expectimax search? The opponents respond randomly.

What is an example of failed actions in Expectimax search? When moving a robot, wheels might slip.

What should the values reflect in Expectimax search? Average-case (expectimax) outcomes, not worst-case (minimax)
outcomes.

What is the role of max nodes in Expectimax search? Same as in minimax search.

How are chance nodes different from min nodes in Expectimax search? The outcome is uncertain in chance nodes.

How are expected utilities calculated in Expectimax search? By taking the weighted average (expectation) of children.

What is the purpose of the 'value' function in Expectimax pseudocode? To calculate the value of a given state, considering terminal states and
the next agent to act.

When does the 'value' function return the state's utility? When the state is a terminal state.

What does the 'max-value' function do in Expectimax pseudocode? It returns the maximum value among the successors of a given state.

How does the 'exp-value' function work in Expectimax pseudocode? It calculates the expected value by considering the probability and
value of each successor of a given state.

What is the significance of '7/36' in the provided pseudocode? It seems to be a specific probability value used in the 'exp-value'
function calculations.
Question Answer

What is the purpose of the 'exp-value' function in Expectimax pseudocode? To calculate the expected value of a given state.

What does the variable 'v' represent in the 'exp-value' function? It represents the cumulative expected value.

How is the probability of each successor calculated in the 'exp-value' Using the 'probability' function for each successor.
function?

What does the 'value' function represent in the 'exp-value' function? It represents the value of a specific successor.

What does the '8/36' represent in the context of the Expectimax It seems to be an incomplete or unclear part of the pseudocode.
pseudocode?

What is the formula for calculating the Expectimax? Number of favorable outcomes divided by the total number of possible
outcomes.

In the context of a game, what does the Expectimax algorithm calculate? The expected value of different possible moves.

How is the Expectimax different from the Minimax algorithm? It considers the probability of different events occurring, while Minimax
does not.

What type of decision-making process is the Expectimax algorithm Probabilistic decision-making.


commonly used for?

What does the 'max' in Expectimax represent? The maximum value that can be achieved by making a particular move.

What does the 'expecti' in Expectimax represent? The expected value based on the probability of different events.

What is Expectimax Pruning? A search algorithm that is a variation of the minimax algorithm, used in
decision-making and game theory.

What is the purpose of Expectimax Pruning? To efficiently search through the game tree and make optimal decisions
in games with chance events.

How does Expectimax Pruning differ from the minimax algorithm? It accounts for chance events and averages over all possible outcomes
at a given state, rather than assuming the worst-case scenario.

In what type of games is Expectimax Pruning particularly useful? Games with uncertainty or chance events, such as card games or
games with dice.

What is Expectimax Pruning? A search algorithm that extends the minimax algorithm to work with
chance nodes.

How does Expectimax Pruning differ from the minimax algorithm? It works with chance nodes, while minimax does not.

What is the purpose of Expectimax Pruning? To efficiently search through game trees with uncertain outcomes.

In what context is Expectimax Pruning commonly used? In game theory and artificial intelligence for decision making.

What is Depth-Limited Expectimax? A search algorithm used in artificial intelligence.

What does the '11/36' represent in the context of Depth-Limited It could represent a probability or a specific outcome in a game or
Expectimax? decision-making scenario.

What does '11 = 36' signify in the context of Depth-Limited Expectimax? It could represent an equality condition or a specific comparison in a
game or decision-making scenario.

What is Depth-Limited Expectimax? A search algorithm that combines elements of depth-limited search and
the expectimax algorithm.

What does the '11/36' represent in the context of Depth-Limited It could represent a probability or a specific outcome in a game
Expectimax? scenario.

What does '11 = 36' signify in the context of Depth-Limited Expectimax? It could represent an equation or a comparison of values in a specific
game state.

What is Depth-Limited Expectimax? A search algorithm that combines elements of depth-limited search and
the expectimax algorithm.

What does the '11/36' represent in the context of Depth-Limited It could represent a probability or a specific outcome within the
Expectimax? algorithm.

What does '11 = 36' signify in the context of Depth-Limited Expectimax? It could represent an equality condition or a specific comparison within
Question Answer

the algorithm.

What is the probabilistic model used in expectimax search for the It could be a simple uniform distribution or a sophisticated model
opponent's behavior? requiring computation.

What does a chance node represent in expectimax search? It represents any outcome out of our control, such as the opponent or
environment.

How are the probabilities for chance nodes specified in expectimax search? They are specified as a distribution over its outcomes, for example,
12/36.

What kind of tree search should you use if your opponent is running a depth Expectimax.
2 minimax, using the result 80% of the time, and moving randomly
otherwise?

What is required to figure out EACH chance node’s probabilities in Running a simulation of your opponent.
Expectimax?

Why does the process of figuring out EACH chance node’s probabilities in Because it requires simulating your opponent, which can be
Expectimax get slow quickly? time-consuming.

What is the nice property of minimax in terms of game tree simulation? It all collapses into one game tree.

What is the equation for the given modeling assumption? 14 = 36

Is the given modeling assumption accurate? No, it is not accurate.

What is the correct relationship between 14 and 36? 14 is not equal to 36.

What is dangerous optimism? Assuming chance when the world is adversarial.

What is dangerous pessimism? Assuming the worst case when it's not likely.

What does 15/36 equal to? 15 = 36.

What search algorithm did Pacman use? Depth 4 search.

What type of eval function did Pacman use? One that avoids trouble.

What search algorithm did Ghost use? Depth 2 search.

What type of eval function did Ghost use? One that seeks Pacman.

What are the names of the Pacman videos mentioned? randGhostExpPac, advGhostMiniPac, miniGhostExpPac,
randGhostMiniPac.

What is the result of the equation 16 = 36? False.

What is 17 equal to? 36.

What is the ratio of 17 to 36? 17 = 36.

What is the Expectiminimax algorithm? It is a variation of the Minimax algorithm where the environment
includes an extra 'random agent' player that moves after each Min/Max
agent.

How does the environment in Expectiminimax differ from Minimax? It includes an extra 'random agent' player that moves after each
Min/Max agent.

What does each node in Expectiminimax compute? The appropriate combination of its children.

What is the result of 18 = 36? False.

How many possible rolls are there with 2 dice in Backgammon? 21 possible rolls.

How many legal moves are there in Backgammon at depth 2? 20 legal moves.

What happens to the probability of reaching a given search node as the It shrinks.
depth increases?

How is the usefulness of search affected as the depth increases? It is diminished.


Question Answer

What is the impact of limiting depth in Backgammon? It is less damaging.

What is trickier when the depth is limited in Backgammon? Pruning.

What are the components of historic AI TDGammon's strategy? Depth-2 search, very good evaluation function, and reinforcement
learning.

What is the significance of TDGammon in the history of AI? It was the 1st AI world champion in any game.

What is the probability of rolling a 19 out of 36 in Backgammon? 19/36.

What is the generalization of minimax for games that are not zero-sum or Multi-Agent Utilities.
have multiple players?

What are the characteristics of terminals in multi-agent utilities? Terminals have utility tuples.

How are node values represented in multi-agent utilities? Node values are also utility tuples.

What is the strategy for each player in multi-agent utilities? Each player maximizes its own component.

What can the generalization of minimax in multi-agent utilities give rise to? Cooperation and competition dynamically.

What is 21 divided by 36? 0.5833

Why should we average utilities instead of using minimax? Averaging utilities allows for the consideration of the expected utility,
while minimax focuses on minimizing the maximum possible loss.

What is the principle of maximum expected utility? A rational agent should choose the action that maximizes its expected
utility, given its knowledge.

Where do utilities come from? Utilities are derived from the preferences and values of the
decision-maker.

How do we know if such utilities even exist? The existence of utilities is based on the consistency of
decision-making and revealed preferences.

How do we know that averaging even makes sense? Averaging makes sense when it provides a reasonable representation
of the decision-maker's preferences and values.

What if our behavior (preferences) can’t be described by utilities? In such cases, alternative models or frameworks may be used to
describe and analyze decision-making behavior.

What is important when using worst-case minimax reasoning? Ensuring better states have higher evaluations.

What does insensitivity to monotonic transformations mean in the context of It means that the ordering of states is more important than the scale of
minimax reasoning? evaluations.

What is the relationship between 23 and 36 in the context of minimax 23 is equal to 36.
reasoning?

What is the concept of worst-case minimax reasoning? The terminal function scale doesn’t matter, the goal is to have better
states with higher evaluations and insensitivity to monotonic
transformations.

What is insensitivity to monotonic transformations in the context of minimax It means that the ordering of states should remain the same despite
reasoning? monotonic transformations.

What is the concept of average-case expectimax reasoning? It requires meaningful magnitudes for evaluations.

What does the expression '23/36' represent in the given context? It represents a ratio or fraction.

What does '23 = 36' imply in the given context? It implies that 23 is equal to 36.

What are utilities in the context of decision-making? Functions that describe an agent’s preferences from outcomes to real
numbers.

Where do utilities come from? Utilities summarize the agent’s goals and can be hard-wired to let
behaviors emerge.

How are rational preferences summarized in decision-making? As a utility function.

Why do we hard-wire utilities instead of letting agents pick utilities? To let behaviors emerge based on the predetermined goals.
Question Answer

Why don’t we let agents pick utilities? To ensure that behaviors align with the overall goals and objectives.

Why don’t we prescribe behaviors? To allow for flexibility and adaptation based on the utility function and
goals.

What is the outcome of 25 divided by 36? The outcome is 25/36.

What is the value of 25 compared to 36? 25 is equal to 36.

What is 25/36 equal to? 25 = 36

What is the result of 25/36? 25 = 36

Express 25/36 as an equation. 25 = 36

What is 25 divided by 36? Approximately 0.6944

What is the result of 25 equals 36? False

What is 25 divided by 36? 25/36

What is the result of 25 equals 36? False

What is the rationality of 27/36? 27/36 = 3/4

Simplify the fraction 27/36. 27/36 = 3/4

Express 27/36 in simplest form. 27/36 = 3/4

What is the Axiom of Transitivity in rational preferences? If A leads to B and B leads to C, then A leads to C.

What is the implication of intransitive preferences? An agent can be induced to give away all of its money.

What would an agent with preference C do if B leads to C? Pay 1 cent to get B.

What would an agent with preference B do if A leads to B? Pay 1 cent to get A.

What would an agent with preference A do if C leads to A? Pay 1 cent to get C.

What is the MEU principle? Choosing the action that maximizes expected utility.

What does the MEU principle state about rationality? An agent can be entirely rational without representing or manipulating
utilities and probabilities.

Who proposed the Maximum Expected Utility (MEU) principle? Ramsey (1931); von Neumann & Morgenstern (1944).

What function exists according to the MEU principle? A real-valued function U such that U(A) >= U(B), A >= B.

How are values assigned by U in the MEU principle? Values assigned by U preserve preferences of both prizes and
lotteries.

What is 31 divided by 36? 0.8611 (rounded to four decimal places)

What are Normalized utilities? They add up to 1:0.

What is a Micromort? It represents a one-millionth chance of death and is useful for


assessing and reducing product risks.

What are QALYs used for? They are used for medical decisions involving substantial risk and
represent Quality-Adjusted Life Years.

What is the equation for the utility function under a positive linear U0(x) = k1U(x) + k2, where k1 > 0.
transformation?

What can be determined with deterministic prizes only? Only ordinal utility, which establishes a total order on prizes.

What is the standard approach to assessing human utilities? Comparing a prize to a standard lottery.

How is the standard lottery defined in the assessment of human utilities? It is defined as the 'best possible prize' with probability p and the 'worst
possible catastrophe' with probability 1-p.
Question Answer

What is done to the lottery probability p in the assessment of human It is adjusted until indifference is reached between the prize and the
utilities? lottery.

What is the resulting utility in the assessment of human utilities? (1 - p), which is a value in the range of [0; 1].

Does money behave as a utility function? No, money does not behave as a utility function.

What is the utility of having money or being in debt? The utility is calculated using the expected monetary value and the
utility function.

What is the formula for Expected Monetary Value (EMV)? EMV(L) = p * X + (1 - p) * Y

What is the formula for Utility (U)? U(L) = p * U($X) + (1 - p) * U($Y)

Is the utility typically less than the utility of the Expected Monetary Value? Yes, typically U(L) < U(EMV(L)).

Are people generally risk-averse? Yes, people are generally risk-averse.

How do people behave when deep in debt? When deep in debt, people are risk-prone.

What is the famous example from Allais (1953) related to? Human rationality.

What are the options A, B, C, and D in the example? A: [0:80; $4k; 0:20; $0], B: [1:00; $3k; 0:00; $0], C: [0:20; $4k; 0:80;
$0], D: [0:25; $3k; 0:75; $0].

What is the preference order of most people in the example? B > A, C > D.

What contradiction is observed in the example? 35/36 = 35 = 36.

What sections of the Russell & Norvig book are suggested reading? Section 6.2-6.5 and 15.1-15.3.

What is the total number of suggested sections to read? 36.

You might also like