0% found this document useful (0 votes)

46 views13 pages

Midterm Review

CS 188 Fall 24

Uploaded by

sara172425

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views13 pages

Midterm Review

CS 188 Fall 24

Uploaded by

sara172425

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

CS 188 Introduction to

Fall 2024 Artificial Intelligence Midterm Review

Q1. Ants: Escape!
An ant wakes up and finds itself in a spider’s maze!

• The maze is an 𝑀-by-𝑁 rectangle.

• Legal actions: {Forward, TurnLeft, TurnRight}.

• Transition model: Forward moves the ant 1 square in the direction it’s facing, unless there is a wall in front. The two
turning actions rotate the ant by 90 degrees to face a different direction.
• Action cost: Each action costs 1.

• Start state: The ant starts at (𝑠𝑥 , 𝑠𝑦 ) facing North.

• Goal test: Returns true when the ant reaches the exit at 𝐺 = (𝑔𝑥 , 𝑔𝑦 ).

(a) (i) What’s the minimum state space size 𝑆 for this task?

𝑆=
(ii) Now suppose there are 𝐾 ants, where each ant 𝑖 must reach a distinct goal location 𝐺𝑖 ; any number of ants can
occupy the same square; and action costs are a sum of the individual ants’ step costs. What’s the minimum state
space size for this task, expressed in terms of 𝐾 and 𝑆?

(iii) Now suppose that each ant 𝑖 can exit at any of the goal locations 𝐺𝑗 , but no two ants can occupy the same square
if they are facing the same direction. What’s the minimum state space size for this task, expressed in terms of 𝐾
and 𝑆?

(iv) Now suppose, once again, that each ant 𝑖 must reach its own exit at 𝐺𝑖 , and no two ants can occupy the same square
∑
if they are facing the same direction. Let 𝐻 = 𝑖 ℎ∗𝑖 , where ℎ∗𝑖 is the optimal cost for ant 𝑖 to reach goal 𝐺𝑖 when
it is the only ant in the maze. Is 𝐻 admissible for the 𝐾-ant problem? Select all appropriate answers.
□ Yes, because for any multiagent problem the sum of individual agent costs, with each agent solving a
subproblem separately, is always a lower bound on the joint cost.
□ Yes, because 𝐻 is the exact cost for a relaxed version of the 𝐾-ant problem.
□ Yes, because the "no two ants. . . " condition can only make the true cost larger than 𝐻, not smaller.
□ No, because some ants can exit earlier than others so the sum may overestimate the total cost.
□ No, it should be max𝑖 rather than ∑𝑖 .
# None of the above

1
(b) The ant is alone again in the maze. Now, the spider will return in 𝑇 timesteps, so the ant must reach an exit in 𝑇 or fewer
actions. Any sequence with more than 𝑇 actions doesn’t count as a solution.
In this part, we’ll address this by solving the original problem and checking the resulting solution. That is, suppose 𝑝 is
a problem and 𝐴 is a search algorithm; 𝐴(𝑝) returns a solution 𝑠, and 𝓁(𝑠) is the length (number of actions) of 𝑠, where
𝓁(failure) = ∞. Let 𝑝𝑇 be 𝑝 with the added time limit 𝑇 . Then, given 𝐴, we can define a new algorithm 𝐴′ (𝑝𝑇 ) as follows:

• 𝑠 ← 𝐴(𝑝); if 𝓁(𝑠) ≤ 𝑇 then return 𝑠 else return failure.

(i) # T # F Suppose 𝐴 is an optimal algorithm for 𝑝, action costs are 1; then 𝐴′ is optimal for 𝑝𝑇 .
(ii) # T # F Suppose 𝐴 is a complete algorithm for 𝑝; then 𝐴′ is complete for 𝑝𝑇 .
(iii) # T # F Suppose 𝐴 is an optimal algorithm for 𝑝, and action costs may be any nonnegative real number;
then 𝐴′ is optimal for 𝑝𝑇 .

(c) Now we attempt to solve the time-limited problem by modifying the problem definition (specifically, the states, legal
actions in each state, and/or goal test) appropriately so that regular, unmodified search algorithms will automatically
avoid returning solutions with more than 𝑇 actions.
(i) Is this possible in general, for any problem where actions costs are all 1? Mark all correct answers.
□ Yes, by augmenting the state space only.
□ Yes, by augmenting the state space and modifying the goal test.
□ Yes, by modifying the goal test only.
□ Yes, by augmenting the state space and modifying the legal actions.
□ Yes, by modifying the legal actions only.
# No, it’s not possible in general.
(ii) Subpart removed for time.

2
Q2. Informed Search
ℎ(𝐴) = 4 ℎ(𝐶) = 2
2
𝐴 𝐶
2 4
3
𝑆 𝐺
1 3
ℎ(𝑆) = 6 ℎ(𝐺) = 0
2
𝐵 𝐷
ℎ(𝐵) ℎ(𝐷) = 2

Search problem graph: S is the start state and G is the goal state.
Tie-break in alphabetical order.
ℎ(𝐵) is unknown and will be determined in the subquestions.

(a) In this question, refer to the graph above where the optimal path is 𝑆 → 𝐵 → 𝐷 → 𝐺. For each of the following subparts,
you will be asked to write ranges of ℎ(𝐵). You should represent ranges as ≤ ℎ(𝐵) ≤ . Heuristic values can be
any number including ±∞. For responses of ±∞, you can treat the provided inequalities as a strict inequality. If you
believe that there is no possible range, please write "None" in the left-hand box and leave the right box empty.
(i) What is the range for the heuristic to be admissible?
≤ ℎ(𝐵) ≤

(ii) What range of heuristic values for B would allow A* tree search to still return the optimal path (𝑆 → 𝐵 → 𝐷 → 𝐺)?
≤ ℎ(𝐵) ≤

(iii) Now assume that the edges in the graph are undirected (which is equivalent to having two directed edges that point
at both directions with the same cost as before). Regardless of whether the heuristic is consistent, what range of
heuristic values for B would allow A* tree search to still return the optimal path (𝑆 → 𝐵 → 𝐷 → 𝐺)?
≤ ℎ(𝐵) ≤

(b) Part b removed on this worksheet for cooking too hard, try it on your own time for extra practice.

3
Q3. Games
(a) Minimax and Alpha-Beta Pruning We have a two-player, zero-sum game with 𝑘 rounds. In each round, the maximizer
acts first and chooses from 𝑛 possible actions, then the minimizer acts next and chooses from 𝑚 possible actions. After
the minimizer’s 𝑘-th turn, the game finishes and we arrive at a utility value (leaf node). Both players behave optimally.
Explore nodes from left to right.

(i) What is the total number of leaf nodes in the game tree, in terms of 𝑚, 𝑛, 𝑘?
(ii) In the minimax tree below 𝑘 = 1, 𝑛 = 3, 𝑚 = 4.

5 7 3 6 8 2 10 7 4 9 1 0
A B C D E F G H I J K L

(1) What is the minimax value of the root?

(2) Which leaf nodes are pruned by alpha-beta? Mark the corresponding letters below.
□ A □ B □ C □ D □ E □ F □ G □ H □ I □ J □ K □ L # None

(iii) When 𝑘 = 1, in the best case (pruning the most nodes possible), which nodes can be pruned in the tree below?
□ A □ B □ C □ D □ E □ F □ G □ H □I□J□ K □ L # None

A B C D E F G H I J K L

(iv) Now consider the same 𝑘 = 1 but with a general 𝑚 and 𝑛 number of maximizer and minimizer actions respectively.
How many leaf nodes would be pruned in the best case? Express your answer in terms of 𝑚 and 𝑛.

4
(v) When 𝑘 = 2, 𝑛 = 2, 𝑚 = 2, in the best case, which of the leaves labeled A, B, C, D will be pruned?

A B C D

□ A □ B □ C □ D # None

(b) Chance Nodes Our maximizer agent is now playing against a non-optimal opponent. In each round, the maximizer acts
first, then the opponent acts next and chooses uniformly at random from 𝑚 possible actions.
(i) Subpart removed for time (simple expectimax)
(ii) Consider the game tree below where we now know that the opponent always has 𝑚 = 4 possible moves and chooses
uniformly at random. We also know that all leaf node utility values are less than or equal to 𝑐 = 10.

5 3 8 4 10 6 7 9 1 0 9 2
A B C D E F G H I J K L

(1) What is the value of the root node?

(2) Which nodes can be pruned?

□ A □ B □ C □ D □ E □ F □ G □ H □ I □ J □ K □ L # None

(c) Now, let’s generalize this idea for pruning on expectimax. We consider expectimax game trees where the opponent always
chooses uniformly at random from 𝑚 possible moves, and all leaf nodes have values no more than 𝑐. These facts are known
by the maximizer player.
(i) Let’s say that our depth-first traversal of this game tree is currently at a chance node and has seen 𝑘 children of this
node so far. The sum of the children seen so far is 𝑆. What is the largest possible value that this chance node can
take on? (Answer in terms of 𝑚, 𝑐, 𝑘, and 𝑆)

5
(ii) [OPTIONAL]
Now, let’s write an algorithm for computing the root value. Fill in the pseudocode below.
Note that 𝑚 and 𝑐 are constants that you should use in your pseudocode. To find the value at the root, we will start
with a call to MAX-VALUE(root, −∞).
1: function MAX-VALUE (state, 𝛼)
2: if state has no successors then return eval(state)
3: v←
4: for each successor n of state do
5: v←

6:
7: return v
8:
9: function EXP-VALUE(state, 𝛼)
10: if state has no successors then return eval(state)
11: S←0
12: k←1
13: for each successor n of state do
14: S←S+
15: ci ← "expression from (c)(i) using m, c, k, and S"

16: if then
17: return S/m
18: k←k+1
19: return S/m

6
Q4. CSPs: The Zookeeper
You are a newly appointed zookeeper, and your first task is to find rooms for all of the animals.

The zoo has three animals: the Iguana (𝐼), Jaguar (𝐽 ), and Koala (𝐾).

Each animal needs to be assigned to one of four rooms: the North room (N), East room (E), South room (S), or West room (W),
subject to the following constraints:

1. The jaguar cannot share a room with any other animal.

2. The iguana and koala must be in different rooms.
3. The koala can only be in the East room or the South room.

(a) Consider the first constraint: “The jaguar cannot share a room with any other animal.”
Can this constraint be expressed using only binary constraints on the three variables 𝐼, 𝐽 , 𝐾?

# Yes, it can be expressed as 1 binary constraint.

# Yes, it can be expressed as 2 different binary constraints.
# Yes, it can be expressed as 4 different binary constraints.
# No, this is necessarily a unary constraint.
# No, this is necessarily a higher-order constraint.

(b) Suppose we enforce unary constraints, and then assign the jaguar to the South room. The remaining values in each domain
would be:

Iguana: North, East, South, West

Jaguar: South
Koala: East, South

In the table below, mark each value that would be removed by running forward-checking after this assignment.

North East South West

Iguana □ □ □ □
Koala □ □
(c) Regardless of your answer to the previous subpart, suppose we’ve done some filtering and the following values remain in
each variable’s domain:

Iguana: East, West

Jaguar: South
Koala: East

Are all the arcs in this CSP consistent?

# Yes, all arcs are consistent.

# No, only 𝐾 → 𝐼 is not consistent.
# No, only 𝐼 → 𝐾 is not consistent.
# No, 𝐾 → 𝐼 and 𝐼 → 𝐾 are both not consistent.
# No, only 𝐽 → 𝐼 is not consistent.
# No, only 𝐼 → 𝐽 is not consistent.
# No, 𝐽 → 𝐼 and 𝐼 → 𝐽 are both not consistent.

7
The constraints, repeated here for your convenience:

1. The jaguar cannot share a room with any other animal.

2. The iguana and koala must be in different rooms.
3. The koala can only be in the East room or the South room.

(d) Regardless of your answer to the previous subpart, suppose we start over and just enforce the third constraint. Then the
remaining values in each domain are:

Iguana: North, East, South, West

Jaguar: North, East, South, West
Koala: East, South

What does the minimum remaining values (MRV) heuristic suggest doing next?

# Assign North or West to a variable next.

# Assign East or South to a variable next.
# Assign a value to Koala next.
# Assign a value to Iguana or Jaguar next.

(e) Again, consider the CSP after just enforcing the third constraint:

Iguana: North, East, South, West

Jaguar: North, East, South, West
Koala: East, South

Which assignment would the least constraining value (LCV) heuristic prefer?
# Assign North to Jaguar.
# Assign East to Jaguar.
# LCV is indifferent between these two assignments.

(f) Suppose we add another constraint:

“The West room can contain at most one animal.”
Can this constraint be expressed using only binary constraints on the three variables 𝐼, 𝐽 , 𝐾?
# Yes, it can be expressed as 1 binary constraint.
# Yes, it can be expressed as 2 different binary constraints.
# Yes, it can be expressed as 3 different binary constraints.
# No, this is necessarily a higher-order constraint.

8
Q5. MDPs: Flying Pacman
Pacman is in a 1-dimensional grid with squares labeled 0 through 𝑛, inclusive, as shown below:

0 1 2 3 4 5 n-1 n

Pacman’s goal is to reach square 𝑛 as cheaply as possible. From state 𝑛, there are no more actions or rewards available.

At any given state, if Pacman is not in 𝑛, Pacman has two actions to choose from:

• Run: Pacman deterministically advances to the next state (i.e. from state 𝑖 to state 𝑖 + 1). This action costs Pacman $1.

• Fly: With probability 𝑝, Pacman directly reaches state 𝑛. With probability 1 − 𝑝, Pacman is stuck in the same state. This
action costs Pacman $2.

(a) Fill in the blank boxes below to define the MDP. 𝑖 represents an arbitrary state in the range {0, … , 𝑛 − 1}.

𝑠 𝑎 𝑠′ 𝑇 (𝑠, 𝑎, 𝑠′ ) 𝑅(𝑠, 𝑎, 𝑠′ )

𝑖 Run 𝑖+1

𝑖 Fly 𝑖

𝑖 Fly

For the next three subparts, assume that 𝛾 = 1.

Let 𝜋𝑅 denote the policy of always selecting Run, and 𝜋𝐹 denote the policy of always selecting Fly.

Compute the values of these two policies. Your answer should be an expression, possibly in terms of 𝑛, 𝑝, and/or 𝑖.

(b) What is 𝑉 𝜋𝑅 (𝑖)?

(c) What is 𝑉 𝜋𝐹 (𝑖)?

∑∞
Hint: Recall that the mean of a geometric distribution with success probability 𝑝 is 𝑘 = 1 𝑘(1 − 𝑝)
𝑘−1 𝑝 = 1∕𝑝.

(d) Given the results of the two previous subparts, we can now find the optimal policy for the MDP.
Which of the following are true? Select all that apply. (Hint: consider what value of 𝑖 makes 𝑉 𝜋𝑅 (𝑖) and 𝑉 𝜋𝐹 (𝑖) equal.)
Note: ⌈𝑥⌉ is the smallest integer greater than or equal to 𝑥.
□ If 𝑝 < 2∕𝑛, Fly is optimal for all states.
□ If 𝑝 < 2∕𝑛, Run is optimal for all states.
□ If 𝑝 ≥ 2∕𝑛, Fly is optimal for all 𝑖 ≥ ⌈𝑛 − 2∕𝑝⌉ and Run is optimal for all 𝑖 < ⌈𝑛 − 2∕𝑝⌉.
□ If 𝑝 ≥ 2∕𝑛, Run is optimal for all 𝑖 ≥ ⌈𝑛 − 2∕𝑝⌉ and Fly is optimal for all 𝑖 < ⌈𝑛 − 2∕𝑝⌉.
# None of the above.

9
Regardless of your answers to the previous parts, consider the following modified transition and reward functions (which may
not correspond to the original problem). As before, once Pacman reaches state 𝑛, no further actions or rewards are available.

For each modified MDP and discount factor, select whether value iteration will converge to a finite set of values.

(e) 𝛾 = 1
𝑠 𝑎 𝑠′ 𝑇 (𝑠, 𝑎, 𝑠′ ) 𝑅(𝑠, 𝑎, 𝑠′ )
𝑖 Run 𝑖+1 1.0 +5
𝑖 Fly 𝑖+1 1.0 +5

# Value iteration converges

# Value iteration does not converge
# Not enough information to decide

(f) 𝛾 = 1
𝑠 𝑎 𝑠′ 𝑇 (𝑠, 𝑎, 𝑠′ ) 𝑅(𝑠, 𝑎, 𝑠′ )
𝑖 Run 𝑖+1 1.0 +5
𝑖 Fly 𝑖−1 1.0 +5

# Value iteration converges

# Value iteration does not converge
# Not enough information to decide

(g) 𝛾 < 1
𝑠 𝑎 𝑠′ 𝑇 (𝑠, 𝑎, 𝑠′ ) 𝑅(𝑠, 𝑎, 𝑠′ )
𝑖 Run 𝑖+1 1.0 +5
𝑖 Fly 𝑖−1 1.0 +5

# Value iteration converges

# Value iteration does not converge
# Not enough information to decide

10
Q6. RL: Rest and ReLaxation
Consider the grid world MDP below, with The agent observes the following samples in this grid world:
unknown transition and reward functions.
𝑠 𝑎 𝑠′ 𝑅(𝑠, 𝑎, 𝑠′ )
A B C E East F −1
E East H −1
D E F E South H −1
G H I E South H −1
E South D −1

Reminder: In grid world, each non-exit action succeeds with some probability. If an action (e.g. North) fails, the agent moves
in one of the cardinally adjacent directions (e.g. East or West) with equal probability, but will not move in the opposite direction
(e.g. South).

Let 𝑝 denote the probability that an action succeeds.

In this question, we will consider 3 strategies for estimating the transition function in this MDP.

Strategy 1: The agent does not know the rules of grid world, and runs model-based learning to directly estimate the transition
function.

(a) From the samples provided, what is 𝑇̂ (E, South, H)?

# 0 # 3∕5 # 2∕3 # Not enough

# 1∕5 # 4∕5 # 1∕2 information
# 2∕5 # 1∕3 # 1

(b) From the samples provided, what is 𝑇̂ (E, West, D)?

# 0 # 3∕5 # 2∕3 # Not enough

# 1∕5 # 4∕5 # 1∕2 information
# 2∕5 # 1∕3 # 1

Strategy 2: The agent knows the rules of grid world, and runs model-based learning to estimate 𝑝. Then, the agent uses the
estimated 𝑝̂ to estimate the transition function.

(c) From the samples provided, what is 𝑝,

̂ the estimated probability of an action succeeding?

# 0 # 3∕5 # 2∕3 # Not enough

# 1∕5 # 4∕5 # 1∕2 information
# 2∕5 # 1∕3 # 1

(d) Based on 𝑝,
̂ what is 𝑇̂ (E, West, D)?

# 0 # 3∕5 # 2∕3 # Not enough

# 1∕5 # 4∕5 # 1∕2 information
# 2∕5 # 1∕3 # 1

(e) Select all true statements about comparing Strategy 1 and Strategy 2.
□ Strategy 1 will usually require fewer samples to estimate the transition function to the same accuracy threshold.
□ There are fewer unknown parameters to learn in Strategy 1.
□ Strategy 1 is more prone to overfitting on samples.
# None of the above

11
The grid world and samples, repeated for your convenience:

𝑠 𝑎 𝑠′ 𝑅(𝑠, 𝑎, 𝑠′ )
A B C E East F −1
E East H −1
D E F E South H −1
G H I E South H −1
E South D −1

Strategy 3: The agent knows the rules of grid world, and uses an exponential moving average to estimate 𝑝. Then, the agent
uses the estimated 𝑝̂ to estimate the transition function.

(f) Consider this update equation: 𝑝̂ ← (1 − 𝛼)𝑝̂ + (𝛼)𝑥

Given a sample (𝑠, 𝑎, 𝑠′ ), what value of 𝑥 should be used in the corresponding update?
# 𝑅(𝑠, 𝑎, 𝑠′ )
# 1.0 if the action succeeded, and 0.0 otherwise
# 1.0 if the action failed, and 0.0 otherwise
# 𝑉 (𝑠)
# 𝑉 (𝑠′ )

(g) Select all true statements about comparing Strategy 2 and Strategy 3.
□ Strategy 2 gives a more accurate estimate, because it is the maximum likelihood estimate.
□ Strategy 3 gives a more accurate estimate, because it gives more weight to more recent samples.
□ Strategy 3 can be run with samples streaming in one at a time.
# None of the above

The rest of the question is independent from the previous subparts.

Suppose the agent runs Q-learning in this grid world, with learning rate 0 < 𝛼 < 1, and discount factor 𝛾 = 1.

(h) After iterating through the samples once, how many learned Q-values will be nonzero?

# 0 # 1 # 2 # 3 # 4 # >4

(i) After iterating through the samples repeatedly until convergence, how many learned Q-values will be nonzero?

# 0 # 1 # 2 # 3 # 4 # >4

12
Q7. Potpourri [OPTIONAL]
(a) Below is a list of task environments. For each of the sub-parts, choose all the environments in the list that falls into the
specified type.
A: The competitive rock-paper-scissors game
B: The classical Pacman game (with ghosts following a fixed path)
C: Solving a crossword puzzle
D: A robot that removes defective cookies from a cookie conveyor belt
(i) Which of the environments can be formulated as single-agent? □ A □ B □ C □ D
(ii) Which of the environments are static? □ A □ B □ C □ D
(iii) Which of the environments are discrete? □ A □ B □ C □ D
(b) (i) # T # F Reflex agents cannot be rational.
(ii) # T # F There exist task environments in which no pure reflex agent can behave rationally.

(c) (i) # T # F If the costs can be arbitrarily large negative numbers in a search problem, then any optimal
search algorithm in this problem will need to explore the entire state space.
(ii) # T # F Depth-first search always expands at least as many nodes as A* search with an admissible
heuristic.

(d) (i) # T # F Local beam search with a beam size of 1 reduces to Hill climbing.
(ii) # T # F Local beam search with one initial state and no limit on the number of states retained reduces
to depth-first search.

cs188 sp19 Final Sol
No ratings yet
cs188 sp19 Final Sol
28 pages
Machine Learning Week 2 Coursera
100% (1)
Machine Learning Week 2 Coursera
4 pages
(Pieter Abbeel Midterm 1 ) Spring 2010
No ratings yet
(Pieter Abbeel Midterm 1 ) Spring 2010
12 pages
cs188 Fa07 mt1 Sol
No ratings yet
cs188 Fa07 mt1 Sol
8 pages
AI - AI417DE01 Lab - MidTerm Exam Review 23.2A
No ratings yet
AI - AI417DE01 Lab - MidTerm Exam Review 23.2A
7 pages
MT 2009 Answers
No ratings yet
MT 2009 Answers
8 pages
Indian Institute of Technology, Kharagpur Class Test 1, 2019-20
No ratings yet
Indian Institute of Technology, Kharagpur Class Test 1, 2019-20
4 pages
Midterm: CS 188 Spring 2019 Introduction To Artificial Intelligence
No ratings yet
Midterm: CS 188 Spring 2019 Introduction To Artificial Intelligence
23 pages
Midterm: Th. Nov.03, 2pm - 3pm
No ratings yet
Midterm: Th. Nov.03, 2pm - 3pm
3 pages
Artificial Intelligence CS188 Midterm1 Solutions
No ratings yet
Artificial Intelligence CS188 Midterm1 Solutions
28 pages
cse473sp19_midterm
No ratings yet
cse473sp19_midterm
12 pages
AI Exam Papers
No ratings yet
AI Exam Papers
7 pages
Ai2022 End Merged
No ratings yet
Ai2022 End Merged
28 pages
Artificial Intelligence - Quiz1 Review Sol
No ratings yet
Artificial Intelligence - Quiz1 Review Sol
25 pages
Midterm F06 Solutions
No ratings yet
Midterm F06 Solutions
12 pages
Assignment 2 Solutions PDF
No ratings yet
Assignment 2 Solutions PDF
13 pages
Artificial Intelligence (AI 2002) Sessional-I Exam: National University of Computer and Emerging Sciences
No ratings yet
Artificial Intelligence (AI 2002) Sessional-I Exam: National University of Computer and Emerging Sciences
7 pages
2425-CSC14003-23CLC1-Quiz01
No ratings yet
2425-CSC14003-23CLC1-Quiz01
8 pages
Solutions by Mike Sokolovsky, Sam Ogden, Ahmedul Kabir, and Prof. Ruiz
No ratings yet
Solutions by Mike Sokolovsky, Sam Ogden, Ahmedul Kabir, and Prof. Ruiz
9 pages
Midterm 1: CS 188 Summer 2019 Introduction To Artificial Intelligence
No ratings yet
Midterm 1: CS 188 Summer 2019 Introduction To Artificial Intelligence
13 pages
Sp12 Midterm1 Solutions
No ratings yet
Sp12 Midterm1 Solutions
17 pages
Sp12 Midterm1 Solutions
No ratings yet
Sp12 Midterm1 Solutions
17 pages
ai-ex-21
No ratings yet
ai-ex-21
10 pages
Exercises
No ratings yet
Exercises
50 pages
Game-Playing & Adversarial Search
No ratings yet
Game-Playing & Adversarial Search
68 pages
Mid I
No ratings yet
Mid I
6 pages
Final 2016 Answers
No ratings yet
Final 2016 Answers
9 pages
Homework 2 Solutions
No ratings yet
Homework 2 Solutions
6 pages
Midterm Exam: CS 188 Introduction To Fall 2008 Artificial Intelligence
No ratings yet
Midterm Exam: CS 188 Introduction To Fall 2008 Artificial Intelligence
12 pages
Cs 4511 Mid1 s18
No ratings yet
Cs 4511 Mid1 s18
2 pages
02 Homework Search 2
No ratings yet
02 Homework Search 2
2 pages
ICS 271 - Solutions Homework 2: F (N) G (N) H (N)
No ratings yet
ICS 271 - Solutions Homework 2: F (N) G (N) H (N)
6 pages
Disc02 Examprep
No ratings yet
Disc02 Examprep
4 pages
Umbc CMSC 471 Midterm Exam 15 March 2017
No ratings yet
Umbc CMSC 471 Midterm Exam 15 March 2017
5 pages
cs461 hw1
No ratings yet
cs461 hw1
14 pages
Midterm Exam Key
No ratings yet
Midterm Exam Key
6 pages
202 2018 1 b-7 PDF
No ratings yet
202 2018 1 b-7 PDF
12 pages
Ai - Faizan - s2018266056 Final
No ratings yet
Ai - Faizan - s2018266056 Final
13 pages
Sample Mid Term ACI
No ratings yet
Sample Mid Term ACI
3 pages
Fa13 Midterm1 Solutions
No ratings yet
Fa13 Midterm1 Solutions
21 pages
ICS 171 HW # 2 Solutions: Nverma@ics - Uci.edu
No ratings yet
ICS 171 HW # 2 Solutions: Nverma@ics - Uci.edu
5 pages
CC511 Week 4
No ratings yet
CC511 Week 4
57 pages
1505760060csen 3111
No ratings yet
1505760060csen 3111
6 pages
Assignment 3 without 9,10 ans
No ratings yet
Assignment 3 without 9,10 ans
3 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
62 pages
Week-11 - Adversarial Search
No ratings yet
Week-11 - Adversarial Search
50 pages
cs188 Su19 Final - Sol
No ratings yet
cs188 Su19 Final - Sol
29 pages
AI-UNIT-2 PPT
No ratings yet
AI-UNIT-2 PPT
135 pages
IOC-AI-ESE-2023-24-13-04-2024
No ratings yet
IOC-AI-ESE-2023-24-13-04-2024
3 pages
Fa11 Final
No ratings yet
Fa11 Final
21 pages
Final: CS 188 Spring 2014 Introduction To Artificial Intelligence
No ratings yet
Final: CS 188 Spring 2014 Introduction To Artificial Intelligence
28 pages
CO304 AI 2023 End May (Better Quality)
No ratings yet
CO304 AI 2023 End May (Better Quality)
4 pages
Cst401 Scheme
No ratings yet
Cst401 Scheme
9 pages
Midterm s02 Solns
No ratings yet
Midterm s02 Solns
13 pages
AIML CIA II Question Paper ECE Remedial Anskey
No ratings yet
AIML CIA II Question Paper ECE Remedial Anskey
33 pages
Elaborate Artificial Intelligence
No ratings yet
Elaborate Artificial Intelligence
8 pages
ai_exo2
No ratings yet
ai_exo2
8 pages
PracticeSolution 1
No ratings yet
PracticeSolution 1
15 pages
Final Practice
No ratings yet
Final Practice
68 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Calculus I Essentials
From Everand
Calculus I Essentials
Editors of REA
1/5 (1)
VLSI Design Automation Syllabus Modified
No ratings yet
VLSI Design Automation Syllabus Modified
3 pages
Gradually Varied Flow
No ratings yet
Gradually Varied Flow
7 pages
NN Examples Matlab
No ratings yet
NN Examples Matlab
91 pages
ADA
No ratings yet
ADA
2 pages
Ptspunit2 VRC
No ratings yet
Ptspunit2 VRC
164 pages
7 Is QB 2023-2024
No ratings yet
7 Is QB 2023-2024
36 pages
Paper Formated Single Column Jan 22
No ratings yet
Paper Formated Single Column Jan 22
28 pages
Maximum and Minimum
No ratings yet
Maximum and Minimum
3 pages
AdvDIP CommSys
No ratings yet
AdvDIP CommSys
2 pages
Voice Recognition and Voice Comparison Using Machine Learning Techniques: A Survey
No ratings yet
Voice Recognition and Voice Comparison Using Machine Learning Techniques: A Survey
7 pages
Ec2020 2016 Exam Questions and Answers
No ratings yet
Ec2020 2016 Exam Questions and Answers
33 pages
Lec05 Invariance For Web
No ratings yet
Lec05 Invariance For Web
34 pages
Computer Eng Study Plan
No ratings yet
Computer Eng Study Plan
1 page
07 - Fourier Transform
No ratings yet
07 - Fourier Transform
21 pages
Section 9
No ratings yet
Section 9
2 pages
Inverse Trig Worksheet 1. Find The EXACT Value of The Following
No ratings yet
Inverse Trig Worksheet 1. Find The EXACT Value of The Following
3 pages
CSE5014 2
No ratings yet
CSE5014 2
2 pages
Pragya
100% (1)
Pragya
31 pages
Control Systems - Signal Flow Graphs
No ratings yet
Control Systems - Signal Flow Graphs
4 pages
Hate Speech Detection Using Lstm and NLp Sushan Pratihar 3 Page
No ratings yet
Hate Speech Detection Using Lstm and NLp Sushan Pratihar 3 Page
13 pages
AOA VIVA Q&A
No ratings yet
AOA VIVA Q&A
5 pages
Frank R Hampel, The Influence Curve and its Role in Robust Estimation
No ratings yet
Frank R Hampel, The Influence Curve and its Role in Robust Estimation
12 pages
Be Mechanical Engineering Semester 6 2023 February Computer Aided Engineering Cae Pattern 2019
No ratings yet
Be Mechanical Engineering Semester 6 2023 February Computer Aided Engineering Cae Pattern 2019
2 pages
اشارات ونظم
No ratings yet
اشارات ونظم
17 pages
Lab 6
No ratings yet
Lab 6
5 pages
1 s2.0 S1746809423000812 Main
No ratings yet
1 s2.0 S1746809423000812 Main
12 pages
Lecture 07 - Goal Programming
No ratings yet
Lecture 07 - Goal Programming
24 pages
Digital Signals Processing ﺔﯿﻤﻗﺮﻟا تارﺎﺷﻻا ﺔﺠﻟﺎﻌﻣ: Lectures (Nine and Ten)
No ratings yet
Digital Signals Processing ﺔﯿﻤﻗﺮﻟا تارﺎﺷﻻا ﺔﺠﻟﺎﻌﻣ: Lectures (Nine and Ten)
14 pages
6 Vibration F22-2DOF
No ratings yet
6 Vibration F22-2DOF
29 pages

Midterm Review

Uploaded by

Midterm Review

Uploaded by

CS 188 Introduction to

Fall 2024 Artificial Intelligence Midterm Review

• The maze is an 𝑀-by-𝑁 rectangle.

• Start state: The ant starts at (𝑠𝑥 , 𝑠𝑦 ) facing North.

• 𝑠 ← 𝐴(𝑝); if 𝓁(𝑠) ≤ 𝑇 then return 𝑠 else return failure.

(1) What is the minimax value of the root?

(1) What is the value of the root node?

(2) Which nodes can be pruned?

1. The jaguar cannot share a room with any other animal.

# Yes, it can be expressed as 1 binary constraint.

Iguana: North, East, South, West

North East South West

Iguana: East, West

Are all the arcs in this CSP consistent?

# Yes, all arcs are consistent.

1. The jaguar cannot share a room with any other animal.

Iguana: North, East, South, West

# Assign North or West to a variable next.

Iguana: North, East, South, West

(f) Suppose we add another constraint:

For the next three subparts, assume that 𝛾 = 1.

(b) What is 𝑉 𝜋𝑅 (𝑖)?

(c) What is 𝑉 𝜋𝐹 (𝑖)?

# Value iteration converges

# Value iteration converges

# Value iteration converges

Let 𝑝 denote the probability that an action succeeds.

(a) From the samples provided, what is 𝑇̂ (E, South, H)?

# 0 # 3∕5 # 2∕3 # Not enough

(b) From the samples provided, what is 𝑇̂ (E, West, D)?

# 0 # 3∕5 # 2∕3 # Not enough

(c) From the samples provided, what is 𝑝,

# 0 # 3∕5 # 2∕3 # Not enough

# 0 # 3∕5 # 2∕3 # Not enough

(f) Consider this update equation: 𝑝̂ ← (1 − 𝛼)𝑝̂ + (𝛼)𝑥

The rest of the question is independent from the previous subparts.

You might also like