Notes
2025-02-06
0.1 Preface
0.1.1 Course Concept
Objective: The course aims at giving students a solid (and often somewhat theoretically ori-
ented) foundation of the basic concepts and practices of artificial intelligence. The course will
predominantly cover symbolic AI – also sometimes called “good old-fashioned AI (GofAI)” – in
the first semester and offers the very foundations of statistical approaches in the second. Indeed, a
full account of subsymbolic, machine-learning-based AI deserves its own specialization courses and
requires many more mathematical prerequisites than we can assume in this course.
Context: The course “Artificial Intelligence” (AI 1 & 2) at FAU Erlangen is a two-semester
course in the “Wahlpflichtbereich” (specialization phase) in semester 5/6 of the bachelor program
“Computer Science” at FAU Erlangen. It is also available as a (somewhat remedial) course in the
“Vertiefungsmodul Künstliche Intelligenz” in the Computer Science Master’s program.
Prerequisites: AI-1 & 2 builds on the mandatory courses in the FAU bachelor’s program, in
particular the course “Grundlagen der Logik in der Informatik” [Glo], which already covers a lot
of the materials usually presented in the “knowledge and reasoning” part of an introductory AI
course. The AI 1 & 2 course also minimizes overlap with that course.
The course is relatively elementary; we expect that any student who has attended the mandatory
CS courses at FAU Erlangen can follow it.
Open to external students: Other bachelor programs are increasingly co-opting the course as a
specialization option. There is no inherent restriction to computer science students in this course.
Students with other study biographies – e.g. students from other bachelor programs or external
Master's students – should be able to pick up the prerequisites when needed.
0.1.4 Acknowledgments
Materials: Most of the materials in this course are based on Russell/Norvig's book “Artificial
Intelligence — A Modern Approach” (AIMA [RN95]). Even the slides are based on a LaTeX-based
slide set, but heavily edited. The section on search algorithms is based on materials obtained from
Bernhard Beckert (then Uni Koblenz), which is in turn based on AIMA. Some extensions have
been inspired by an AI course by Jörg Hoffmann and Wolfgang Wahlster at Saarland University
in 2016. Finally Dennis Müller suggested and supplied some extensions on AGI. Florian Rabe,
Max Rapp and Katja Berčič have carefully re-read the text and pointed out problems.
All course materials have been restructured and semantically annotated in the STEX format,
so that we can base additional semantic services on them.
AI Students: The following students have submitted corrections and suggestions to this and
earlier versions of the notes: Rares Ambrus, Ioan Sucan, Yashodan Nevatia, Dennis Müller, Si-
mon Rainer, Demian Vöhringer, Lorenz Gorse, Philipp Reger, Benedikt Lorch, Maximilian Lösch,
Luca Reeb, Marius Frinken, Peter Eichinger, Oskar Herrmann, Daniel Höfer, Stephan Mattejat,
Matthias Sonntag, Jan Urfei, Tanja Würsching, Adrian Kretschmer, Tobias Schmidt, Maxim On-
ciul, Armin Roth, Liam Corona, Tobias Völk, Lena Voigt, Yinan Shao, Michael Girstl, Matthias
Vietz, Anatoliy Cherepantsev, Stefan Musevski, Matthias Lobenhofer, Philipp Kaludercic, Di-
warkara Reddy, Martin Helmke, Stefan Müller, Dominik Mehlich, Paul Martini, Vishwang Dave,
Arthur Miehlich, Christian Schabesberger, Vishaal Saravanan, Simon Heilig, Michelle Fribrance,
Wenwen Wang, Xinyuan Tu, Lobna Eldeeb.
0.1 Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
0.1.1 Course Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
0.1.2 Course Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
0.1.3 This Document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
0.1.4 Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
0.1.5 Recorded Syllabus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
1 Preliminaries 1
1.1 Administrative Ground Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Getting Most out of AI-1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Learning Resources for AI-1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 AI-Supported Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Preliminaries
In this chapter, we want to get all the organizational matters out of the way, so that we can get into
the discussion of artificial intelligence content unencumbered. We will talk about the necessary
administrative details, go into how students can get the most out of the course, talk about where the
various resources provided with the course can be found, and finally introduce the ALeA system,
an experimental learning support system (using AI methods) for the AI course.
Note: I do not literally presuppose the courses on the slide above – most of you do not have a
bachelor's degree from FAU, so you cannot have taken them. And indeed some of the content of
these courses is irrelevant for AI-1. Stating these courses is just the easiest way of specifying what
content I will be building on – and any graduate course has to build on something.
Many of you will have taken the moral equivalent of these courses in your undergraduate studies
at your home university. If you did not, you will have to somehow catch up on the content as we
go along in AI-1. This should be possible with enough motivation.
There are essentially three skillsets that are essential for AI-1:
1. A solid understanding and practical skill in programming (whatever programming language)
2. A good understanding and practice in using mathematical language to represent complex struc-
tures
3. A solid understanding of formal languages and grammars, as well as applied complexity theory
(basics of theoretical computer science).
Without (catching up on) these the AI-1 course will be quite frustrating and hard.
We will briefly go over the most important topics in ?? to synchronize concepts and notation.
Note that if you do not have a formal education in courses like the ones mentioned above you will
very probably have to do significant remedial work.
Now we come to a topic that is always interesting to the students: the grading scheme.
Assessment, Grades
Overall (Module) Grade:
Preparedness Quizzes
PrepQuizzes: Every Tuesday at 16:15 we start the lecture with a 10 min online quiz
– the PrepQuiz – about the material from the previous week. (starts in week 2)
Motivations: We do this to
https://courses.voll-ki.fau.de/quiz-dash/ai-1
You have to be logged into ALeA! (via FAU IDM)
This Thursday we will try out the PrepQuiz infrastructure with a pretest!
The test will also be used to refine the ALeA learner model, which may make
the learning experience in ALeA better. (see below)
Due to the current AI hype, the course Artificial Intelligence is very popular and thus many degree
programs at FAU have adopted it for their curricula. Sometimes the course setup that fits the
CS program does not fit the others very well; therefore there are some special conditions, which I
want to state here.
Just send me an e-mail and come to the exam, (we do the necessary admin)
Tell your program coordinator about AI-1/2 so that they remedy this situation
In “Wirtschafts-Informatik” you can only take AI-1 and AI-2 together in the “Wahlpflicht-
bereich”.
I can only warn of what I am aware, so if your degree program lets you jump through extra hoops,
please tell me and then I can mention them here.
Didactic Intuition: Homework assignments give you material to test your under-
standing and show you how to apply it.
Homeworks give no points, but without trying you are unlikely to pass the exam.
Homeworks will be mainly peer-graded in the ALeA system.
Didactic Motivation: Through peer grading students are able to see mistakes
in their thinking and can correct any problems in future assignments. By grading
assignments, students may learn how to complete assignments more accurately and
how to improve their future results. (not just us being lazy)
It is very well-established experience that without doing the homework assignments (or something
similar) on your own, you will not master the concepts, you will not even be able to ask sensible
questions, and you will take very little home from the course. Just sitting in the course and nodding is not
enough!
Homework/Tutorial Discipline:
Start early! (many assignments need more than one evening’s work)
Don’t start by sitting at a blank screen (talking & study groups help)
Humans will be trying to understand the text/code/math when grading it.
Go to the tutorials, discuss with your TA! (they are there for you!)
If you have questions please make sure you discuss them with the instructor, the teaching assistants,
or your fellow students. There are three sensible venues for such discussions: online in the lectures,
in the tutorials, which we discuss now, or in the course forum – see below. Finally, it is always a
very good idea to form study groups with your friends.
Goal 2: Allow you to ask any question you have in a protected environment.
Instructor/Lead TA: Florian Rabe (KWARC Postdoc)
Room: 11.137 @ Händler building, [email protected]
Tutorials: One each taught by Florian Rabe (lead); Yasmeen Shawat, Hatem
Mousa, Xinyuan Tu, and Florian Guthmann.
Collaboration
Definition 1.2.1. Collaboration (or cooperation) is the process of groups of agents
acting together for common, mutual benefit, as opposed to acting in competition
for selfish benefit. In a collaboration, every agent contributes to the common goal
and benefits from the contributions of others.
As we said above, almost all of the components of the AI-1 course are optional. That even applies
to attendance. But make no mistake, attendance is important to most of you. Let me explain, . . .
Note: There are two ways of learning: (both are OK, your mileage may vary)
Approach B: Read a book/papers (here: lecture notes)
Approach I: come to the lectures, be involved, interrupt the instructor whenever
you have a question.
The only advantage of I over B is that books/papers do not answer questions
FAU has issued a very insightful guide on using lecture videos. It is a good idea to heed these
recommendations, even if they seem annoying at first.
Catch up.
You will only pass the exam if you can do AI-1 yourself!
Intuition: AI tools like ChatGPT, Copilot, etc. (see also [She24])
can help you solve problems, (valuable tools in production situations)
but hinder learning if used for homeworks/quizzes, etc. (like driving instead of
jogging)
What (not) to do: (to get most of the brave new AI-supported world)
try out these tools to get a first-hand intuition what they can/cannot do
challenge yourself while learning so that you can also do it (mind over matter!)
The central idea in the AI4AI approach – using AI to support learning AI – and thus the ALeA
system is that we want to make course materials – i.e. what we give to students for preparing and
postparing lectures – more like teachers and study groups (only available 24/7) than like static
books.
ALeA Status: The ALeA system is deployed at FAU for over 1000 students
taking eight courses
(some) students use the system actively (our logs tell us)
reviews are mostly positive/enthusiastic (error reports pour in)
The ALeA AI-1 page is the central entry point for working with the ALeA system. You can get
to all the components of the system, including two presentations of the course contents (notes-
and slides-centric ones), the flashcards, the localized forum, and the quiz dashboard.
We now come to the heart of the ALeA system: its learning support services, which we will now
briefly introduce. Note that this presentation is not really sufficient to understand what you may
be getting out of them; you will have to try them and interact with them sufficiently that the
learner model can get a good estimate of your competencies to adapt the results to you.
Example 1.4.5 (Guided Tour). A guided tour for a concept c assembles defini-
tions/etc. into a self-contained mini-course culminating at c.
c = countable
Note that this is only an initial collection of learning support services; we are constantly working
on additional ones. Look out for feature notifications on the upper right hand of
the ALeA screen.
While the learning support services up to now have been addressed to individual learners, we
now turn to services addressed to communities of learners, ranging from study groups with three
learners, to whole courses, and even – eventually – all the alumni of a course, if they have not
de-registered from ALeA.
Currently, the community aspect of ALeA only consists in localized interactions with the course
materials.
The ALeA system uses the semantic structure of the course materials to localize some interactions
that otherwise often come from separate applications. Here we see two:
1. one for reporting content errors – and thus making the material better for all learners – and
2. a localized course forum, where forum threads can be attached to learning objects.
Localized comments induce a thread in the ALeA forum (like the StudOn
Forum, but targeted towards specific learning objects.)
Let us briefly look into how the learning support services introduced above might work, focusing
on where the necessary information might come from. Even though some of the concepts in the
discussion below may be new to AI-1 students, it is worth looking into them. Bear with us as we
try to explain the AI components of the ALeA system.
ALeA ≙ Data-Driven & AI-enabled Learning Assistance
[Diagram: Learner Model, Rhetoric/Didactic Model]
understand the objects and their properties they are talking about
have readymade formulations for how to convey them best
the domain model and didactic relations to determine the order of LOs
the learner model to determine what to show
the rhetoric relations to make the dialogue coherent
We can use the same four models discussed in the space of guided tours to deploy additional
learning support services, which we now discuss.
We have already seen above how the learner model can drive the drilling with flashcards. It can
also be used for the configuration of card stacks by configuring a domain (e.g. a section in the
course materials) and a competency threshold. We now come to a very important issue that we
always face when we build AI systems that interface with humans. Most web technology companies
either take the approach "the user pays for the services with their personal data, which is sold
on" or integrate advertising for remuneration. Neither is acceptable in a university setting.
But abstaining from monetizing personal data still leaves the problem how to protect it from
intentional or accidental misuse. Even though the GDPR has quite extensive exceptions for
research, the ALeA system – a research prototype – adheres to the principles and mandates of
the GDPR. In particular it makes sure that personal data of the learners is only used in learning
support services directly or indirectly initiated by the learners themselves.
ALeA Promise: The ALeA team does the utmost to keep your personal data
safe. (SSO via FAU IDM/eduGAIN, ALeA trust zone)
ALeA Privacy Axioms:
1. ALeA only collects learner model data about logged-in users.
2. Personally identifiable learner model data is only accessible to its subject
(delegation possible)
3. Learners can always query the learner model about its data.
4. All learner model data can be purged without negative consequences (except
usability deterioration)
5. Logging into ALeA is completely optional.
Observation: Authentication for bonus quizzes is somewhat less optional, but
you can always purge the learner model later.
So, now that you have an overview over what the ALeA system can do for you, let us see what
you have to concretely do to be able to use it.
To use the ALeA system, you will have to log in via SSO: (do it now)
go to https://courses.voll-ki.fau.de/course-home/ai-1,
Even if you did not understand some of the AI jargon or the underlying methods (yet), you should
be good to go for using the ALeA system in your day-to-day work.
Chapter 2
AI – Who?, What?, When?, Where?, and Why?
We start the course by giving an overview of (the problems, methods, and issues of ) Artificial
Intelligence, and what has been achieved so far.
Naturally, this will dwell mostly on philosophical aspects – we will try to understand what
the important issues might be, what questions we should even be asking, what the most
important avenues of attack may be, and where AI research is being carried out.
In particular the discussion will be very non-technical – we have very little basis to discuss
technicalities yet. But stay with me, this will drastically change very soon. A Video Nugget
covering the introduction of this chapter can be found at https://fau.tv/clip/id/21467.
Maybe we can get around the problems of defining “what artificial intelligence is”, by just describing
the necessary components of AI (and how they interact). Let’s have a try to see whether that is
more informative.
Inference
Perception
Language understanding
Emotion
Note that this list of components is controversial as well. Some say that it lumps together cognitive
capacities that should be distinguished or forgets others, . . . . We state it here much more to get
AI-1 students to think about the issues than to make it normative.
in outer space: outer space systems need autonomous control, since remote control is impossible
due to the time lag.
in artificial limbs: the user controls the prosthesis via existing nerves and can e.g. grip a sheet
of paper.
in household appliances: the iRobot Roomba vacuums, mops, and sweeps in corners, . . . , parks,
charges, and discharges; general robotic household help is on the horizon.
in hospitals: in the USA 90% of the prostate operations are carried out by RoboDoc; Paro is a
cuddly robot that eases solitude in nursing homes.
The AI Conundrum
Observation: Reserving the term “Artificial Intelligence” has been quite a land
grab!
But: researchers at the Dartmouth Conference (1956) really thought they would
solve/reach AI in two/three decades.
Consequence: AI still asks the big questions. (and still promises answers soon)
Another Consequence: AI as a field is an incubator for many innovative tech-
nologies.
AI Conundrum: Once AI solves a subfield it is called “computer science”.
(becomes a separate subfield of CS)
Example 2.2.1. Functional/Logic Programming, automated theorem proving,
Planning, machine learning, Knowledge Representation, . . .
Still another Consequence: AI research was alternately flooded with money and cut off
brutally.
All of these phenomena can be seen in the growth of AI as an academic discipline over the course
of its now over 70 year long history.
[Timeline figure: Turing Test; Dartmouth Conference; AI Winter 1 (1974–1980), Lighthill report;
AI Winter 2 (1987–1994); WWW ↝ data/computing explosion; AI becomes scarily effective and
ubiquitous; AI consequences, biases, regulation; excitement fades, some applications profit a lot;
the AI bubble bursts and the next AI winter comes.]
Of course, the future of AI is still unclear, we are currently in a massive hype caused by the advent
of deep neural networks being trained on all the data of the Internet, using the computational
power of huge compute farms owned by an oligopoly of massive technology companies – we are
definitely in an AI summer.
But AI as an academic community and the tech industry also make outrageous promises, and
the media pick them up and distort them out of proportion, . . . So public opinion could flip again, sending
AI into the next winter.
interact with it via sensors and actuators. Here, the main method for realizing
intelligent behavior is by learning from the world.
As a consequence, the field of Artificial Intelligence (AI) is an engineering field at the intersection
of computer science (logic, programming, applied statistics), Cognitive Science (psychology, neu-
roscience), philosophy (can machines think, what does that mean?), linguistics (natural language
understanding), and mechatronics (robot hardware, sensors).
Subsymbolic AI and in particular machine learning is currently hyped to such an extent, that
many people take it to be synonymous with “Artificial Intelligence”. It is one of the goals of this
course to show students that this is a very impoverished view.
We combine the topics in this way in this course, not only because this reproduces the histor-
ical development but also as the methods of statistical and subsymbolic AI share a common
basis.
It is important to notice that all approaches to AI have their application domains and strong points.
We will now see that exactly the two areas where symbolic AI and statistical/subsymbolic AI
have their respective fortes correspond to natural application areas.
Consumer tasks: consumer grade applications have tasks that must be fully
generic and wide coverage. ( e.g. machine translation like Google Translate)
Producer tasks: producer grade applications must be high-precision, but can be
restricted to a narrow domain. (see the machine tool example below)
[Diagram: producer tasks require near-100% precision.]
General Rule: Subsymbolic AI is well suited for consumer tasks, while symbolic
AI is better suited for producer tasks.
A domain of producer tasks I am interested in: mathematical/technical documents.
An example of a producer task – indeed this is where the name comes from – is the case of a
machine tool manufacturer T , which produces digitally programmed machine tools worth multiple
million Euro and sells them into dozens of countries. Thus T must also provide comprehensive
machine operation manuals, a non-trivial undertaking, since no two machines are identical and
they must be translated into many languages, leading to hundreds of documents. As those manuals
share a lot of semantic content, their management should be supported by AI techniques. It is
critical that these methods maintain a high precision, since operation errors can easily lead to very
costly machine damage and loss of production. On the other hand, the domain of these manuals is
quite restricted: a machine tool has only a couple of hundred components, which can be described
by a couple of thousand attributes.
Indeed companies like T employ high-precision AI techniques like the ones we will cover in this
course successfully; they are just not so much in the public eye as the consumer tasks.
One can usually defuse public worries about “is AI going to take control over the world” by just
explaining the difference between strong AI and weak AI clearly.
I would like to add a few words on AGI that – if you adopt them; they are not universally accepted
– will strengthen the arguments differentiating between strong and weak AI.
Kohlhase's View: Weak AI is here, strong AI is very far off. (not in my lifetime)
But even if that is true, weak AI will affect all of us deeply in everyday life.
Example 2.4.4. You should not train to be an accountant or truck driver!
(bots will replace you soon)
I want to conclude this section with an overview over the recent protagonists – both personal and
institutional – of AGI.
Planning Frameworks
Planning Algorithms
Planning and Acting in the real world
Observation: The ability to represent knowledge about the world and to draw
logical inferences is one of the central components of intelligent behavior.
Thus: reasoning components of some form are at the heart of many AI systems.
Research in the KWARC group covers a variety of topics, ranging from the foundations of
mathematics to relatively applied web information systems. I will try to organize them into three
pillars here.
For all of these areas, we are looking for bright and motivated students to work with us. This
can take various forms: theses, internships, and paid student assistantships.
Sciences like physics or geology, and engineering need high-powered equipment to perform mea-
surements or experiments. Computer science and in particular the KWARC group needs high-
powered human brains to build systems and conduct thought experiments.
The KWARC group may not always have as much funding as other AI research groups, but
we are very dedicated to giving the best possible research guidance to the students we supervise.
So if this appeals to you, please come by and talk to us.
Part I
This part of the lecture notes sets the stage for the technical parts of the course by establishing
a common framework (Rational Agents) that gives context and ties together the various methods
discussed in the course.
After having seen what AI can do and where AI is being employed today (see ??), we will now
Logic Programming
We will now learn a new programming paradigm: logic programming, which is one of the most
influential paradigms in AI. We are going to study Prolog (the oldest and most widely used) as a
concrete example of ideas behind logic programming and use it for our homeworks in this course.
As Prolog is a representative of a programming paradigm that is new to most students, pro-
gramming will feel weird and tedious at first. But once we subtract the unusual syntax and program
organization, logic programming really only amounts to recursive programming, just as in func-
tional programming (the other declarative programming paradigm). So the usual advice applies:
keep staring at it and practice on easy examples until the pain goes away.
Logic Programming
Idea: Use logic as a programming language!
We state what we know about a problem (the program) and then ask for results
(what the program would compute).
Example 3.1.1.
How to achieve this? Restrict a logic calculus sufficiently that it can be used as a
computational procedure.
Remark: This idea leads to a totally new programming paradigm: logic programming.
We now formally define the language of Prolog, starting off with the atomic building blocks.
human(leibniz).
human(sokrates).
greek(sokrates).
fallible(X):−human(X).
The first three lines are Prolog facts and the last a rule.
The whole point of writing down a knowledge base (a Prolog program with knowledge about the
situation) is that we do not have to write down all the knowledge, but only a (small) subset, from which
the rest follows. We have already seen how this can be done: with logic. For logic programming
we will use a logic called “first-order logic”, which we will not formally introduce here.
Definition 3.1.8. The knowledge base given by a Prolog program is that set of facts
that can be derived from it by Modus Ponens (MP), conjunction introduction (∧I), and instantiation (Subst):
MP: from A and A ⇒ B, derive B.
∧I: from A and B, derive A ∧ B.
Subst: from A, derive [B/X](A).
?? introduces a very important distinction: that between a Prolog program and the knowledge
base it induces. Whereas the former is a finite, syntactic object (essentially a string), the latter
may be an infinite set of facts, which represents the totality of knowledge about the world or the
aspects described by the program.
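As a small worked illustration (mine, not from the original notes): with the “fallible Greeks” program above, the fact fallible(sokrates) belongs to the induced knowledge base, since it can be derived using the rules of ??:
  human(X) ⇒ fallible(X)                          (rule from the program)
  human(sokrates) ⇒ fallible(sokrates)            (Subst with [sokrates/X])
  human(sokrates)                                 (fact from the program)
  fallible(sokrates)                              (MP)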
As knowledge bases can be infinite, we cannot pre-compute them. Instead, logic programming
languages compute fragments of the knowledge base by need, i.e. whenever a user wants to check
membership; we call this approach querying: the user enters a query expression and the system
answers yes or no. This answer is computed in a depth-first search process.
Problem: Knowledge bases can be big and even infinite. (cannot pre-compute)
Example 3.1.10. The knowledge base induced by the Prolog program
nat(zero).
nat(s(X)) :− nat(X).
Definition 3.1.12. The Prolog interpreter keeps backchaining from the top to the
bottom of the program until the query
succeeds, i.e. contains no more goals, or (answer: true)
fails, i.e. backchaining becomes impossible. (answer: false)
Note that backchaining replaces the current query with the body of the rule suitably instantiated.
For rules with a long body this extends the list of current goals, but for facts (rules without a
body), backchaining shortens the list of current goals. Once there are no goals left, the Prolog
interpreter finishes and signals success by issuing the string true.
If no rules match the current subgoal, then the interpreter terminates and signals failure with the
string false.
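For instance (a small sketch of my own, assuming the nat program from above has been consulted), the query nat(s(s(zero))) succeeds after backchaining twice over the rule and once over the fact:
?- nat(s(s(zero))).
% goal matches the head nat(s(X)) with X = s(zero); new goal: nat(s(zero))
% goal matches nat(s(X)) again with X = zero;       new goal: nat(zero)
% nat(zero) matches the fact nat(zero); no goals remain
true.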
We can extend querying from simple yes/no answers to programs that return values by simply
using variables in queries. In this case, the Prolog interpreter returns a substitution.
3.2. PROGRAMMING AS SEARCH 41
In ?? the first backchaining step binds the variable X to the query variable Y, which gives us the
two subgoals has_wheels(Y,4),has_motor(Y). which again have the query variable Y. The next
backchaining step binds this to mybmw, and the third backchaining step exhausts the subgoals.
So the query succeeds with the (overall) answer substitution Y = mybmw. With this setup, we
can already do the “fallible Greeks” example from the introduction.
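A minimal sketch (assuming the facts and rule shown at the beginning of this chapter have been consulted): the “fallible Greeks” question is a conjunctive query with a variable.
?- fallible(X), greek(X).
X = sokrates
true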
We will introduce working with the interpreter using unary natural numbers as examples: we
first add the fact to the knowledge base
unat(zero).
which asserts that the predicate unat is true on the term zero. Generally, we can add a fact to
the knowledge base either by writing it into a file (e.g. example.pl) and then “consulting it” by
writing one of the following three commands into the interpreter:
[example]
consult(’example.pl’).
consult(’example’).
or by directly typing
assert(unat(zero)).
into the Prolog interpreter. Next tell Prolog about the following rule
assert(unat(suc(X)) :− unat(X)).
which gives the Prolog runtime an initial (infinite) knowledge base, which can be queried by
?− unat(suc(suc(zero))).
Even though we can use any text editor to program Prolog, running Prolog in a modern
editor with language support is incredibly much nicer than at the command line, because you can see
the whole history of what you have done. It is better for debugging too.
We say that a goal G matches a head H, iff we can make them equal by replacing
variables in H with terms.
We can force backtracking to compute more answers by typing ;.
Note: With the Prolog search procedure detailed above, computation can easily go into infinite
loops, even though the knowledge base could provide the correct answer. Consider for instance
the simple program
(Footnote: unat is short for “unary natural numbers”; we cannot use the predicate nat and the
constructor function s here, since they are already used above.)
p(X):− p(X).
p(X):− q(X).
q(X).
If we query this with ?− p(john), then DFS will go into an infinite loop because Prolog by default
expands the first clause. However, we can conclude that p(john) is true if we start with
the second clause.
In fact this is a necessary feature and not a bug for a programming language: we need to
be able to write non-terminating programs, since the language would not be Turing complete
otherwise. The argument can be sketched as follows: we have seen that for Turing machines the
halting problem is undecidable. So if all Prolog programs were terminating, then Prolog would be
weaker than Turing machines and thus not Turing complete.
We will now fortify our intuition about the Prolog search procedure by an example that extends
the setup from ?? by a new choice of a vehicle that could be a car (if it had a motor).
Backtracking by Example
Example 3.2.2. We extend ??:
has_wheels(mytricycle,3).
has_wheels(myrollerblade,3).
has_wheels(mybmw,4).
has_motor(mybmw).
car(X):-has_wheels(X,3),has_motor(X). % cars sometimes have three wheels
car(X):-has_wheels(X,4),has_motor(X). % and sometimes four.
?- car(Y).
?- has_wheels(Y,3),has_motor(Y). % backtrack point 1
Y = mytricycle % backtrack point 2
?- has_motor(mytricycle).
FAIL % fails, backtrack to 2
Y = myrollerblade % backtrack point 2
?- has_motor(myrollerblade).
FAIL % fails, backtrack to 1
?- has_wheels(Y,4),has_motor(Y).
Y = mybmw
?- has_motor(mybmw).
Y=mybmw
true
In general, a Prolog rule of the form A:−B,C reads as A, if B and C. If we want to express A if
B or C, we have to express this as two separate rules A:−B and A:−C and leave the choice of which
one to use to the search procedure.
In ?? we indeed have two clauses for the predicate car/1; one each for the cases of cars with three
and four wheels. As the three-wheel case comes first in the program, it is explored first in the
search process.
Recall that at every point where the Prolog interpreter has the choice between two clauses for a
predicate, it chooses the first and leaves a backtrack point. In ?? this happens first for the predicate
car/1, where we explore the case of three-wheeled cars. The Prolog interpreter immediately has
to choose again – between the tricycle and the rollerblade, which both have three wheels. Again,
it chooses the first and leaves a backtrack point. But as tricycles do not have motors, the subgoal
has_motor(mytricycle) fails and the interpreter backtracks to the chronologically nearest backtrack
point (the second one) and tries to fulfill has_motor(myrollerblade). This fails again, and the next
backtrack point is point 1 – note the stack-like organization of backtrack points which is in keeping
with the depth-first search strategy – which chooses the case of four-wheeled cars. This ultimately
succeeds as before with Y = mybmw.
Now we can directly write the recursive equations X + 0 = X (base case) and
X + s(Y ) = s(X + Y ) into the knowledge base.
add(X,zero,X).
add(X,s(Y),s(Z)) :− add(X,Y,Z).
expt(X,zero,s(zero)).
expt(X,s(Y),Z) :− expt(X,Y,W), mult(X,W,Z).
Note: Viewed through the right glasses logic programming is very similar to functional program-
ming; the only difference is that we are using (n + 1)-ary relations rather than n-ary functions. To see
how this works let us consider the addition function/relation example above: instead of a binary
function + we program a ternary relation add, where relation add(X,Y,Z) means X + Y = Z. We
start with the same defining equations for addition, rewriting them to relational style.
The first equation is straight-forward via our correspondence and we get the Prolog fact
add(X,zero,X). For the equation X + s(Y ) = s(X + Y ) we have to work harder, the straight-
forward relational translation add(X,s(Y),s(X+Y)) is impossible, since we have only partially
replaced the function + with the relation add. Here we take refuge in a very simple trick that we
can always do in logic (and mathematics of course): we introduce a new name Z for the offending
expression X + Y (using a variable) so that we get the fact add(X,s(Y ),s(Z)). Of course this is
not universally true (remember that this fact would say that “X + s(Y ) = s(Z) for all X, Y , and
Z”), so we have to extend it to a Prolog rule add(X,s(Y),s(Z)):−add(X,Y,Z). which relativizes to
mean “X + s(Y ) = s(Z) for all X, Y , and Z with X + Y = Z”.
Indeed the rule implements addition as a recursive predicate; we can see that the recursion
relation is terminating, since the left hand sides have one more constructor for the successor
function. The examples for multiplication and exponentiation can be developed analogously, but
we have to use the naming trick twice.
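The multiplication relation used by expt above is not shown in this excerpt; a sketch of my own, applying the naming trick to the equations X · 0 = 0 and X · s(Y) = X · Y + X, could look like this:
mult(X,zero,zero).
mult(X,s(Y),Z) :- mult(X,Y,W), add(X,W,Z).   % W names X·Y, then Z = X + W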
We now apply the same principle of recursive programming with predicates to other examples
to reinforce our intuitions about the principles.
Example 3.2.4. We can also use the add relation for subtraction without changing
the implementation. We just use variables in the “input positions” and ground terms
in the other two. (possibly very inefficient “generate and test approach”)
?−add(s(zero),X,s(s(s(zero)))).
X = s(s(zero))
true
Note: Note that the is relation does not allow “generate and test” inversion as it insists on the
right hand side being ground. In our example above, this is not a problem if we call fib with
the first (“input”) argument a ground term. Indeed, if it matches the last rule with a goal ?− fib(g,Y),
where g is a ground term, then g−1 and g−2 are ground and thus D and E are bound to the
(ground) result terms. This makes the input arguments in the two recursive calls ground, and we
get ground results for Z and W, which allows the last goal to succeed with a ground result for
Y. Note as well that re-ordering the body literals of the rule so that the recursive calls are made
before the computation literals will lead to failure.
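The fib program discussed here is not reproduced at this point in the notes; the following is a sketch under my own assumptions that matches the description (D and E name the decremented arguments, Z and W the recursive results):
fib(0,1).
fib(1,1).
fib(X,Y) :- X > 1, D is X - 1, E is X - 2, fib(D,Z), fib(E,W), Y is Z + W.
% ?- fib(5,Y).  gives  Y = 8
% as noted above, moving the recursive calls before the "is" literals breaks this:
% D and E would then still be unbound when they are needed.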
We will now add the primitive data structure of lists to Prolog; they are constructed by prepending
an element (the head) to an existing list (which becomes the rest list or “tail” of the constructed
one).
append([],L,L).
append([X|R],L,[X|S]):−append(R,L,S).
46 CHAPTER 3. LOGIC PROGRAMMING
reverse([],[]).
reverse([X|R],L):−reverse(R,S),append(S,[X],L).
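A short usage sketch (queries of my own): thanks to the missing input/output distinction, append can also be run “backwards” to enumerate all ways of splitting a list.
?- append([1,2],[3],L).
L = [1,2,3]
?- append(X,Y,[1,2]).
X = [], Y = [1,2] ;
X = [1], Y = [2] ;
X = [1,2], Y = []
?- reverse([1,2,3],R).
R = [3,2,1]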
Logic programming is the third large programming paradigm (together with functional program-
ming and imperative programming).
From a programming practice point of view it is probably best understood as “relational program-
ming” in analogy to functional programming, with which it shares a focus on recursion.
The major difference to functional programming is that “relational programming” does not have
a fixed input/output distinction; it is this fixed distinction that makes the control flow in functional
programs very direct and predictable. Thanks to the underlying search procedure, we can sometimes
make use of the flexibility afforded by logic programming.
If the problem solution involves search (and depth first search is sufficient), we can just get by
with specifying the problem and letting the Prolog interpreter do the rest. In ?? we just specify
that list Xs can be sorted into Ys, iff Ys is a permutation of Xs and Ys is ordered. Given a concrete
(input) list Xs, the Prolog interpreter will generate all permutations Ys of Xs via the predicate
perm/2 and then test whether they are ordered.
This is a paradigmatic example of logic programming. We can (sometimes) directly use the
specification of a problem as a program. This makes the argument for the correctness of the
program immediate, but may make the program execution non-optimal.
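The program referenced in ?? is not shown here; a sketch under my own naming assumptions (mysort, perm, and ordered are hypothetical names; select/3 is the standard list predicate) could be:
mysort(Xs,Ys) :- perm(Xs,Ys), ordered(Ys).    % the specification, used as a program
perm([],[]).
perm(L,[X|R]) :- select(X,L,T), perm(T,R).    % pick any element X, permute the rest T
ordered([]).
ordered([_]).
ordered([X,Y|R]) :- X =< Y, ordered([Y|R]).
% ?- mysort([3,1,2],Ys).  gives  Ys = [1,2,3]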
We “define” the computational behavior of the predicate rev, but the list constructors
[. . .] are just used to construct lists from arguments.
Example 3.2.16 (Trees and Leaf Counting). We represent (unlabelled) trees via
the function t from tree lists to trees. For instance, a balanced binary tree of depth
2 is t([t([t([]),t([])]),t([t([]),t([])])]). We count leaves by
leafcount(t([]),1).
leafcount(t([V]),W) :− leafcount(V,W).
leafcount(t([X|R]),Y) :− leafcount(X,Z), leafcount(t(R),W), Y is Z + W.
RTFM (≙ “read the fine manuals”)
RTFM Resources: There are also lots of good tutorials on the web,
I personally like [Fis; LPN],
[Fla94] has a very thorough logic-based introduction,
In this chapter we will briefly recap some of the prerequisites from theoretical computer science
that are needed for understanding Artificial Intelligence 1.
                     performance
size        linear       quadratic    exponential
n           100n µs      7n² µs       2ⁿ µs
1           100 µs       7 µs         2 µs
5           .5 ms        175 µs       32 µs
10          1 ms         .7 ms        1 ms
45          4.5 ms       14 ms        1.1 Y
100         ...          ...          ...
1 000       ...          ...          ...
10 000      ...          ...          ...
1 000 000   ...          ...          ...
The last number in the rightmost column may surprise you. Does the run time really grow that
fast? Yes, as a quick calculation shows; and it becomes much worse, as we will see.
                     performance
size        linear       quadratic    exponential
n           100n µs      7n² µs       2ⁿ µs
1           100 µs       7 µs         2 µs
5           .5 ms        175 µs       32 µs
10          1 ms         .7 ms        1 ms
45          4.5 ms       14 ms        1.1 Y
100         100 ms       7 s          10¹⁶ Y
1 000       1 s          12 min       −
10 000      10 s         20 h         −
1 000 000   1.6 min      2.5 mon      −
So it does make a difference for larger computational problems what algorithm we choose. Consid-
erations like the one we have shown above are very important when judging an algorithm. These
evaluations go by the name of “complexity theory”.
Let us now recapitulate some notions of elementary complexity theory: we are interested in the
worst-case growth of the resources (time and space) required by an algorithm in terms of the sizes
of its arguments. Mathematically we look at the functions from input size to resource size and
classify them into “big-O” classes, abstracting from constant factors (which depend on the machine
the algorithm runs on and which we cannot control) and initial (algorithm startup) factors.
Definition 4.1.3. We say that an algorithm α that terminates in time t(n) for all
inputs of size n has running time T (α) := t.
Let S ⊆ N → N be a set of natural number functions, then we say that α has time
complexity in S (written T (α)∈S or colloquially T (α)=S), iff t∈S. We say α has
space complexity in S, iff α uses only memory of size s(n) on inputs of size n and
s∈S.
Time/space complexity depends on size measures. (no canonical one)
Definition 4.1.4. The following sets are often used for S in T (α):
Landau set     class name     rank       Landau set     class name     rank
O(1)           constant       1          O(n²)          quadratic      4
O(log₂(n))     logarithmic    2          O(nᵏ)          polynomial     5
O(n)           linear         3          O(kⁿ)          exponential    6
For AI-1: I expect that given an algorithm, you can determine its complexity class.
(next)
These are not all of the “big-O calculation rules”, but they are enough for most purposes.
Applications: Convince yourselves using the result above that
O(4n³ + 3n + 7^(1000n)) = O(2ⁿ)
O(n) ⊂ O(n·log₂(n)) ⊂ O(n²)
OK, that was the theory, . . . but how do we use that in practice?
What I mean by this is that given an algorithm, we have to determine the time complexity.
This is by no means a trivial enterprise, but we can do it by analyzing the algorithm instruction
by instruction as shown below.
As instructions in imperative programs can introduce new variables, which have their own time
complexity, we have to carry them around via the introduced context, which has to be defined
co-recursively with the time complexity. This makes ?? rather complex. The main two cases to
note here are
• the variable case, which “uses” the context Γ and
• the assignment case, which extends the introduced context by the time complexity of the value.
The other cases just pass around the given context and the introduced context systematically.
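As a small worked example of my own (not tied to the specific rules referenced above): consider a program that, for an input of size n, compares every element against every other element, each comparison costing a constant c. Summing the cost instruction by instruction gives T(n) = Σᵢ₌₁..ₙ Σⱼ₌₁..ₙ c = c·n², so T(n) ∈ O(n²) and the algorithm has quadratic time complexity.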
Let us now put one motivation for knowing about complexity theory into the perspective of the
job market; here the job as a scientist.
Please excuse the chemistry pictures; public imagery for CS is really just quite boring, and this is
what people think of when they say “scientist”. So, imagine that instead of a chemist in a lab, it is
me sitting in front of a computer.
But my 2nd attempt didn’t work either, which got me a bit agitated.
Ta-da . . . when, for once, I turned around and looked in the other direction – CAN one actually
solve this efficiently? – NP-hardness was there to rescue me.
The meat of the story is that there is no profit in trying to invent an algorithm that we could
have known cannot exist. Here is another image that may be familiar to you.
Example 4.1.9. Trying to find a sea route east to India (from Spain) (does not
exist)
Observation: Complexity theory saves you from spending lots of time trying to
invent algorithms that cannot exist.
It’s like, you’re trying to find a route to India (from Spain), and you presume it’s somewhere to
the east, and then you hit a coast, but no; try again, but no; try again, but no; ... if you don’t
have a map, that’s the best you can do. But NP hardness gives you the map: you can check
that there actually is no way through here. But what is this notion of NP-completeness alluded
to above? We observe that we can analyze the complexity of problems by the complexity of the
algorithms that solve them. This gives us a notion of what to expect from solutions to a given
problem class, and thus whether efficient (i.e. polynomial time) algorithms can exist at all.
to predict effects of its observations and actions to obtain a world model. In this section we recap
the basics of formal languages and grammars that form the basis of a compositional theory for
them.
Definition 4.2.2. Note that A⁰ = {⟨⟩}, where ⟨⟩ is the (unique) 0-tuple. With
the definition above we consider ⟨⟩ as the string of length 0, call it the empty
string, and denote it with ϵ.
Note: Sets ≠ strings, e.g. {1, 2, 3} = {3, 2, 1}, but ⟨1, 2, 3⟩ ≠ ⟨3, 2, 1⟩.
Notation: We will often write a string ⟨c1 , . . ., cn ⟩ as ”c1 . . .cn ”, for instance
”abc” for ⟨a, b, c⟩
Example 4.2.3. Take A = {h, 1, /} as an alphabet. Each of the members h, 1,
and / is a character. The vector ⟨/, /, 1, h, 1⟩ is a string of length 5 over A.
Definition 4.2.4 (String Length). Given a string s we denote its length with |s|.
We have multiple notations for concatenation, since it is such a basic operation, which is used
so often that we will need very short notations for it, trusting that the reader can disambiguate
based on the context.
Now that we have defined the concept of a string as a sequence of characters, we can go on to
give ourselves a way to distinguish between good strings (e.g. programs in a given programming
language) and bad strings (e.g. such with syntax errors). The way to do this is via the concept of a
formal language, which we are about to define.
Formal Languages
Definition 4.2.7. Let A be an alphabet, then we define the sets A⁺ := ⋃_{i∈ℕ⁺} Aⁱ
of nonempty strings and A∗ := A⁺ ∪ {ϵ} of strings.
Example 4.2.8. If A = {a, b, c}, then A∗ = {ϵ, a, b, c, aa, ab, ac, ba, . . . , aaa, . . . }.
Definition 4.2.9. A set L ⊆ A∗ is called a formal language over A.
Definition 4.2.10. We use c[n] for the string that consists of the character c
repeated n times.
Example 4.2.11. #[5] = ⟨#, #, #, #, #⟩
Example 4.2.12. The set M := {ba[n] | n ∈ ℕ} of strings that start with the character
b followed by an arbitrary number of a's is a formal language over A = {a, b}.
There is a common misconception that a formal language is something that is difficult to under-
stand as a concept. This is not true; the only thing a formal language does is separate the “good”
from the bad strings. Thus we simply model a formal language as a set of strings: the “good”
strings are members, and the “bad” ones are not.
Of course this definition only shifts complexity to the way we construct specific formal languages
(where it actually belongs), and we have learned two (simple) ways of constructing them: by
repetition of characters, and by concatenation of existing languages. As mentioned above,
the purpose of a formal language is to distinguish “good” from “bad” strings. It is maximally
general, but not helpful, since it does not support computation and inference. In practice we
will be interested in formal languages that have some structure, so that we can represent formal
languages in a finite manner (recall that a formal language is a subset of A∗ , which may be infinite
and even undecidable – even though the alphabet A is finite).
To remedy this, we will now introduce phrase structure grammars (or just grammars), the stan-
dard tool for describing structured formal languages.
We fortify our intuition about these – admittedly very abstract – constructions by an example
and introduce some more vocabulary.
S → NP Vi
NP → Article N
Article → the | a | an
N → dog | teacher | . . .
Vi → sleeps | smells | . . .
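As a side remark connecting this to the Prolog used earlier in these notes (my own sketch, not part of the original example): the same grammar can be written as a Prolog definite clause grammar (DCG) and queried with phrase/2.
s --> np, vi.
np --> article, n.
article --> [the] ; [a] ; [an].
n --> [dog] ; [teacher].
vi --> [sleeps] ; [smells].
% ?- phrase(s, [the, teacher, sleeps]).   succeeds (true)
% ?- phrase(s, [teacher, the, sleeps]).   fails (false)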
Now we look at just how a grammar helps in analyzing formal languages. The basic idea is that
a grammar accepts a word, iff the start symbol can be rewritten into it using only the rules of the
grammar.
A →G B →G C →G D
S →G2 aSb →G2 aaSbb →G2 aaaSbbb →G2 aaaaSbbbb →G2 aaaabbbb
S →G NP Vi →G Article N Vi →G Article teacher Vi
2. The teacher sleeps is a sentence:
S →∗G Article teacher Vi →G the teacher Vi →G the teacher sleeps
(using the grammar from above: S → NP Vi, NP → Article N, Article → the | a | an | . . . ,
N → dog | teacher | . . . , Vi → sleeps | smells | . . . )
Note that this process indeed defines a formal language given a grammar, but does not provide
an efficient algorithm for parsing, even for the simpler kinds of grammars we introduce below.
Observation: The shape of the grammar determines the “size” of its language.
Definition 4.2.26. We call a grammar:
1. context-sensitive (or type 1), if the bodies of production rules have no less symbols
than the heads,
2. context-free (or type 2), if the heads have exactly one symbol,
3. regular (or type 3), if additionally the bodies are empty or consist of a nonterminal,
optionally followed by a terminal symbol.
By extension, a formal language L is called context-sensitive/context-free/regular
(or type 1/type 2/type 3 respectively), iff it is the language of a respective grammar.
Context-free grammars are sometimes called CFGs and context-free languages CFLs.
Example 4.2.27 (Context-sensitive). The language {a[n]b[n]c[n]} is accepted by
S → abc | A
A → aABc | abc
cB → Bc
bB → bb
While the presentation of grammars from above is sufficient in theory, in practice the various
grammar rules are difficult and inconvenient to write down. Therefore computer science – where
grammars are important to e.g. specify parts of compilers – has developed extensions – notations
that can be expressed in terms of the original grammar rules – that make grammars more readable
(and writable) for humans. We introduce an important set now.
We will now build on the notion of BNF grammar notations and introduce a way of writing
down the (short) grammars we need in AI-1 that gives us even more of an overview over what is
happening.
In AI-1 we will only use context-free grammars (simpler, but problem still applies)
in AI-1: I will try to give “grammar overviews” that combine those, e.g. the
grammar of first-order logic.
variables            X ∈ V₁
function constants   f^k ∈ Σ^f_k
predicate constants  p^k ∈ Σ^p_k
terms      t ::= X                       variable
                | f⁰                      constant
                | f^k(t₁, . . ., t_k)     application
formulae   A ::= p^k(t₁, . . ., t_k)      atomic
                | ¬A                      negation
                | A₁ ∧ A₂                 conjunction
                | ∀X.A                    quantifier
We will generally get by with context-free grammars, which have highly efficient parsing
algorithms, for the formal languages we use in this course, but we will not cover the algorithms in
AI-1.
Mathematical Structures
Observation: Mathematicians often cast classes of complex objects as mathemat-
ical structures.
Note that the idea of mathematical structures has been picked up by most programming lan-
guages in various ways and you should therefore be quite familiar with it once you realize the
parallelism.
Even if the idea of mathematical structures may be familiar from programming, it may be quite
intimidating to some students in the mathematical notation we will use in this course. Therefore
we will – when we get around to it – use a special overview notation in AI-1. We introduce it below.
Example 4.3.5.
grammar = ⟨ N : Set (nonterminal symbols),
            Σ : Set (terminal symbols),
            P : {h → b | . . . } (production rules),
            S : N (start symbol) ⟩
production rule h → b = ⟨ h : (Σ ∪ N)∗, N, (Σ ∪ N)∗ (head),
                          b : (Σ ∪ N)∗ (body) ⟩
Read the first line “N : Set (nonterminal symbols)” in the structure above as “N is in
an (unspecified) set and is a nonterminal symbol”.
Here – and in the future – we will use Set for the class of sets; “N is a set”.
In this chapter, we introduce a framework that gives a comprehensive conceptual model for the
multitude of methods and algorithms we cover in this course. The framework of rational agents
accommodates two traditions of AI.
Initially, the focus of AI research was on symbolic methods concentrating on the mental processes
of problem solving, starting from Newell/Simon’s “physical symbol hypothesis”:
A physical symbol system has the necessary and sufficient means for general intelligent action.
[NS76]
Here a symbol is a representation of an idea, object, or relationship that is physically manifested in
(the brain of) an intelligent agent (human or artificial).
Later – in the 1980s – the proponents of embodied AI posited that most features of cognition,
whether human or otherwise, are shaped – or at least critically influenced – by aspects of the
entire body of the organism. The aspects of the body include the motor system, the perceptual
system, bodily interactions with the environment (situatedness) and the assumptions about the
world that are built into the structure of the organism. They argue that symbols are not always
necessary since
The world is its own best model. It is always exactly up to date. It always has every detail
there is to be known. The trick is to sense it appropriately and often enough. [Bro90]
The framework of rational agents – initially introduced by Russell and Wefald in [RW91] – ac-
commodates both: it situates agents with percepts and actions in an environment, but does not
preclude physical symbol systems (i.e. systems that manipulate symbols) as agent functions. Rus-
sell and Norvig make it the central metaphor of their book “Artificial Intelligence – A modern
approach” [RN03], which we follow in this course.
Thinking humanly: “The exciting new effort to make computers think . . . machines with
human-like minds” [Hau85]
Thinking rationally: “The formalization of mental faculties in terms of computational
models” [CM85]
Acting humanly: “The art of creating machines that perform actions requiring intelligence
when performed by people” [Kur90]
Acting rationally: “The branch of CS concerned with the automation of appropriate
behavior in complex situations” [LS93]
≙ building pigeons that can fly so much like real pigeons that they can fool pigeons
Not reproducible, not amenable to mathematical analysis
Thinking Humanly: ↝ Cognitive Science.
We now discuss all of the four facets in a bit more detail, as they all either contribute directly
to our discussion of AI methods or characterize neighboring disciplines.
It was predicted that by 2000, a machine might have a 30% chance of fooling a lay
person for 5 minutes.
Note: In [Tur50], Alan Turing proposed what is now called the Turing test and made the prediction above.
Acting Rationally
Idea: Rational behavior ≙ doing the right thing!
Definition 5.1.4. Rational behavior consists of always doing what is expected to
maximize goal achievement given the available information.
Rational behavior does not necessarily involve thinking e.g., blinking reflex — but
thinking should be in the service of rational action.
Aristotle: Every art and every inquiry, and similarly every action and pursuit, is
thought to aim at some good. (Nicomachean Ethics)
One possible objection to this is that the agent and the environment are conceptualized as separate
entities; in particular, that the image suggests that the agent itself is not part of the environment.
Indeed that is intended, since it makes thinking about agents and environments easier and is of
little consequence in practice. In particular, the offending separation is relatively easily fixed if
needed.
Let us now try to express the agent/environment ideas introduced above in mathematical language
to add the precision we need to start the process towards the implementation of rational agents.
Definition 5.2.6. The agent function f_a of an agent a maps from percept histories
to actions:
f_a : P∗ → A
We assume that agents can always perceive their own actions. (but not necessarily
their consequences)
Problem: Agent functions can become very big and may be uncomputable.
(theoretical tool only)
Here we already see a problem that will recur often in this course: The mathematical formulation
gives us an abstract specification of what we want (here the agent function), but not directly a
way of how to obtain it. Here, the solution is to choose a computational model for agents (an
agent architecture) and see how the agent function can be implemented in an agent program.
[Figure 2.1 and accompanying text from [RN03], Chapter 2: Agents interact with environments
through sensors (receiving percepts) and actuators (performing actions); different agents differ on
the contents of the “white box” in the center. Mathematically speaking, an agent's behavior is
described by the agent function that maps any given percept sequence to an action. We can imagine
tabulating the agent function that describes any given agent; for most agents, this would be a very
large table – infinite, in fact, unless we place a bound on the length of percept sequences we want
to consider. Given an agent to experiment with, we can, in principle, construct this table by trying
out all possible percept sequences and recording which actions the agent does in response. The
table is an external characterization of the agent; internally, the agent function for an artificial
agent will be implemented by an agent program. The agent function is an abstract mathematical
description; the agent program is a concrete implementation, running within some physical system.
The book illustrates these ideas with a very simple example – the vacuum-cleaner world with just
two locations, squares A and B. The vacuum agent perceives which square it is in and whether
there is dirt in the square. It can choose to move left, move right, suck up the dirt, or do nothing.
One very simple agent function: if the current square is dirty, then suck; otherwise, move to the
other square.]

Let us fortify our intuition about all of this with an example, which we will use often in the course
of AI-1.

Example: Vacuum-Cleaner World and Agent
The first implementation idea inspired by the table on the last slide would be a simple table lookup algorithm.
Table-Driven Agents
Idea: We can just implement the agent function as a lookup table and lookup
actions.
We can directly implement this:
  function Table-Driven-Agent(percept) returns an action
    persistent table /* a table of actions indexed by percept sequences */
    var percepts     /* a sequence, initially empty */
    append percept to the end of percepts
    action := lookup(percepts, table)
    return action
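To make the lookup idea concrete, here is a minimal Python sketch of a table-driven agent for the two-square vacuum world; the percept encoding and the (tiny) excerpt of the table are illustrative assumptions, not part of the course material.

  # Minimal sketch of a table-driven agent (assumption: percepts are (location, status) pairs).
  from typing import Dict, List, Optional, Tuple

  Percept = Tuple[str, str]  # e.g. ("A", "Dirty")

  # Hypothetical excerpt of a percept-sequence-indexed action table for the vacuum world.
  TABLE: Dict[Tuple[Percept, ...], str] = {
      (("A", "Clean"),): "Right",
      (("A", "Dirty"),): "Suck",
      (("B", "Clean"),): "Left",
      (("B", "Dirty"),): "Suck",
      (("A", "Dirty"), ("A", "Clean")): "Right",
      (("A", "Clean"), ("B", "Clean")): "Left",
      # ... the full table is huge: one entry per percept sequence.
  }

  class TableDrivenAgent:
      def __init__(self, table: Dict[Tuple[Percept, ...], str]):
          self.table = table
          self.percepts: List[Percept] = []          # persistent percept sequence

      def __call__(self, percept: Percept) -> Optional[str]:
          self.percepts.append(percept)              # append percept to the sequence
          return self.table.get(tuple(self.percepts))  # lookup(percepts, table)

  agent = TableDrivenAgent(TABLE)
  print(agent(("A", "Dirty")))   # -> "Suck"
  print(agent(("A", "Clean")))   # -> "Right"

The sketch also illustrates the problem stated above: the table grows with every percept, so only the shortest percept sequences can realistically be covered.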
Rationality
Idea: Try to design agents that are successful! (aka. “do the right thing”)
Let us see what consequences the observation has that we only need to maximize the expected value of the performance measure, not its actual value.
For the design of an agent for a specific task – i.e. to choose an agent architecture and design an agent program – we have to take into account the performance measure, the environment, and the characteristics of the agent itself, in particular its actions and sensors.
The PEAS criteria are essentially a laundry list of what an agent design task description should
include.
Agents
Which are agents?
(A) James Bond.
(B) Your dog.
(C) Vacuum cleaner.
(D) Thermometer.
Environment types
Observation 5.4.1. Agent design is largely determined by the type of environment
it is intended for.
1. fully observable, iff the agent's sensors give it access to the complete state of the environment at any point in time, else partially observable.
Note: Take the example above with a grain of salt. There are often multiple
interpretations that yield different classifications and different agents. (agent
designer’s choice)
Example 5.4.4. Seen as a multi-agent game, chess is deterministic; seen as a single-agent game, it is stochastic.
Observation 5.4.5. The real world is (of course) a partially observable, stochastic,
sequential, dynamic, continuous, and multi-agent environment. (worst case for AI)
Preview: We will concentrate on the “easy” environment types (fully observable, deterministic, episodic, static, and single-agent) in AI-1 and extend them to “real-world”-compatible ones in AI-2.
In the AI-1 course we will work our way from the simpler environment types to the more general ones. Each environment type will need its own agent types, specialized to surviving and doing well in it.
We will now discuss the basic agent types in turn: we start with simple reflex agents, add internal state, goals, and utility, and finally add learning. A Video Nugget covering this section can be found at https://fau.tv/clip/id/21926.
Agent Types
Observation: So far we have described (and analyzed) agents only by their behavior (cf. agent function f : P* → A).
Problem: This does not help us to build agents. (the goal of AI)
To build an agent, we need to fix an agent architecture and come up with an agent
program that runs on it.
Preview: Four basic types of agent architectures in order of increasing generality:
1. simple reflex agents
2. model-based agents
3. goal-based agents
4. utility-based agents
All these can be turned into learning agents.
Simple Reflex Agents

Agent Schema:
[Figure 2.10 (AIMA): A simple reflex agent. It acts according to a rule whose condition matches the current state, as defined by the percept. Rectangles denote the current internal state of the agent's decision process, ovals the background information used in the process.]
In the corresponding agent program, an INTERPRET-INPUT function generates an abstracted description of the current state from the percept, and a RULE-MATCH function returns the first rule in the set of rules that matches this state description.
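A small Python sketch of such a simple reflex agent program for the vacuum world; the rule set and the state-description format are illustrative assumptions.

  # Sketch of a simple reflex agent: condition-action rules applied to the current percept only.
  from typing import Tuple

  Percept = Tuple[str, str]  # (location, status), e.g. ("A", "Dirty")

  RULES = [
      # (condition on the abstracted state, action)
      (lambda state: state["status"] == "Dirty", "Suck"),
      (lambda state: state["location"] == "A",   "Right"),
      (lambda state: state["location"] == "B",   "Left"),
  ]

  def interpret_input(percept: Percept) -> dict:
      """Generate an abstracted state description from the current percept."""
      location, status = percept
      return {"location": location, "status": status}

  def rule_match(state: dict, rules) -> str:
      """Return the action of the first rule whose condition matches the state."""
      for condition, action in rules:
          if condition(state):
              return action
      return "NoOp"

  def simple_reflex_vacuum_agent(percept: Percept) -> str:
      return rule_match(interpret_input(percept), RULES)

  print(simple_reflex_vacuum_agent(("A", "Dirty")))  # -> "Suck"
  print(simple_reflex_vacuum_agent(("B", "Clean")))  # -> "Left"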
Problem: Simple reflex agents can only react to the perceived state of the envi-
ronment, not to changes.
Example 5.5.3. Automobile tail lights signal braking by brightening. A simple reflex agent would have to compare subsequent percepts to realize this.
Problem: Partially observable environments get simple reflex agents into trouble.
Example 5.5.4. A vacuum cleaner robot with a defective location sensor ⇝ infinite loops.
Model-based Reflex Agents

Agent Schema:
[Figure 2.12 (AIMA): A model-based reflex agent. It keeps track of the current state of the world using an internal model (“how the world evolves”, “what my actions do”) and then chooses an action in the same way as the reflex agent.]
Definition 5.5.5. A model-based agent is an agent whose agent program maintains a world model via
a sensor model S that, given a state s and a percept p, determines a new state S(s, p),
a transition model T that predicts a new state T(s, a) from a state s and an action a, and
an action function f that maps (new) states to actions.
If the world model of a model-based agent A is in state s and A has taken action a after perceiving p, then A transitions to state s′ = T(S(s, p), a) and takes action a′ = f(s′).
Note: As different percept sequences lead to different states, the agent function f_a : P* → A no longer depends only on the last percept.
Example 5.5.6 (Tail Lights Again). Model-based agents can handle Example 5.5.3 if the states include a concept of tail light brightness.
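A minimal Python sketch of a model-based agent along the lines of the definition above; the concrete sensor model, transition model, and action function (a tail-light-watching car) are illustrative assumptions.

  # Sketch of a model-based agent: s' = T(S(s, p), a), a' = f(s').
  class ModelBasedAgent:
      def __init__(self, sensor_model, transition_model, action_fn, initial_state):
          self.S = sensor_model        # S(state, percept) -> updated state
          self.T = transition_model    # T(state, action)  -> predicted state
          self.f = action_fn           # f(state)          -> action
          self.state = initial_state
          self.last_action = None

      def __call__(self, percept):
          s = self.S(self.state, percept)          # fold in the new percept
          if self.last_action is not None:
              s = self.T(s, self.last_action)      # account for the action we just took
          self.state = s
          self.last_action = self.f(s)
          return self.last_action

  # Toy instantiation (assumed): track the brightness of the car ahead's tail lights.
  agent = ModelBasedAgent(
      sensor_model=lambda s, p: {"prev": s["brightness"], "brightness": p},
      transition_model=lambda s, a: s,              # our actions don't change the lights
      action_fn=lambda s: "brake" if s["brightness"] > s["prev"] else "keep_speed",
      initial_state={"brightness": 0, "prev": 0},
  )
  print(agent(3))   # lights brighten -> "brake"
  print(agent(3))   # unchanged      -> "keep_speed"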
Problem: Having a world model does not always determine what to do (rationally).
Example 5.5.8. Coming to an intersection, where the agent has to decide between
going left and right.
Goal-based Agents
Problem: A world model does not always determine what to do (rationally).
Observation: Having a goal in mind does! (determines future actions)
Agent Schema:
[Figure 2.13 (AIMA): A model-based, goal-based agent. It keeps track of the world state as well as a set of goals it is trying to achieve, and chooses an action that will (eventually) lead to the achievement of its goals.]
Goal-based Agents (continued)

Definition 5.5.9. A goal-based agent is a model-based agent with transition model T that deliberates actions based on goals and a world model: it employs a set G of goals and a goal function f that, given a (new) state s′, selects an action a to best reach G.
The action function is then s ↦ f(T(s), G).
Observation: A goal-based agent is more flexible in the knowledge it can utilize.
Example 5.5.10. A goal-based agent can easily be changed to go to a new destination, while a model-based agent's rules make it go to exactly one destination.

[RN03] motivates goal-based agents as follows: Knowing something about the current state of the environment is not always enough to decide what to do. For example, at a road junction, a taxi can turn left, turn right, or go straight on; the correct decision depends on where the taxi is trying to get to. In other words, as well as a current state description, the agent needs some sort of goal information that describes situations that are desirable, e.g. being at the passenger's destination. (The taxi may, for instance, be driving back home, with a rule to fill up with gas on the way home unless it has at least half a tank; “driving back home” is an aspect of the agent's internal state rather than the world state – the same taxi in the same place at the same time could be intending a different destination.) The agent program can combine the goal information with the model to choose actions that achieve the goal. Sometimes goal-based action selection is straightforward, when goal satisfaction results immediately from a single action; sometimes it is more tricky, when the agent has to consider long sequences of twists and turns to find a way to achieve the goal – search and planning are the subfields of AI devoted to finding such action sequences. Decision making of this kind is fundamentally different from the condition-action rules described earlier, in that it involves consideration of the future: both “What will happen if I do such-and-such?” and “Will that make me happy?”. In the reflex agent designs, this information is not explicitly represented, because the built-in rules map directly from percepts to actions.

Utility-based Agents

Definition 5.5.11. A utility-based agent uses a world model along with a utility function that models its preferences among the states of that world. It chooses the action that leads to the best expected utility.
Agent Schema:
[Figure 2.14 (AIMA): A model-based, utility-based agent. It uses a model of the world, along with a utility function that measures its preferences among states of the world. It then chooses the action that leads to the best expected utility, where expected utility is computed by averaging over all possible outcome states, weighted by the probability of the outcome.]
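To make “best expected utility” concrete, here is a tiny Python sketch of the action-selection step of a utility-based agent; the outcome model P(o | s, a) and the utility values are illustrative assumptions.

  # Sketch: choose the action with the highest expected utility
  # EU(a) = sum over outcomes o of P(o | s, a) * U(o).
  def expected_utility(action, state, outcome_dist, utility):
      return sum(p * utility(o) for o, p in outcome_dist(state, action))

  def choose_action(state, actions, outcome_dist, utility):
      return max(actions, key=lambda a: expected_utility(a, state, outcome_dist, utility))

  # Toy model (assumed): a risky shortcut vs. a safe detour.
  def outcome_dist(state, action):
      if action == "shortcut":
          return [("arrive_early", 0.6), ("stuck_in_mud", 0.4)]
      return [("arrive_on_time", 1.0)]

  utility = {"arrive_early": 10, "arrive_on_time": 7, "stuck_in_mud": -20}.get

  print(choose_action("start", ["shortcut", "detour"], outcome_dist, utility))  # -> "detour"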
Learning Agents
Agent Schema:
[Learning agent schema (following [RN03]): the performance element (the agent proper) is complemented by a critic, which compares the agent's behavior against a performance standard and provides feedback; a learning element, which uses this feedback to make changes to the performance element's knowledge; and a problem generator, which proposes exploratory actions according to the learning goals.]
Solver specific to a particular problem (“domain”)   vs.   Solver based on a description in a general problem-description language (e.g., the rules of any board game).
More efficient.   vs.   Much less design/maintenance work.
Also: The additional internal structure will make the algorithms more complex.
Example 5.6.2. Consider the problem of finding a driving route from one end of
a country to the other via some sequence of cities.
In an atomic representation the state is represented by the name of a city.
In a factored representation we may have attributes “gps-location”, “gas”,. . .
(allows information sharing between states and uncertainty)
But how do we represent a situation where a large truck is blocking the road, because it is trying to back into a driveway, but a loose cow is blocking its path? (An attribute “TruckAheadBackingIntoDairyFarmDrivewayBlockedByLooseCow” is unlikely.)
In a structured representation, we can have objects for trucks, cows, etc. and
their relationships. (at “run-time”)
Note: The set of states in atomic representations and attributes in factored ones is determined
at design time, while the objects and their relationships in structured ones are discovered at
“runtime”.
Here – as always when we evaluate representations – the crucial aspect to look out for is the identity conditions: when do we consider two representations equal, and when can we (or, more crucially, algorithms) distinguish them?
For instance, factored representations consider two world representations equal iff the values of all attributes – which are determined at agent design time and thus immutable by the agent – are equal. So the agent designer has to make sure to add all the attributes to the chosen representation that are necessary to distinguish environments that the agent program needs to treat differently.
It is tempting to think that the situation with atomic representations is easier, since we can “simply” add enough states for the necessary distinctions, but in practice this set of states may have to be infinite, while in factored or structured representations we can keep representations finite.
Summary
Agents interact with environments through actuators and sensors.
The agent function describes what the agent does in all circumstances.
The performance measure evaluates the environment sequence.
A perfectly rational agent maximizes expected performance.
We have discussed several dimensions of agent design:
agent architectures,
corresponding agent programs and algorithms, and
world representation paradigms.
Problem: Which one is the best?
Answer: That really depends on the environment type they have to survive/thrive
in! The agent designer – i.e. you – has to choose!
The course gives you the necessary competencies.
Consequence: The rational agents paradigm used in this course challenges you
to become a good agent designer.
This part introduces search-based methods for general problem solving using atomic and factored
representations of states.
Concretely, we discuss the basic techniques of search-based symbolic AI: first in the shape of the classical, heuristic, and adversarial search paradigms, and then in constraint propagation, where we see the first instances of inference-based methods.
Chapter 6

Problem Solving and Search

In this chapter, we will look at a class of algorithms called search algorithms. These are algorithms that help in quite general situations where there is a precisely described problem that needs to be solved. Hence the name “General Problem Solving” for the area.
We will use the following problem as a running example. It is simple enough to fit on one slide
and complex enough to show the relevant features of the problem solving algorithms we want to
talk about.
[Figure (cf. [RN03]): a simplified road map of part of Romania, with driving distances between adjacent cities – our running example for problem solving and search.]
it also limits the objectives by specifying goal states. (This excludes, e.g., staying another couple of weeks.)
A solution is a sequence of actions that leads from the initial state to a goal state.
Problem solving computes solutions from problem formulations.
Finding the right level of abstraction and the required (not more!) information is
often the key to success.
Definition 6.1.7. The graph ⟨S, T_A⟩ is called the state space induced by Π.
Definition 6.1.8. A solution for Π consists of a sequence a_1, ..., a_n of actions such that for all 1 < i ≤ n
Observation: The formulation of problems from ?? uses an atomic (black-box) state represen-
tation. It has enough functionality to construct the state space but nothing else. We will come
back to this in slide ??.
Remark 6.1.10. Note that search problems formalize problem formulations by making many of
the implicit constraints explicit.
search problem = ⟨S, A, T, I, G⟩, where S is a set of states, A a set of actions, T : A×S → P(S) the transition model, I ∈ S the initial state, and G ∈ P(S) the set of goal states.
We will now specialize ?? to deterministic, fully observable environments, i.e. environments where
actions only have one – assured – outcome state.
Definition 6.1.13. The predicate that tests for goal states is called a goal test.
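As a concrete (illustrative) data structure, the components ⟨S, A, T, I, G⟩ of a deterministic search problem and the goal test can be bundled as follows; the encoding of actions and the small Romania fragment are assumptions of the sketch, the distances are the road distances from the map above.

  # Sketch: a search problem as an explicit data structure (cf. the tuple <S, A, T, I, G>).
  from dataclasses import dataclass
  from typing import Callable, Dict, FrozenSet, List, Tuple

  State = str
  Action = Tuple[str, str]          # here: an action is "drive from X to Y"

  @dataclass(frozen=True)
  class SearchProblem:
      initial: State                               # I
      goals: FrozenSet[State]                      # G
      actions: Callable[[State], List[Action]]     # applicable actions in a state
      result: Callable[[State, Action], State]     # deterministic transition model
      step_cost: Callable[[State, Action], float]

      def goal_test(self, s: State) -> bool:       # the goal test of Definition 6.1.13
          return s in self.goals

  # A tiny fragment of the Romania example (road distances as on the map above).
  ROADS: Dict[str, Dict[str, int]] = {
      "Arad": {"Sibiu": 140, "Timisoara": 118, "Zerind": 75},
      "Sibiu": {"Fagaras": 99, "Rimnicu Vilcea": 80, "Arad": 140},
      "Fagaras": {"Bucharest": 211, "Sibiu": 99},
      "Rimnicu Vilcea": {"Pitesti": 97, "Sibiu": 80},
      "Pitesti": {"Bucharest": 101, "Rimnicu Vilcea": 97},
  }

  romania = SearchProblem(
      initial="Arad",
      goals=frozenset({"Bucharest"}),
      actions=lambda s: [(s, t) for t in ROADS.get(s, {})],
      result=lambda s, a: a[1],
      step_cost=lambda s, a: ROADS[a[0]][a[1]],
  )
  print(romania.goal_test("Bucharest"))   # -> True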
Problem types
Definition 6.2.1. A search problem is called a single state problem, iff it is
fully observable (at least the initial state)
deterministic (unique successor states)
static (states do not change other than by our own actions)
discrete (a countable number of states)
Definition 6.2.2. A search problem is called a multi-state problem, iff the states are only partially observable (e.g. multiple initial states), but it is deterministic, static, and discrete.
We will explain these problem types with another example. The problem P is very simple: We
have a vacuum cleaner and two rooms. The vacuum cleaner is in one room at a time. The floor
can be dirty or clean.
The possible states are determined by the position of the vacuum cleaner and the information,
whether each room is dirty or not. Obviously, there are eight states: S = {1, 2, 3, 4, 5, 6, 7, 8} for
simplicity.
The goal is to have both rooms clean; the vacuum cleaner can be anywhere. So the set G of goal states is {7, 8}. In the single-state version of the problem, [right, suck] is the shortest solution, but [suck, right, suck] is also one. In the multiple-state version we have, e.g.:
left → {3, 7}, suck → {7}, ...
[Figure 3.3 (AIMA): The state space for the vacuum world. Links denote actions: L = Left, R = Right, S = Suck.]
The single-state problem formulation (cf. [RN03]):
• Initial state: Any state can be designated as the initial state.
• Actions: In this simple environment, each state has just three actions: Left, Right, and Suck.
• Transition model: The actions have their expected effects, except that moving Left in the leftmost square, moving Right in the rightmost square, and Sucking in a clean square have no effect.
• Goal test: This checks whether all the squares are clean.
• Path cost: Each step costs 1, so the path cost is the number of steps in the path.
Compared with the real world, this toy problem has discrete locations, discrete dirt, reliable cleaning, and it never gets any dirtier.

Example: Vacuum-Cleaner World (continued)

Contingency Problem:
(The outcome of suck is uncertain – it may fail to clean – and sensing is local: dirt at the current location only.)
Solution: suck → {5, 7}, right → {6, 8}, suck → ... (recall that a vacuum world with n locations has n · 2^n states)
[Figure 3.3 (AIMA) again: the state space for the vacuum world.]
Of course, local sensing can help: it narrows {6, 8} down to {6} or {8}; if we are in the first, then suck.
“Path cost”: There may be more than one solution and we might want to have the “best” one in
a certain sense.
“State”: e.g., we don’t care about tourist attractions found in the cities along the way. But this is
problem dependent. In a different problem it may well be appropriate to include such information
in the notion of state.
“Realizability”: one could also say that the abstraction must be sound wrt. reality.
Example: The 8-puzzle
[Figure (cf. [RN03], Section 3.2): start state 7 2 4 / 5 _ 6 / 8 3 1 (left) and goal state _ 1 2 / 3 4 5 / 6 7 8 (right).]
States? Actions? ...
States: real-valued coordinates of robot joint angles and of the parts of the object to be assembled
Actions: continuous motions of robot joints
Goal test: assembly complete?
Path cost: time to execute
General Problems
Question: Which are “Problems”?
6.3 Search
A Video Nugget covering this section can be found at https://fau.tv/clip/id/21956.
[Tree search example: the search tree for the Romania problem is unfolded step by step, starting from Arad.]
Let us now think a bit more about the implementation of tree search algorithms based on the ideas discussed above. The abstract, mathematical notions of a search problem and the induced tree search algorithm get further refined here.
[Figure 3.10 (AIMA): Nodes are the data structures from which the search tree is constructed. Each has a parent, a state, and various bookkeeping fields. Arrows point from child to parent.]
Observation: A set of search tree nodes that can all (recursively) reach a single initial node forms a search tree.
Observation: Paths in the search tree correspond to paths in the state space.
Definition 6.3.3. We define the path cost of a node n in a search tree T to be the sum of the step costs on the path from n to the root of T.
Observation: As a search tree node has access to its parent, we can read off the solution from a goal node.
Given the components of a parent node, it is easy to compute those of a child node. The function CHILD-NODE takes a parent node and an action and returns the resulting child node:

  function CHILD-NODE(problem, parent, action) returns a node
    return a node with
      STATE = problem.RESULT(parent.STATE, action),
      PARENT = parent, ACTION = action,
      PATH-COST = parent.PATH-COST + problem.STEP-COST(parent.STATE, action)

The PARENT pointers string the nodes together into a tree structure and allow the solution path to be extracted when a goal node is found: the SOLUTION function returns the sequence of actions obtained by following parent pointers back to the root. Note that nodes are distinct from states: a node is a bookkeeping data structure used to represent the search tree, while a state corresponds to a configuration of the world. Thus nodes lie on particular paths, as defined by the PARENT pointers, whereas states do not; two different nodes can contain the same world state if that state is generated via two different search paths. The frontier of nodes not yet expanded is stored in a queue with operations EMPTY?(queue), POP(queue), and INSERT(element, queue), so that the search algorithm can easily choose the next node to expand according to its preferred strategy. (cf. [RN03])
It is very important to understand the fundamental difference between a state in a search problem, a node in the search tree employed by the tree search algorithm, and its implementation as a search tree node. The implementation above is faithful in the sense that the implemented data structures contain all the information needed in the tree search algorithm.
So we can use it to refine the idea of a tree search algorithm into an implementation.
The fringe is the set of search tree nodes not yet expanded in tree search.
Idea: We treat the fringe as an abstract data type with three accessors:
the binary function first, which retrieves an element from the fringe according to a strategy,
the binary function insert, which adds a (set of) search tree nodes into a fringe, and
the unary predicate empty, which determines whether a fringe is the empty set.
The strategy determines the behavior of the fringe (data structure). (see below)
Search strategies
Definition 6.3.5. A strategy is a function that picks a node from the fringe of a
search tree. (equivalently, orders the fringe and picks the first.)
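The following Python sketch puts the pieces together: search tree nodes as above, and a generic tree search whose behavior is determined entirely by the fringe discipline (shallowest first ⇝ breadth-first, deepest first ⇝ depth-first, least path cost ⇝ uniform-cost). It assumes the SearchProblem encoding sketched earlier and is an illustration, not the course's reference implementation.

  # Generic tree search: only the fringe ordering (the strategy) differs between algorithms.
  import heapq
  from itertools import count

  class Node:
      def __init__(self, state, parent=None, action=None, path_cost=0.0):
          self.state, self.parent, self.action, self.path_cost = state, parent, action, path_cost
          self.depth = 0 if parent is None else parent.depth + 1

      def solution(self):
          """Follow parent pointers back to the root and return the action sequence."""
          actions, n = [], self
          while n.parent is not None:
              actions.append(n.action)
              n = n.parent
          return list(reversed(actions))

  def child_node(problem, parent, action):
      state = problem.result(parent.state, action)
      return Node(state, parent, action, parent.path_cost + problem.step_cost(parent.state, action))

  def tree_search(problem, priority):
      """priority(node) orders the fringe; ties are broken by insertion order."""
      root, tick = Node(problem.initial), count()
      fringe = [(priority(root), next(tick), root)]
      while fringe:                                    # not empty(fringe)
          _, _, node = heapq.heappop(fringe)           # first(fringe): node with minimal priority
          if problem.goal_test(node.state):
              return node.solution()
          for action in problem.actions(node.state):   # expand node, insert its children
              child = child_node(problem, node, action)
              heapq.heappush(fringe, (priority(child), next(tick), child))
      return None

  bfs = lambda n: n.depth          # shallowest node first (breadth-first)
  dfs = lambda n: -n.depth         # deepest node first (depth-first)
  ucs = lambda n: n.path_cost      # least accumulated path cost first (uniform-cost)

  # with the `romania` problem from the earlier sketch:
  # tree_search(romania, ucs)  ->  the cheapest route from Arad to Bucharest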
Note that there can be infinite branches, see the search tree for Romania.
The opposite of uninformed search is informed or heuristic search that uses a heuristic function
that adds external guidance to the search process. In the Romania example, one could add the
heuristic to prefer cities that lie in the general direction of the goal (here SE).
Even though heuristic search is usually much more efficient, uninformed search is important nonetheless, because many problems do not allow us to extract good heuristics.
Breadth-First Search
Idea: Expand the shallowest unexpanded node.
Definition 6.4.2. The breadth first search (BFS) strategy treats the fringe as a
FIFO queue, i.e. successors go in at the end of the fringe.
Example 6.4.3 (Synthetic).
[Breadth-first expansion of a synthetic binary tree with root A, children B and C, grandchildren D–G, and leaves H–O, shown step by step: the fringe is expanded level by level.]
We will now apply the breadth-first search strategy to our running example: traveling in Romania. Note that we leave out the green dashed nodes that would give a preview of what the search tree will look like (if expanded). This gives a much cleaner picture; we assume that the readers have already grasped the mechanism sufficiently.
[Breadth-first search tree for the Romania example, starting from Arad and expanding level by level.]
An alternative is to generate all solutions and then pick an optimal one. This works only if m is finite.
The next idea is to let cost drive the search. For this, we will need a non-trivial cost function: we
will take the distance between cities, since this is very natural. Alternatives would be the driving
time, train ticket cost, or the number of tourist attractions along the way.
Of course we need to update our problem formulation with the necessary information.
[Map of Romania with road distances (repeated for reference).]
Uniform-cost search

Idea: Expand the least-cost unexpanded node.
Definition 6.4.5. Uniform-cost search (UCS) is the strategy where the fringe is ordered by increasing path cost.
Note: Equivalent to breadth-first search if all step costs are equal.
Recall (cf. [RN03]) that the path cost of a path is the sum of the step costs c(s, a, s′) of the individual actions along it, and that an optimal solution has the lowest path cost among all solutions.
Synthetic Example:
[Uniform-cost search tree for the Romania example starting in Arad: the root expands to Sibiu (140), Timisoara (118), and Zerind (75); at each step the fringe node with the least accumulated path cost is expanded next.]
Note that we must sum the distances to each leaf. That is, we go back to the first level after the
third step.
If step cost is negative, the same situation as in breadth first search can occur: later solutions may
be cheaper than the current one.
If step cost is 0, one can run into infinite branches. UCS then degenerates into depth first
search, the next kind of search algorithm we will encounter. Even if we have infinite branches,
where the sum of step costs converges, we can get into trouble, since the search is forced down
these infinite paths before a solution can be found.
Worst case is often worse than BFS, because large trees with small steps tend to be searched
first. If step costs are uniform, it degenerates to BFS.
Depth-first Search
Idea: Expand deepest unexpanded node.
Definition 6.4.6. Depth-first search (DFS) is the strategy where the fringe is organized as a (LIFO) stack, i.e. successors go in at the front of the fringe.
Definition 6.4.7. Every node that is pushed to the stack is called a backtrack
point. The action of popping a non-goal node from the stack and continuing the
search with the new top element of the stack (a backtrack point by construction)
is called backtracking, and correspondingly the DFS algorithm backtracking search.
Depth-First Search
Example 6.4.8 (Synthetic).
[Depth-first expansion of the same synthetic tree (root A, children B and C, leaves H–O), shown step by step: the deepest unexpanded node is expanded next, and the search backtracks at leaves.]
Definition 6.4.11. Iterative deepening search (IDS) is depth limited search with
ever increasing depth limits. We call the difference between successive depth limits
the step size.
[Iterative deepening search on the synthetic tree: the tree is re-explored with depth limits 0, 1, 2, ...]
Properties of iterative deepening search:
Completeness: Yes
Time complexity: (d+1)·b^0 + d·b^1 + (d−1)·b^2 + ... + b^d ∈ O(b^(d+1))
Space complexity: O(b·d)
Optimality: Yes (if step cost = 1)
Consequence: IDS used in practice for search spaces of large, infinite, or unknown
depth.
Note: To find a solution (at depth d) we have to search the whole tree up to d. Of course since
we do not save the search state, we have to re-compute the upper part of the tree for the next
level. This seems like a great waste of resources at first, however, IDS tries to be complete without
the space penalties.
However, the space complexity is as good as DFS, since we are using DFS along the way. Like
in BFS, the whole tree on level d (of optimal solution) is explored, so optimality is inherited from
there. Like BFS, one can modify this to incorporate uniform cost search behavior.
As a consequence, variants of IDS are the method of choice if we do not have additional
information.
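A compact Python sketch of depth-limited search wrapped into iterative deepening (step size 1); it reuses the Node/child_node helpers from the tree search sketch above and is an illustration, not the official pseudocode.

  # Sketch: depth-limited DFS, re-run with increasing depth limits (iterative deepening).
  def depth_limited_search(problem, node, limit):
      if problem.goal_test(node.state):
          return node.solution()
      if limit == 0:
          return "cutoff"                      # the depth limit was hit below this node
      cutoff_occurred = False
      for action in problem.actions(node.state):
          result = depth_limited_search(problem, child_node(problem, node, action), limit - 1)
          if result == "cutoff":
              cutoff_occurred = True
          elif result is not None:
              return result
      return "cutoff" if cutoff_occurred else None

  def iterative_deepening_search(problem, max_depth=50):
      for depth in range(max_depth + 1):       # depth limits 0, 1, 2, ... (step size 1)
          result = depth_limited_search(problem, Node(problem.initial), depth)
          if result != "cutoff":
              return result                    # either a solution or definite failure (None)
      return None

  # with the `romania` problem from earlier: iterative_deepening_search(romania)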
Graph versions of all the tree search algorithms considered here exist, but are more
difficult to understand (and to prove properties about).
The (time complexity) properties are largely stable under duplicate pruning. (no
gain in the worst case)
Summary: Search algorithms systematically explore the state space induced by a search problem in search of a goal state. Search strategies only differ by the treatment of the fringe.
Search Strategies and their Properties: We have discussed
Best-first search
Idea: Order the fringe by estimated “desirability” (Expand most desirable
unexpanded node)
Definition 6.5.2. An evaluation function assigns a desirability value to each node
of the search tree.
Note: An evaluation function is not part of the search problem, but must be added externally.
Definition 6.5.3. In best first search, the fringe is a queue sorted in decreasing
order of desirability.
This is like UCS, but with an evaluation function related to problem at hand replacing the path
cost function.
If the heuristic is arbitrary, we expect incompleteness!
Depends on how we measure “desirability”.
Concrete examples follow.
Greedy search
Idea: Expand the node that appears to be closest to the goal.
Definition 6.5.4. A heuristic is an evaluation function h on states that estimates the cost from the current state to the nearest goal state. We speak of heuristic search if the search algorithm uses a heuristic in some way.
Note: All nodes for the same state must have the same h-value!
Definition 6.5.5. Given a heuristic h, greedy search is the strategy where the
fringe is organized as a queue sorted by increasing h value.
Note: Unlike in uniform cost search, the node evaluation function has nothing to do with the nodes expanded so far.
In greedy search we replace the objective cost to construct the current solution with a heuristic or subjective measure which we think gives a good idea of how far we are from a solution. Two things have shifted:
• we went from an internal cost (determined only by features inherent in the search space) to an external/heuristic cost, and
• instead of measuring the cost to build the current partial solution, we estimate how far we are from the desired goal.
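In code, greedy search is just best-first search with the heuristic h as the fringe-ordering function; the minimal sketch below reuses the generic tree_search from above, with an assumed straight-line-distance table for a few Romanian cities as the heuristic.

  # Sketch: greedy (best-first) search = order the fringe by h(state) only.
  SLD_TO_BUCHAREST = {   # assumed straight-line distances used as heuristic h
      "Arad": 366, "Sibiu": 253, "Timisoara": 329, "Zerind": 374,
      "Fagaras": 176, "Rimnicu Vilcea": 193, "Pitesti": 100, "Bucharest": 0,
  }

  def greedy_priority(node):
      return SLD_TO_BUCHAREST[node.state]      # ignores the path cost g accumulated so far

  # with the `romania` problem and tree_search from earlier:
  # tree_search(romania, greedy_priority)  ->  Arad, Sibiu, Fagaras, Bucharest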
[Map of Romania (repeated); for greedy search we use the straight-line distance to Bucharest as the heuristic h.]
[Greedy search tree for the Romania example with h = straight-line distance to Bucharest: Arad (366) expands to Sibiu (253), Timisoara (329), and Zerind (374); Sibiu is expanded next, yielding Arad, Fagaras, Oradea, and Rimnicu Vilcea; from Fagaras, Sibiu (253) and Bucharest (0) are generated, and the goal Bucharest is reached.]
Let us fortify our intuitions with another example: navigation in a simple maze. Here the states
are the cells in the grid underlying the maze and the actions navigating to one of the adjoining
cells. The initial and goal states are the left upper and right lower corners of the grid. To see the
influence of the chosen heuristic (indicated by the red number in the cell), we compare the search
induced goal distance function with a heuristic based on the Manhattan distance. Just follow the
greedy search by following the heuristic gradient.
Heuristic Functions in Path Planning

Example 6.5.8 (The maze solved). We indicate h* by giving the goal distance:
[a 15×5 maze grid, each free cell annotated with its goal distance; the goal G is in the lower right corner]

Example 6.5.9 (Maze Heuristic: The good case). We use the Manhattan distance to the goal as a heuristic:
[the same maze, each cell annotated with its Manhattan distance to the goal, decreasing from 18 in the upper left corner to 0 at the goal]
Not surprisingly, the first maze is solved without search, since we are guided by the perfect heuristic. Where there is a choice, it has no influence on the length (or, in other cases, cost) of the solution.
In the “good case” example, greedy search performs well, but there is some limited backtracking
needed, for instance when exploring the left lower corner 3×3 area before climbing over the second
wall.
In the “bad case”, greedy search is led down the lower garden path, which has a dead end and does not lead to the goal. This suggests that we can construct adversarial examples, i.e. mazes where we can force greedy search into arbitrarily bad performance.
Example 6.5.11. Greedy search can get stuck going from Iasi to Oradea:
Iasi → Neamt → Iasi → Neamt → · · ·
[Map of Romania with road distances (repeated).]
Note that nothing prevents all nodes from being searched in the worst case; e.g. if the heuristic function gives us the same (low) estimate on all nodes except where the heuristic mis-estimates the distance to be high. So in the worst case, greedy search is even worse than BFS, where d (depth of the first solution) replaces m.
The search procedure cannot be optimal, since the actual cost of the solution is not considered.
For both, completeness and optimality, therefore, it is necessary to take the actual cost of
partial solutions, i.e. the path cost, into account. This way, paths that are known to be expensive
are avoided.
Heuristic Functions
Definition 6.5.13. Let Π be a search problem with states S. A heuristic function (or, short, heuristic) for Π is a function h : S → R₀⁺ ∪ {∞} so that h(s) = 0 whenever s is a goal state.
h(s) is intended as an estimate of the distance between state s and the nearest goal state.
Definition 6.5.14. Let Π be a search problem with states S, then the function h* : S → R₀⁺ ∪ {∞}, where h*(s) is the cost of a cheapest path from s to a goal state, or ∞ if no such path exists, is called the goal distance function for Π.
Notes:
h(s) = 0 on goal states: If your estimator returns “I think it’s still a long way”
on a goal state, then its intelligence is, um . . .
Return value ∞: To indicate dead ends, from which the goal state can’t be
reached anymore.
The distance estimate depends only on the state s, not on the node (i.e., the
path we took to reach s).
This works, provided that h does not overestimate the true cost to achieve the goal. In other
words, h must be optimistic wrt. the real cost h∗ . If we are too pessimistic, then non-optimal
solutions have a chance.
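In code, A* is best-first search with the evaluation function f(n) = g(n) + h(n), where g is the path cost accumulated so far and h an admissible heuristic; a minimal sketch reusing tree_search and the straight-line-distance table from the greedy search sketch:

  # Sketch: A* = order the fringe by f(n) = g(n) + h(n.state).
  def astar_priority(node):
      return node.path_cost + SLD_TO_BUCHAREST[node.state]

  # with the `romania` problem and tree_search from earlier:
  # tree_search(romania, astar_priority)
  # -> Arad, Sibiu, Rimnicu Vilcea, Pitesti, Bucharest  (cost 418, the cheaper of the two routes)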
A∗ Search: Optimality
Theorem 6.5.22. A∗ search with admissible heuristic is optimal.
A* Search Example
[A* search tree for the Romania example: each node is annotated with f = g + h, e.g. the root Arad with 366 = 0 + 366; after expanding Sibiu, its successors carry values such as 646 = 280 + 366 and 671 = 291 + 380.]
To extend our intuitions about informed search algorithms to A∗ -search, we take up the maze
examples from above again. We first show the good maze with Manhattan distance again.
[the “good” maze annotated with the Manhattan distance heuristic]
We will find a solution with little search.
To compare it to A∗ -search, here is the same maze but now with the numbers in red for the
evaluation function f where h is the Manhattan distance.
[the same maze, now annotated with the evaluation function f = g + h in each visited cell]
In A* with a consistent heuristic, g + h always increases monotonically (h cannot decrease more than g increases).
We need more search in the “right upper half”. This is typical: greedy best-first search tends to be faster than A*.
Let’s now consider the “bad maze” with Manhattan distance again.
[the “bad” maze annotated with the Manhattan distance heuristic]
And we compare it to A∗ -search; again the numbers in red are for the evaluation function f .
[the “bad” maze annotated with the evaluation function f = g + h]
We will search less of the “dead-end street”. Sometimes g + h gives better search guidance than h. (⇝ A* is faster there)
[the maze annotated with the corresponding node values]
In A*, node values always increase monotonically (with any heuristic). If the heuristic is perfect, they remain constant on optimal paths.
A* search: f-contours
Intuition: A*-search gradually adds “f-contours” (areas of the same f-value) to the search.
[Figure (cf. [RN03]): map of Romania with contours at f = 380, 400, and 420 around the start state Arad.]
Admissible heuristics: Example 8-puzzle
[8-puzzle figure again: start state 7 2 4 / 5 _ 6 / 8 3 1 and goal state _ 1 2 / 3 4 5 / 6 7 8.]
We now try to generalize these insights into (the beginnings of) a general method for obtaining admissible heuristics.
Relaxed problems
Observation: Finding good admissible heuristics is an art!
Idea: Admissible heuristics can be derived from the exact solution cost of a relaxed
version of the problem.
Example 6.5.33. If the rules of the 8-puzzle are relaxed so that a tile can move
anywhere, then we get heuristic h1 .
Example 6.5.34. If the rules are relaxed so that a tile can move to any adjacent
square, then we get heuristic h2 . (Manhattan distance)
Definition 6.5.35. Let Π := ⟨S, A, T, I, G⟩ be a search problem, then we call a search problem P^r := ⟨S, A^r, T^r, I^r, G^r⟩ a relaxed problem (wrt. Π; or simply relaxation of Π), iff A ⊆ A^r, T ⊆ T^r, I ⊆ I^r, and G ⊆ G^r.
Lemma 6.5.36. If P^r relaxes Π, then every solution for Π is one for P^r.
Key point: The optimal solution cost of a relaxed problem is not greater than the
optimal solution cost of the real problem.
Relaxation means to remove some of the constraints or requirements of the original problem,
so that a solution becomes easy to find. Then the cost of this easy solution can be used as an
optimistic approximation of the problem.
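The two relaxation-based 8-puzzle heuristics can be written down directly; a small Python sketch (states are assumed to be 9-tuples listing the tile in each cell, 0 for the blank):

  # Sketch: the two classic admissible 8-puzzle heuristics obtained from relaxations.
  GOAL = (0, 1, 2, 3, 4, 5, 6, 7, 8)   # goal layout with the blank (0) in the top-left corner

  def h1_misplaced(state, goal=GOAL):
      """h1: number of misplaced tiles (relaxation: a tile may move anywhere)."""
      return sum(1 for s, g in zip(state, goal) if s != 0 and s != g)

  def h2_manhattan(state, goal=GOAL):
      """h2: sum of the Manhattan distances of the tiles from their goal squares
      (relaxation: a tile may move to any adjacent square)."""
      pos = {tile: divmod(i, 3) for i, tile in enumerate(state)}
      goal_pos = {tile: divmod(i, 3) for i, tile in enumerate(goal)}
      return sum(abs(pos[t][0] - goal_pos[t][0]) + abs(pos[t][1] - goal_pos[t][1])
                 for t in range(1, 9))

  start = (7, 2, 4, 5, 0, 6, 8, 3, 1)   # the instance from the figure above
  print(h1_misplaced(start), h2_manhattan(start))   # -> 8 18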
See http://qiao.github.io/PathFinding.js/visual/
Difference to Breadth-first Search?: That would explore all grid cells in a circle
around the initial state!
In order to understand the procedure on a more intuitive level, let us consider the following
scenario: We are in a dark landscape (or we are blind), and we want to find the highest hill. The
search procedure above tells us to start our search anywhere, and for every step first feel around,
and then take a step into the direction with the steepest ascent. If we reach a place, where the
next step would take us down, we are finished.
Of course, this will only get us into local maxima, and has no guarantee of getting us into
global ones (remember, we are blind). The solution to this problem is to re-start the search at
random (we do not have any information) places, and hope that one of the random jumps will get
us to a slope that leads to a global maximum.
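The intuition translates almost directly into code; here is a minimal sketch of steepest-ascent hill climbing with random restarts (the objective function, neighbor relation, and restart count are assumptions of the sketch):

  # Sketch: steepest-ascent hill climbing with random restarts.
  import random

  def hill_climb(start, neighbors, value):
      current = start
      while True:
          best = max(neighbors(current), key=value, default=None)
          if best is None or value(best) <= value(current):
              return current                    # local maximum (no uphill neighbor): stop
          current = best                        # take the steepest uphill step

  def random_restart_hill_climbing(random_state, neighbors, value, restarts=20):
      # restart from random states and keep the best local maximum found
      return max((hill_climb(random_state(), neighbors, value) for _ in range(restarts)), key=value)

  # Toy objective (assumed): a bumpy 1-dimensional landscape over the integers 0..100.
  value = lambda x: -(x - 63) ** 2 + 30 * abs((x * 7) % 11 - 5)
  neighbors = lambda x: [y for y in (x - 1, x + 1) if 0 <= y <= 100]
  random_state = lambda: random.randint(0, 100)
  print(random_restart_hill_climbing(random_state, neighbors, value))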
Local search algorithms (cf. [RN03]) operate using a single current node (rather than multiple paths) and generally move only to neighbors of that node; typically, the paths followed by the search are not retained. Although local search algorithms are not systematic, they have two key advantages: (1) they use very little memory, usually a constant amount, and (2) they can often find reasonable solutions in large or infinite (continuous) state spaces for which systematic algorithms are unsuitable. In addition to finding goals, local search algorithms are useful for solving pure optimization problems, in which the aim is to find the best state according to an objective function.
Example: [an 8-queens state with heuristic cost estimate h = 17, showing the h-values for moving a queen within its column]
Problem: The state space has local maxima, where hill climbing can get stuck.
Recent work on hill climbing algorithms tries to combine complete search with randomization to escape certain odd phenomena occurring in the statistical distribution of solutions.
[Figure 4.4 (AIMA): Illustration of why ridges cause difficulties for hill climbing. The grid of states (dark circles) is superimposed on a ridge rising from left to right, creating a sequence of local maxima that are not directly connected to each other. From each local maximum, all the available actions point downhill.]

Simulated annealing (Idea)
Annealing is the process of heating steel and letting it cool gradually, to give it time to grow an optimal crystal structure.
If the temperature T is decreased slowly enough, the best state x* is always reached, because
e^(E(x*)/kT) / e^(E(x)/kT) = e^((E(x*)−E(x))/kT) ≫ 1
for small T.
Observation: Local beam search is not the same as k searches run in parallel!
(Searches that find good states recruit other searches to join them)
Problem: Quite often, all k searches end up on the same local hill!
Idea: Choose k successors randomly, biased towards good ones. (Observe the
close analogy to natural selection!)
[Figure 4.6 (AIMA): The genetic algorithm, illustrated for digit strings representing 8-queens states. The initial population in (a) is ranked by the fitness function in (b), resulting in pairs for mating in (c). They produce offspring in (d), which are subject to mutation in (e).]
[Figure 4.7 (AIMA): The 8-queens states corresponding to the first two parents in Figure 4.6(c) and the first offspring in Figure 4.6(d). The shaded columns are lost in the crossover step and the unshaded columns are retained.]

Note: Genetic algorithms ≠ evolution: e.g., real genes also encode the replication machinery!

Like beam searches, genetic algorithms begin with a set of k randomly generated states, called the population. Each state, or individual, is represented as a string over a finite alphabet, most commonly a string of 0s and 1s. For example, an 8-queens state must specify the positions of 8 queens, each in a column of 8 squares, and so requires 8 × log₂ 8 = 24 bits; alternatively, the state could be represented as 8 digits, each in the range from 1 to 8. To produce the next generation, each state is rated by the objective function or (in GA terminology) fitness function. A fitness function should return higher values for better states, so for the 8-queens problem we use the number of nonattacking pairs of queens, which has a value of 28 for a solution. The probability of being chosen for reproduction is directly proportional to the fitness score, and pairs are selected at random for reproduction in accordance with these probabilities. (cf. [RN03])
Chapter 7

Adversarial Search for Game Playing
7.1 Introduction
Video Nuggets covering this section can be found at https://fau.tv/clip/id/22060 and
https://fau.tv/clip/id/22061.
The Problem
The Problem of Game-Play: cf. ??
Example 7.1.1.
An Example Game
Definition 7.1.6. Let Θ be an adversarial search problem, and let X ∈ {Max, Min}. A strategy for X is a function σ_X : S^X → A^X so that a is applicable to s whenever σ_X(s) = a.
We don’t know how the opponent will react, and need to prepare for all possibilities.
Definition 7.1.7. A strategy is called optimal if it yields the best possible utility
for X assuming perfect opponent play (not formalized here).
Problem: In (almost) all games, computing an optimal strategy is infeasible.
(state/search tree too huge)
Solution: Compute the next move “on demand”, given the current state instead.
It’s even worse: Our algorithms here look at search trees (game trees), no
duplicate pruning.
Example 7.1.10.
Chess without duplicate pruning: 35^100 ≃ 10^154.
Go without duplicate pruning: 200^300 ≃ 10^690.
“Minimax”?
We want to compute an optimal strategy for player “Max”.
In other words: We are Max, and our opponent is Min.
Max attempts to maximize û(s) of the terminal states reachable during play.
Min attempts to minimize û(s).
The computation alternates between minimization and maximization ⇝ hence “minimax”.
Example Tic-Tac-Toe

Example 7.2.1. A full game tree for tic-tac-toe:
[Figure 5.1 (AIMA): A (partial) game tree for the game of tic-tac-toe. The top node is the initial state, and MAX moves first, placing an X in an empty square. We show part of the tree, giving alternating moves by MIN (O) and MAX (X), until we eventually reach terminal states, which can be assigned utilities according to the rules of the game.]
The current player and action are marked on the left; the last row shows terminal positions with their utility. The number on each leaf indicates the utility value of the terminal state from the point of view of MAX; high values are good for MAX and bad for MIN (which is how the players get their names).
Note: For tic-tac-toe the game tree is relatively small – fewer than 9! = 362,880 terminal nodes. But for chess there are over 10^40 nodes, so the game tree is best thought of as a theoretical construct that we cannot realize in the physical world. Regardless of the size of the game tree, it is MAX's job to search for a good move; we use the term search tree for a tree that is superimposed on the full game tree and examines enough nodes to allow a player to determine what move to make. (cf. [RN03])
Minimax: Outline
We max, we min, we max, we min . . .
1. Depth-first search in the game tree, with Max at the root.
In a normal search problem, the optimal solution would be a sequence of actions leading to a goal state, i.e. a terminal state that is a win. In adversarial search, MIN has something to say about it: MAX therefore must find a contingent strategy, which specifies MAX's move in the initial state, and then MAX's moves in the states resulting from every possible response by MIN. (cf. [RN03])
Minimax: Example
[Step-by-step minimax computation on a depth-2 game tree with leaf utilities 3, 12, 8 | 2, 4, 6 | 14, 5, 2: the three Min nodes evaluate to 3, 2, and 2 respectively, so the Max root evaluates to 3. The running values start at −∞ for Max nodes and ∞ for Min nodes and are updated as the children are explored depth first.]
(If any of you sat down, prior to this lecture, to implement a Tic-Tac-Toe player,
chances are you either looked this up on Wikipedia, or invented it in the process.)
Returns an optimal action, assuming perfect opponent play.
No matter how the opponent plays, the utility of the terminal state reached
will be at least the value computed for the root.
If the opponent plays perfectly, exactly that value will be reached.
There’s no need to re-run minimax for every game state: Run it once, offline
before the game starts. During the actual game, just follow the branches taken
in the tree. Whenever it’s your turn, choose an action maximizing the value of
the successor states.
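For reference, a minimal Python sketch of minimax on an explicit game tree (the nested-list tree encoding is an assumption; leaves are utilities from Max's point of view):

  # Sketch: minimax on an explicit game tree; internal nodes are lists of subtrees, leaves are utilities.
  def minimax(node, maximizing=True):
      if not isinstance(node, list):              # terminal state: return its utility u(s)
          return node
      values = [minimax(child, not maximizing) for child in node]
      return max(values) if maximizing else min(values)

  # The example tree used above: three Min nodes with leaves (3, 12, 8), (2, 4, 6), (14, 5, 2).
  tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
  print(minimax(tree))   # -> 3 (Max picks the first subtree, whose Min value is 3)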
Minimax disadvantages: It’s completely infeasible in practice.
When the search tree is too large, we need to limit the search depth and apply
an evaluation function to the cut off states.
Solution: We impose a search depth limit (also called horizon) d, and apply an
evaluation function to the cut-off states, i.e. states s with dp(s) = d.
Definition 7.3.1. An evaluation function f maps game states to numbers:
f (s) is an estimate of the actual value of s (as would be computed by unlimited-
depth minimax for s).
If cut-off state is terminal: Just use û instead of f .
Analogy to heuristic functions (cf. ??): We want f to be both (a) accurate and
(b) fast.
Another analogy: (a) and (b) are in contradiction ; need to trade-off accuracy
against overhead.
In typical game playing algorithms today, f is inaccurate but very fast.
(usually no good methods known for computing accurate f )
Example Chess
This assumes that the features (their contribution towards the actual value of the state) are independent. That's usually not the case (e.g. the value of a rook depends on the pawn structure).
Definition 7.3.4 (Better Solution). The quiescent search algorithm uses a dynamically adapted search depth d: it searches more deeply in unquiet positions, where the value of the evaluation function changes a lot between neighboring states.
Example 7.3.5. In quiescent search for chess:
piece exchange situations (“you take mine, I take yours”) are very unquiet
; Keep searching until the end of the piece exchange is reached.
[Minimax example again: after the first Min subtree (3, 12, 8) evaluates to 3, the Max root is known to be ≥ 3; as soon as the second Min subtree encounters the leaf 2, that subtree can be at most 2 and cannot influence the root value.]
Idea: We can use this to prune the search tree ⇝ better algorithm
Alpha Pruning
Definition 7.4.1. For each node n in a minimax search tree, the alpha value α(n)
is the highest Max-node utility that search has encountered on its path from the
root to n.
Example 7.4.2 (Computing alpha values).
[Step-by-step computation of the alpha values on the example tree: while the first Min subtree (leaves 3, 12, 8) is explored, the root has value −∞ and α = −∞; after it evaluates to 3, the root becomes Max 3 with α = 3. In the second Min subtree the leaf 2 is encountered with α = 3, so that subtree's value is ≤ 2 < α and its remaining successors are pruned.]
How to use α?: In a Min-node n, if û(n′ ) ≤ α(n) for one of the successors, then
stop considering n. (pruning out its remaining successors)
Alpha-Beta Pruning
Recall:
What is α: For each search node n, the highest Max-node utility that search
has encountered on its path from the root to n.
How to use α: In a Min-node n, if one of the successors already has utility
≤ α(n), then stop considering n. (Pruning out its remaining successors)
Note: α only gets assigned a value in Max-nodes, and β only gets assigned a value in Min-nodes.
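A minimal Python sketch of depth-limited alpha-beta search (an illustration, not the official algorithm statement of the notes); it reuses the hypothetical game interface from the minimax sketch above and assumes an evaluation function game.evaluate for cut-off states.

```python
def alphabeta(game, state, depth, alpha=float("-inf"), beta=float("inf")):
    """Depth-limited alpha-beta value of `state` from Max's perspective.
    `game.evaluate` is the (assumed) evaluation function f applied at cut-off states."""
    if game.is_terminal(state):
        return game.utility(state)
    if depth == 0:
        return game.evaluate(state)
    if game.to_move(state) == "Max":
        value = float("-inf")
        for a in game.actions(state):
            value = max(value, alphabeta(game, game.result(state, a), depth - 1, alpha, beta))
            alpha = max(alpha, value)
            if value >= beta:          # the Min node above will never allow this branch
                break
        return value
    else:
        value = float("inf")
        for a in game.actions(state):
            value = min(value, alphabeta(game, game.result(state, a), depth - 1, alpha, beta))
            beta = min(beta, value)
            if value <= alpha:         # the Max node above already has something better
                break
        return value
```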
[Figure: alpha-beta search on the example tree, annotating each node with its [α, β] interval: the first Min subtree yields 3, the second is cut off after its first leaf 2, and the third is searched completely (leaves 14, 5, 2); the root value is 3.]
Note: We could have saved work by choosing the opposite order for the successors
[Figure: the same alpha-beta search with the successors of the last Min node considered in the opposite order; that subtree is then cut off after fewer leaf evaluations.]
Observation: Assuming a game tree with branching factor b and depth limit d:
Minimax would have to search b^d nodes.
Best case: If we always choose the best moves first, then the search tree is reduced to b^(d/2) nodes!
Practice: It is often possible to get very close to the best case by simple move-
ordering methods.
Example 7.4.5 (Chess).
Move ordering: Try captures first, then threats, then forward moves, then back-
ward moves.
From 35^d to 35^(d/2). E.g., if we have the time to search a billion (10^9) nodes, then minimax looks ahead d = 6 moves, i.e., 3 rounds (white-black) of the game. Alpha-beta search looks ahead 6 rounds.
And now . . .
AlphaGo = Monte Carlo tree search (AI-1) + neural networks (AI-2)
Definition 7.5.2. For the Monte Carlo tree search algorithm (MCTS) we maintain
a search tree T , the MCTS tree.
This looks only at a fraction of the search tree, so it is crucial to have good guidance where to go,
i.e. which part of the search tree to look at.
[Figure: six MCTS iterations on a small example tree with leaf rewards 70, 50, 30, 100, 10, 40; the root stores, per action, the number of expansions and the average reward, ending with expansions 2, 2, 2 and average rewards 60, 55, 35.]
The sampling goes middle, left, right, right, left, middle. Then it stops and selects the action with the highest average reward (60, i.e. left). After the first sample, when the values in the initial state are updated, we have the following "expansions" and "avg. reward" fields: a small number of expansions is favored for exploration (visit parts of the tree rarely visited before – what is out there?), and a high average reward is favored for exploitation (focus on promising parts of the search tree).
[Figure: the same six MCTS iterations, this time incrementally building the MCTS tree; every node added to the tree keeps its own expansions and average-reward counters for every applicable action.]
This is the exact same search as on the previous slide, but incrementally building the search tree by always keeping the first state of the sample. The first three iterations (middle, left, right) show the tree extension; note that, like the root node, the nodes added to the tree have expansions and average-reward counters for every applicable action. In the next iteration (right), after the leaf node with value 30 was found, the important point is that the averages get updated along the entire path, i.e. not only in the root as we did before, but also in the nodes along the way. After all six iterations have been done, we select the action left (value 60) as before; but we keep the part of the tree below that action, saving relevant work already done before.
Exploitation: Prefer moves that have high average already (interesting regions
of state space)
Exploration: Prefer moves that have not been tried a lot yet (don’t overlook
other, possibly better, options)
UCT: “Upper Confidence bounds applied to Trees” [KS06].
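As an illustration of how exploitation and exploration are balanced, here is a sketch of the UCB1 selection rule used by UCT; the node bookkeeping (visits, total_reward, children) and the exploration constant C are assumptions of the sketch, not prescribed by the notes.

```python
import math

def ucb1_choice(node, C=1.4):
    """Pick the child action balancing exploitation (average reward) and exploration
    (visit count). `node.children` is assumed to map actions to child nodes that
    carry the fields `visits` (expansions) and `total_reward`."""
    def ucb(child):
        if child.visits == 0:
            return float("inf")                       # always try unvisited children first
        exploit = child.total_reward / child.visits   # average reward so far
        explore = C * math.sqrt(math.log(node.visits) / child.visits)
        return exploit + explore
    return max(node.children.items(), key=lambda item: ucb(item[1]))[0]
```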
AlphaGo: Overview
Definition 7.5.5 (Neural Networks in AlphaGo).
[Figure: AlphaGo's neural network training pipeline and architecture; illustration taken from [Sil+16] (Figure 1).]
Rollout policy pπ: Simple but fast, ≈ prior work on Go.
SL policy network pσ: Supervised learning, human-expert data (“learn to choose an expert action”).
RL policy network pρ: Reinforcement learning, self-play (“learn to win”).
Value network vθ : Use self-play games with pρ as training data for game-position
evaluation vθ (“predict which player will win in this state”).
a A fast rollout policy pπ and supervised learning (SL) policy network pσ are trained to predict
human expert moves in a data set of positions. A reinforcement learning (RL) policy network
pρ is initialized to the SL policy network, and is then improved by policy gradient learning to
maximize the outcome (that is, winning more games) against previous versions of the policy
network. A new data set is generated by playing games of self-play with the RL policy network.
Finally, a value network vθ is trained by regression to predict the expected outcome (that is,
whether the current player wins) in positions from the self-play data set.
b Schematic representation of the neural network architecture used in AlphaGo. The policy
network takes a representation of the board position s as its input, passes it through many con-
volutional layers with parameters σ (SL policy network) or ρ (RL policy network), and outputs a
probability distribution pσ (a|s) or pρ (a|s) over legal moves a, represented by a probability map
over the board. The value network similarly uses many convolutional layers with parameters θ,
but outputs a scalar value vθ (s′ ) that predicts the expected outcome in position s′ .
[Figure: Monte Carlo tree search in AlphaGo; illustration taken from [Sil+16] (Figure 3).]
Rollout policy pπ: Action choice in random samples.
SL policy network pσ: Action choice bias within the UCTS tree (stored as “P”, gets smaller to “u(P)” with the number of visits); along with the quality Q.
RL policy network pρ: Not used here (used only to learn vθ).
Value network vθ: Used to evaluate leaf states s, in a linear sum with the value returned by a random sample on s.
Comments on the Figure:
a Each simulation traverses the tree by selecting the edge with maximum action value Q, plus a bonus u(P) that depends on a stored prior probability P for that edge.
b The leaf node may be expanded; the new node is processed once by the policy network pσ and
the output probabilities are stored as prior probabilities P for each action.
c At the end of a simulation, the leaf node is evaluated in two ways: using the value network vθ, and by running a rollout to the end of the game with the fast rollout policy pπ, then computing the winner with the function r. Action values Q are updated to track the mean value of all evaluations r(·) and vθ(·) in the subtree below that action.
• The AlphaGo design is quite intricate (architecture, learning workflow, training data design,
neural network architectures, . . . ).
• How much of this is reusable in/generalizes to other problems?
• Still lots of human expertise in here. Not so much about the game itself, as in chess, but rather in the design of the neural networks + learning architecture.
The chess machine is an ideal one to start with, since (Claude Shannon (1949))
1. the problem is sharply defined both in allowed operations (the moves) and in the
ultimate goal (checkmate),
2. it is neither so simple as to be trivial nor too difficult for satisfactory solution,
3. chess is generally considered to require “thinking” for skilful play, [. . . ]
4. the discrete structure of chess fits well into the digital nature of modern comput-
ers.
Chess is the drosophila of Artificial Intelligence. (Alexander Kronrod (1965))
7.7 Conclusion
Summary
Games (2-player turn-taking zero-sum discrete and finite games) can be understood
as a simple extension of classical search problems.
Each player tries to reach a terminal state with the best possible utility (maximal
vs. minimal).
Minimax searches the game depth-first, max’ing and min’ing at the respective turns
of each player. It yields perfect play, but takes time O(b^d) where b is the branching
factor and d the search depth.
Except in trivial games (Tic-Tac-Toe), minimax needs a depth limit and applies an
evaluation function to estimate the value of the cut-off states.
Alpha-beta search remembers the best values achieved for each player elsewhere in
the tree already, and prunes out sub-trees that won’t be reached in the game.
Monte Carlo tree search (MCTS) samples game branches, and averages the findings.
AlphaGo controls this using neural networks: evaluation function (“value network”),
and action filter (“policy network”).
Suggested Reading:
• Chapter 5: Adversarial Search, Sections 5.1 – 5.4 [RN09].
– Section 5.1 corresponds to my “Introduction”, Section 5.2 corresponds to my “Minimax Search”,
Section 5.3 corresponds to my “Alpha-Beta Search”. I have tried to add some additional clarify-
ing illustrations. RN gives many complementary explanations, nice as additional background
reading.
– Section 5.4 corresponds to my “Evaluation Functions”, but discusses additional aspects re-
lating to narrowing the search and look-up from opening/termination databases. Nice as
additional background reading.
– I suppose a discussion of MCTS and AlphaGo will be added to the next edition . . .
Chapter 8
Constraint Satisfaction Problems
In the last chapters we have studied methods for “general problem solving”, i.e. methods that are applicable to all problems that are expressible in terms of states and “actions”. It is crucial to realize that these states were atomic, which makes the algorithms employed (search algorithms) relatively simple and generic, but does not let them exploit any knowledge we might have about the internal structure of states.
In this chapter, we will look into algorithms that do just that by progressing to factored state representations. We will see that this allows for algorithms that are many orders of magnitude more efficient than search algorithms.
To give an intuition for factored state representations, we present some motivational examples in ?? and go into the details of the Waltz algorithm, which gave rise to the main ideas of constraint satisfaction algorithms, in ??. ?? and ?? define constraint satisfaction problems formally and use that to develop a class of backtracking/search based algorithms. The main contribution of factored state representations is that we can formulate advanced search heuristics that guide search based on the structure of the states.
Allows useful general-purpose algorithms with more power than standard tree
search algorithms.
Variables: v_{A vs. B} where A and B are teams, with domains {1, . . . , 34}: for each match, the index of the weekend where it is scheduled.
(Some) constraints: (each is checked in the code sketch after this list)
If {A, B} ∩ {C, D} ≠ ∅: v_{A vs. B} ≠ v_{C vs. D} (each team only one match per day).
If A = C: v_{A vs. B} + 1 ≠ v_{C vs. D} (each team alternates between home matches and away matches).
Leading teams of last season meet
near the end of each half-season.
...
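Here is the promised sketch (not from the notes) that checks the first two constraints on a candidate schedule in plain Python; the schedule representation and the team names are made up for illustration.

```python
from itertools import permutations

def schedule_ok(schedule):
    """Check the two constraints above on a candidate schedule.
    `schedule` is assumed to map match tuples (home, away) to matchdays in 1..34."""
    for (a, b), (c, d) in permutations(schedule, 2):
        day1, day2 = schedule[(a, b)], schedule[(c, d)]
        if {a, b} & {c, d} and day1 == day2:
            return False        # some team would play two matches on the same day
        if a == c and day1 + 1 == day2:
            return False        # team a would play at home on two consecutive matchdays
    return True

# Tiny hypothetical instance with three matches (team names are made up):
example = {("FCB", "BVB"): 1, ("FCB", "S04"): 2, ("BVB", "S04"): 3}
print(schedule_ok(example))     # False: FCB would host on matchdays 1 and 2
```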
Estimated running time: End of this universe, and the next couple billion ones
after it . . .
Directly enumerate all permutations of the numbers 1, . . . , 306, test for each whether
it’s a legal Bundesliga schedule.
Estimated running time: Maybe only the time span of a few thousand uni-
verses.
View this as variables/constraints and use backtracking (this chapter)
Executed running time: About 1 minute.
How do they actually do it?: Modern computers and CSP methods: fractions
of a second. 19th (20th/21st?) century: Combinatorics and manual work.
Try it yourself: with an off-the-shelf CSP solver, e.g. Minion [Min]
1. U.S. Major League Baseball, 30 teams, each 162 games. There’s one crucial additional difficulty,
in comparison to Bundesliga. Which one? Travel is a major issue here!! Hence “Traveling
Tournament Problem” in reference to the TSP.
2. This particular scheduling problem is called “car sequencing”, how to most efficiently get cars
through the available machines when making the final customer configuration (non-standard/flexible/custom
extras).
Simple methods for making backtracking aware of the structure of the problem,
and thereby reducing search.
Idea: Adjacent intersections impose constraints on each other. Use CSP to find a
unique set of labelings.
Observation 8.2.1. Then each line on the images is one of the following:
a boundary line (edge of an object) (<) with right hand of arrow denoting “solid”
and left hand denoting “space”
an interior convex edge (label with “+”)
an interior concave edge (label with “-”)
Waltz’s Examples
In his 1972 dissertation [Wal75], David Waltz used the following examples
Types of CSPs
Definition 8.3.1. We call a CSP discrete, iff all of the variables have countable
domains; we have two kinds:
finite domains (size d ; O(d^n) solutions)
e.g., Boolean CSPs (solvability ≙ Boolean satisfiability ; NP-complete)
infinite domains (e.g. integers, strings, etc.)
e.g., job scheduling, variables are start/end days for each job
need a “constraint language”, e.g., StartJob1 + 5 ≤ StartJob3
linear constraints decidable, nonlinear ones undecidable
Types of Constraints
We classify the constraints by the number of variables they involve.
Definition 8.3.11. Problems like the one in ?? are called crypto-arithmetic puzzles.
S E N D
+ M O R E
= M O N E Y
D + E = Y + 10 · X1
X1 + N + R = E + 10 · X2
X2 + E + O = N + 10 · X3
X3 + S + M = O + 10 · M
Problem: The problem structure gets hidden. (search algorithms can get
confused)
Constraint Graph
Definition 8.3.13. A binary CSP is a CSP where each constraint is unary or binary.
Observation 8.3.14. A binary CSP forms a graph called the constraint graph
whose nodes are variables, and whose edges represent the constraints.
Example 8.3.15. Australia as a binary CSP
[Figure: map of Australia and the corresponding constraint graph over WA, NT, SA, Q, NSW, V, T; cf. Figure 6.1 in [RN09].]
Intuition: General-purpose CSP algorithms use the graph structure to speed up search. (E.g., Tasmania is an independent subproblem!)
Real-world CSPs
Example 8.3.16 (Assignment problems). e.g., who teaches what class
Example 8.3.17 (Timetabling problems). e.g., which class is offered when and
where?
Example 8.3.18 (Hardware configuration).
Note that the ideas are still the same as ??, but in constraint networks we have a
language to formulate things precisely.
Idea: We will explore that idea for algorithms that solve constraint networks.
[Figure: two levels of the backtracking search tree for coloring Australia: WA = red, then NT = green or NT = blue, then Q = red or Q = blue.]
Backtracking Search
Assignments for different variables are independent!
e.g. first WA = red then NT = green vs. first NT = green then WA = red
; we only need to consider assignments to a single variable at each node
; b = d and there are d^n leaves.
Definition 8.5.3. Depth-first search for CSPs with single-variable assignment extensions as actions is called backtracking search.
Backtracking search is the basic uninformed algorithm for CSPs.
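A minimal Python sketch of naive backtracking search (an illustration; the representation of a constraint network as a domains dict plus binary constraint predicates is an assumption of the sketch, not the notes' official formalization):

```python
def backtracking_search(domains, constraints, assignment=None):
    """Naive backtracking for a binary CSP.
    `domains`: dict variable -> list of values.
    `constraints`: dict (u, v) -> predicate on (value_u, value_v); entries are assumed
    to exist for both orderings of each constrained pair."""
    if assignment is None:
        assignment = {}
    if len(assignment) == len(domains):
        return assignment
    var = next(v for v in domains if v not in assignment)   # naive variable order
    for value in domains[var]:
        consistent = all(check(value, assignment[other])
                         for (u, other), check in constraints.items()
                         if u == var and other in assignment)
        if consistent:
            assignment[var] = value
            result = backtracking_search(domains, constraints, assignment)
            if result is not None:
                return result
            del assignment[var]
    return None

# Usage sketch: 3-coloring a fragment of the Australia map.
neq = lambda x, y: x != y
doms = {"WA": ["red", "green", "blue"], "NT": ["red", "green", "blue"],
        "SA": ["red", "green", "blue"]}
cons = {}
for u, v in [("WA", "NT"), ("WA", "SA"), ("NT", "SA")]:
    cons[(u, v)] = neq
    cons[(v, u)] = neq
print(backtracking_search(doms, cons))   # e.g. {'WA': 'red', 'NT': 'green', 'SA': 'blue'}
```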
Backtracking in Australia
Example 8.5.5. We apply backtracking search for a map coloring problem:
Step 1:
Step 2:
Step 3:
Step 4:
Where in ?? does the most constraining variable play a role in the choice? SA (only possible choice), NT (all choices possible except WA, V, T). Where in the illustration does the most constrained variable play a role in the choice? NT (all choices possible except T), Q (only Q and WA possible).
By choosing the least constraining value first, we increase the chances of not ruling out the solutions below the current node.
Example 8.5.11.
Suggested Reading:
– Compared to our treatment of the topic “Constraint Satisfaction Problems” (?? and ??),
RN covers much more material, but less formally and in much less detail (in particular, my
slides contain many additional in-depth examples). Nice background/additional reading, can’t
replace the lectures.
– Section 6.1: Similar to our “Introduction” and “Constraint Networks”, less/different examples,
much less detail, more discussion of extensions/variations.
– Section 6.3: Similar to my “Naïve Backtracking” and “Variable- and Value Ordering”, with
less examples and details; contains part of what we cover in ?? (RN does inference first, then
backtracking). Additional discussion of backjumping.
Chapter 9
Constraint Propagation
In this chapter we discuss another idea that is central to symbolic AI as a whole. The first component is that with factored state representations, we need to use a representation language for (sets of) states. The second component is that instead of state-level search, we can graduate to representation-level search (inference), which can be much more efficient than state-level search, as the respective representation-language actions correspond to groups of state-level actions.
9.1 Introduction
A Video Nugget covering this section can be found at https://fau.tv/clip/id/22321.
[Figure: map of Australia and the corresponding constraint graph; cf. Figure 6.1 in [RN09].]
Question: Can we add a constraint without losing any solutions?
Illustration: Decomposition
[Figure: the Australia constraint graph decomposes into the mainland component and Tasmania.]
Tasmania is not adjacent to any other state. Thus we can color Australia first, and assign an arbitrary color to Tasmania afterwards.
[Figure: two constraint networks γ and γ′ over the same variables with domains {red, blue} and ≠ constraints.]
Tightness
Definition 9.2.5 (Tightness). Let γ := ⟨V, D, C⟩ and γ′ = ⟨V, D′, C′⟩ be constraint networks sharing the same set of variables, then γ′ is tighter than γ (write γ′ ⊑ γ), if:
(i) For all v ∈ V: D′_v ⊆ D_v.
(ii) For all u ≠ v ∈ V and C′_uv ∈ C′: either C_uv ∉ C or C′_uv ⊆ C_uv.
[Figure: example constraint networks γ and γ′ with domains {red, blue} and ≠ constraints, illustrating the tightness relation.]
[Figure: remaining domains of WA, NT, Q, NSW, V, SA, T during forward checking on the Australia example.]
Note: It’s a bit strange that we start with d′ here; this is to make the link to arc consistency – coming up next – as obvious as possible (same notations u and d vs. v and d′).
Incremental computation: Instead of the first for-loop in ??, use only the inner one
every time a new assignment a(v) = d′ is added.
Practical Properties:
Cheap but useful inference method.
Rarely a good idea to not use forward checking (or a stronger inference method
subsuming it).
Up next: A stronger inference method (subsuming forward checking).
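A sketch of the forward checking step in the same (assumed) domains/constraints representation as the backtracking sketch above:

```python
def forward_check(domains, constraints, var, value):
    """After the assignment var := value, prune from each neighbour w of var all
    values that are incompatible with `value`; return a pruned copy of `domains`,
    or None if some domain becomes empty (dead end detected early)."""
    pruned = {v: list(vals) for v, vals in domains.items()}
    pruned[var] = [value]
    for (u, w), check in constraints.items():
        if u == var and w != var:
            pruned[w] = [d for d in pruned[w] if check(value, d)]
            if not pruned[w]:
                return None
    return pruned
```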
Example 9.4.1.
[Figure: forward checking on the network with variables v1, v2, v3, domains {1, 2, 3}, and constraints v1 < v2, v2 < v3, after assigning v1 = 1: the domain of v2 shrinks to {2, 3}.]
We could do better here: value 3 for v2 is not consistent with any remaining value
for v3 ; it can be removed!
But forward checking does not catch this.
[Figure: remaining domains of WA, NT, Q, NSW, V, SA, T during forward checking on the Australia example.]
Note: SA is not consistent relative to NT in the 3rd row.
Forward checking makes arc inferences only “from assigned to unassigned” variables.
Lemma 9.4.10. If d is the maximal domain size in γ and the test “(d, d′) ∈ C_uv?” has time complexity O(1), then the running time of Revise(γ, u, v) is O(d^2).
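For illustration, a Python sketch of Revise in the same assumed domains/constraints representation as above; it performs at most d^2 constraint tests, matching the lemma.

```python
def revise(domains, constraints, u, v):
    """Remove from D_u every value without a supporting value in D_v under C_uv.
    Returns True iff D_u was changed. Uses at most d^2 constraint tests."""
    check = constraints[(u, v)]
    supported = [x for x in domains[u] if any(check(x, y) for y in domains[v])]
    changed = len(supported) < len(domains[u])
    domains[u] = supported
    return changed
```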
[Figure: repeated application of Revise on the network with v1 = 1, domains {1, 2, 3}, and constraints v1 < v2, v2 < v3, successively pruning the domains of v2 and v3.]
AC-3: Example
Example 9.4.15. y div x = 0: y modulo x is 0, i.e., y is divisible by x
[Figure: AC-3 on the network with variables v1, v2, v3, domains D_v1 = D_v3 = {2, 5} and D_v2 = {2, 4}, and constraints v2 div v1 = 0 and v3 div v1 = 0; the worklist M initially contains the pairs (v2, v1), (v1, v2), (v3, v1), (v1, v3) and is processed step by step, pruning values without support (e.g. D_v3 ends up as {2}).]
AC-3: Runtime
Theorem 9.4.16 (Runtime of AC-3). Let γ := ⟨V, D, C⟩ be a constraint network with m constraints, and maximal domain size d. Then AC-3(γ) runs in time O(m·d^3).
Proof: by counting how often Revise is called.
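A sketch of AC-3 built on the revise sketch from above (again with the assumed domains/constraints representation):

```python
from collections import deque

def ac3(domains, constraints):
    """Enforce arc consistency: process a worklist M of directed pairs (u, v);
    whenever D_u changes, re-add all pairs (w, u) for neighbours w of u.
    With m constraints and maximal domain size d this runs in O(m * d^3)."""
    queue = deque(constraints.keys())            # all directed constraint pairs
    while queue:
        u, v = queue.popleft()
        if revise(domains, constraints, u, v):
            if not domains[u]:
                return False                     # empty domain: no solution
            for (w, x) in constraints:
                if x == u and w != v:
                    queue.append((w, u))
    return True
```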
Problem Structure
[Figure: the Australia constraint graph; Tasmania is an independent subproblem; cf. Figure 6.1 in [RN09].]
E.g., n = 80, d = 2, c = 20:
2^80 ≙ 4 billion years at 10 million nodes/sec.
4 · 2^20 ≙ 0.4 seconds at 10 million nodes/sec.
Example 9.5.4 (Doing the Numbers).
γ with n = 40 variables, each domain size k = 2. Four separate connected components each of size 10.
Reduction of worst-case when using decomposition:
Tree-structured CSPs
Definition 9.5.6. We call a CSP tree-structured, iff its constraint graph is acyclic.
Theorem 9.5.7. Tree-structured CSPs can be solved in O(n·d^2) time.
Example 9.5.9.
Definition 9.5.10. Cutset conditioning: instantiate (in all ways) a set of variables
such that the remaining constraint graph is a tree.
Cutset size c ; running time O(d^c · (n − c) · d^2), very fast for small c.
Constraint networks with acyclic constraint graphs can be solved in (low order)
polynomial time.
Example 9.5.12. Australia is not acyclic. (But see next section)
[Figure: the Australia constraint graph, which contains cycles.]
a We assume here that γ’s constraint graph is connected. If it is not, do this and the following
AcyclicCG(γ): Example
Example 9.5.16 (AcyclicCG() execution).
Input network γ: variables v1, v2, v3 with domains {1, 2, 3} and constraints v1 < v2, v2 < v3.
Step 1: Directed tree for root v1.
Step 2: Order v1, v2, v3.
[Figure: the remaining steps apply Revise bottom-up along the directed tree and then assign top-down, yielding v1 = 1, v2 = 2, v3 = 3.]
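A Python sketch of the AcyclicCG idea (bottom-up Revise along the directed tree, then backtrack-free top-down assignment); the variable order and parent map are assumed to be given, and revise is the sketch from the arc consistency section above.

```python
def acyclic_cg(domains, constraints, order, parent):
    """Solve a CSP whose constraint graph is a tree.
    `order` lists the variables from the root downwards; `parent` maps each
    non-root variable to its parent in the directed tree (both assumed given)."""
    # Bottom-up: make each parent arc consistent relative to its child.
    for v in reversed(order[1:]):
        if revise(domains, constraints, parent[v], v):
            if not domains[parent[v]]:
                return None                       # inconsistency detected
    # Top-down: greedily pick consistent values; no backtracking is needed.
    assignment = {order[0]: domains[order[0]][0]}
    for v in order[1:]:
        check = constraints[(parent[v], v)]
        assignment[v] = next(d for d in domains[v] if check(assignment[parent[v]], d))
    return assignment

# On the example network above (v1 < v2 < v3 over {1, 2, 3}) this yields
# {'v1': 1, 'v2': 2, 'v3': 3}.
```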
Remark 9.6.5. Finding optimal cutsets is NP hard, but good approximations exist.
Example: 4-Queens
States: 4 queens in 4 columns (4^4 = 256 states)
Actions: Move queen in column
Goal state: No conflicts
Heuristic: h(n) ≙ number of conflicts
Performance of min-conflicts
Given a random initial state, can solve n-queens in almost constant time for
arbitrary n with high probability (e.g., n = 10,000,000)
The same appears to be true for any randomly-generated CSP except in a narrow range of the ratio R = (number of constraints) / (number of variables).
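For illustration, a plain-Python sketch of min-conflicts local search for n-queens (the column-wise representation queens[c] = row is an assumption of the sketch):

```python
import random

def min_conflicts_nqueens(n, max_steps=100_000):
    """Min-conflicts local search for n-queens: queens[c] = row of the queen in column c."""
    queens = [random.randrange(n) for _ in range(n)]

    def conflicts(col, row):
        """Number of other queens attacking square (col, row)."""
        return sum(1 for c in range(n) if c != col and
                   (queens[c] == row or abs(queens[c] - row) == abs(c - col)))

    for _ in range(max_steps):
        conflicted = [c for c in range(n) if conflicts(c, queens[c]) > 0]
        if not conflicted:
            return queens                       # solution found
        col = random.choice(conflicted)         # pick a random conflicted variable
        # move its queen to a row minimizing the number of conflicts
        queens[col] = min(range(n), key=lambda r: conflicts(col, r))
    return None
```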
Arc consistency removes values that do not comply with any value still available at
the other end of a constraint. This subsumes forward checking.
The constraint graph captures the dependencies between variables. Separate con-
nected components can be solved independently. Networks with acyclic constraint
graphs can be solved in low order polynomial time.
A cutset is a subset of variables removing which renders the constraint graph acyclic.
Cutset conditioning backtracks only on such a cutset, and solves a sub-problem with
acyclic constraint graph at each search leaf.
Suggested Reading:
• Chapter 6: Constraint Satisfaction Problems in [RN09], in particular Sections 6.2, 6.3.2, and
6.5.
– Compared to our treatment of the topic “constraint satisfaction problems” (?? and ??),
RN covers much more material, but less formally and in much less detail (in particular, our
slides contain many additional in-depth examples). Nice background/additional reading, can’t
replace the lectures.
– Section 6.3.2: Somewhat comparable to our “inference” (except that equivalence and tightness
are not made explicit in RN) together with “forward checking”.
– Section 6.2: Similar to our “arc consistency”, less/different examples, much less detail, addi-
tional discussion of path consistency and global constraints.
– Section 6.5: Similar to our “decomposition” and “cutset conditioning”, less/different examples,
much less detail, additional discussion of tree decomposition.
Part III
Recall: We have used atomic representations in search problems and tree search
algorithms.
But: We already allowed peeking into the state in
informed search to compute heuristics
adversarial search ⇝ too many states!
Recall: We have used factored representations in
backtracking search for CSPs ; universally useful heuristics
constraint propagation: inference ; lifting search to the CSP description level.
219
The Wumpus world is a very simple game modeled after the early text adventure games of the 1960s and 70s, where the player entered a world and was provided with textual information about
percepts and could explore the world via actions. The main difference is that we use it as an agent
environment in this course.
Definition 10.1.2 (Actions). The agent can perform the following actions: goForward,
turnRight (by 90◦ ), turnLeft (by 90◦ ), shoot arrow in direction you’re facing (you
got exactly one arrow), grab an object in current cell, leave cave if you’re in cell
[1, 1].
Definition 10.1.3 (Initial and Terminal States). Initially, the agent is in cell
[1, 1] facing east. If the agent falls down a pit or meets a live Wumpus, it dies.
Definition 10.1.4 (Percepts). The agent can experience the following percepts:
stench, breeze, glitter, bump, scream, none.
Cell adjacent (i.e. north, south, west, east) to Wumpus: stench (else: none).
Cell adjacent to pit: breeze (else: none).
Cell that contains gold: glitter (else: none).
You walk into a wall: bump (else: none).
Wumpus shot by arrow: scream (else: none).
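As an illustration (not part of the game definition), a small Python sketch that computes the percept set of a cell from a hypothetical world description on the usual 4×4 grid:

```python
def adjacent(cell):
    """North, south, west and east neighbours of a cell (x, y) on the 4x4 grid."""
    x, y = cell
    return {(x + dx, y + dy) for dx, dy in [(0, 1), (0, -1), (-1, 0), (1, 0)]
            if 1 <= x + dx <= 4 and 1 <= y + dy <= 4}

def percepts(cell, wumpus, pits, gold, bumped=False, wumpus_killed=False):
    """Percepts in `cell`, given (hypothetical) positions of the Wumpus, pits and gold."""
    p = set()
    if cell in adjacent(wumpus):
        p.add("stench")
    if any(cell in adjacent(pit) for pit in pits):
        p.add("breeze")
    if cell == gold:
        p.add("glitter")
    if bumped:
        p.add("bump")
    if wumpus_killed:
        p.add("scream")
    return p or {"none"}
```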
The game is complex enough to warrant structured state representations and can easily be extended
to include uncertainty and non-determinism later.
As our focus is on inference processes here, let us see how a human player would reason when
entering the Wumpus world. This can serve as a model for designing our artificial agents.
(1) Initial state (2) One step to right (3) Back, and up to [1,2]
Let us now look into what kind of agent we would need to be successful in the Wumpus world:
it seems reasonable that we should build on a model-based agent and specialize it to structured
state representations and inference.
[Figure: model-based agent schema: the agent is connected to the environment through sensors and actuators and maintains a state, a model of how the world evolves, and a model of what its actions do.]
The formal language of the logical system acts as a world description language.
Agent function:
function KB−AGENT(percept) returns an action
  persistent: KB, a knowledge base
              t, a counter, initially 0, indicating time
  TELL(KB, MAKE−PERCEPT−SENTENCE(percept,t))
  action := ASK(KB, MAKE−ACTION−QUERY(t))
  TELL(KB, MAKE−ACTION−SENTENCE(action,t))
  t := t+1
  return action
Its agent function maintains a knowledge base about the environment, which is
updated with percept descriptions (formalizations of the percepts) and action de-
scriptions. The next action is the result of a suitable inference-based query to the
knowledge base.
It is critical to understand that while PL0 as a logical system is given once and for all, the agent
designer still has to formalize the situation (here the Wumpus world) in the world description
language (here PL0; but we will look at more expressive logical systems below). This formalization is the seed of the knowledge base, which the logic-based agent can then extend via its percept and action descriptions, and which also forms the basis of its inferences. We will look at this aspect now.
Syntax: Atomic propositions that can be either true or false, connected by “and,
or, and not”.
Semantics: Assign value to every proposition, evaluate connectives.
Applications: Despite its simplicity, widely applied!
Killing a Wumpus: How can we use all this to figure out where the Wumpus is?
Coming back to our introductory example.
We will now develop the formal theory behind the ideas previewed in the last section and use
that as a prototype for the theory of the more expressive logical systems still to come in AI-1. As
PL0 is a very simple logical system, we could cut some corners in the exposition but we will stick
closer to a generalizable theory.
Definition 10.2.1 (Syntax). The formulae of propositional logic (write PL0 ) are
made up from
propositional variables: V0 := {P , Q, R, P 1 , P 2 , . . .} (countably infinite)
A propositional signature: constants/constructors called connectives: Σ0 :=
{T , F , ¬, ∨, ∧, ⇒, ⇔, . . .}
Propositional logic is a very old and widely used logical system. So it should not be surprising
that there are other notations for the connectives than the ones we are using in AI-1. These notations will not be used in AI-1, but sometimes appear in the literature.
The semantics of PL0 is defined relative to a model, which consists of a universe of discourse and
an interpretation function that we specify now.
Warning: For the official semantics of PL0 we will separate the tasks of giving
meaning to connectives and propositional variables to different mappings.
This will generalize better to other logical systems. (and thus applications)
Definition 10.2.4. A model M := ⟨Do , I⟩ for propositional logic consists of
We call a constant a logical constant, iff its value is fixed by the interpretation.
Treat the other connectives as abbreviations, e.g. A ∨ B= b ¬(¬A ∧ ¬B) and
A ⇒ B= b ¬A ∨ B, and T =b P ∨ ¬P (only need to treat ¬, ∧ directly)
Note: PL0 is a single-model logical system with canonical model ⟨Do , I⟩.
We have a problem in the exposition of the theory here: As PL0 semantics only has a single,
canonical model, we could simplify the exposition by just not mentioning the universe and inter-
pretation function. But we choose to expose both of them in the construction, since other versions of propositional logic – in particular the system PLnq below – have a choice of models, as they use a different distribution of the representation among constants and variables.
In particular, an interpretation-less exposition of propositional logic would have elided the homomorphic construction of the value function and could have simplified the recursive cases in ?? to Iφ(A ∧ B) = T, iff Iφ(A) = T = Iφ(B).
But the homomorphic construction via I(∧) is standard to definitions in other logical systems
and thus generalizes better.
Computing Semantics
Example 10.2.8. Let φ := [T/P 1 ], [F/P 2 ], [T/P 3 ], [F/P 4 ], . . . then
I φ (P 1 ∨ P 2 ∨ ¬(¬P 1 ∧ P 2 ) ∨ P 3 ∧ P 4 )
= I(∨)(I φ (P 1 ∨ P 2 ), I φ (¬(¬P 1 ∧ P 2 ) ∨ P 3 ∧ P 4 ))
= I(∨)(I(∨)(I φ (P 1 ), I φ (P 2 )), I(∨)(I φ (¬(¬P 1 ∧ P 2 )), I φ (P 3 ∧ P 4 )))
= I(∨)(I(∨)(φ(P 1 ), φ(P 2 )), I(∨)(I(¬)(I φ (¬P 1 ∧ P 2 )), I(∧)(I φ (P 3 ), I φ (P 4 ))))
= I(∨)(I(∨)(T, F), I(∨)(I(¬)(I(∧)(I φ (¬P 1 ), I φ (P 2 ))), I(∧)(φ(P 3 ), φ(P 4 ))))
= I(∨)(T, I(∨)(I(¬)(I(∧)(I(¬)(I φ (P 1 )), φ(P 2 ))), I(∧)(T, F)))
= I(∨)(T, I(∨)(I(¬)(I(∧)(I(¬)(φ(P 1 )), F)), F))
= I(∨)(T, I(∨)(I(¬)(I(∧)(I(¬)(T), F)), F))
= I(∨)(T, I(∨)(I(¬)(I(∧)(F, F)), F))
= I(∨)(T, I(∨)(I(¬)(F), F))
= I(∨)(T, I(∨)(T, F))
= I(∨)(T, T)
= T
What a mess!
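The same computation can be done mechanically; here is a small Python sketch of the recursive value function (the nested-tuple representation of formulae is an assumption of the sketch, and ∨ is treated directly rather than via the abbreviation):

```python
def value(formula, phi):
    """Recursive evaluation I_phi of a PL0 formula.
    Formulae are nested tuples: ("var", "P1"), ("not", A), ("and", A, B), ("or", A, B);
    `phi` maps propositional variable names to True/False."""
    op = formula[0]
    if op == "var":
        return phi[formula[1]]
    if op == "not":
        return not value(formula[1], phi)
    if op == "and":
        return value(formula[1], phi) and value(formula[2], phi)
    if op == "or":
        return value(formula[1], phi) or value(formula[2], phi)
    raise ValueError(f"unknown connective {op}")

# The example from above: P1 ∨ P2 ∨ ¬(¬P1 ∧ P2) ∨ (P3 ∧ P4) under
# phi = [T/P1], [F/P2], [T/P3], [F/P4].
phi = {"P1": True, "P2": False, "P3": True, "P4": False}
f = ("or", ("or", ("var", "P1"), ("var", "P2")),
           ("or", ("not", ("and", ("not", ("var", "P1")), ("var", "P2"))),
                  ("and", ("var", "P3"), ("var", "P4"))))
print(value(f, phi))  # True
```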
Now we will also review some propositional identities that will be useful later on. Some of them we
have already seen, and some are new. All of them can be proven by simple truth table arguments.
228 CHAPTER 10. PROPOSITIONAL LOGIC & REASONING, PART I: PRINCIPLES
Propositional Identities
Definition 10.2.9. We have the following identities in propositional logic:
We will now use the distribution of values of a propositional formula under all variable assignments
to characterize them semantically. The intuition here is that we want to understand theorems,
examples, counterexamples, and inconsistencies in mathematics and everyday reasoning1 .
The idea is to use the formal language of propositional formulae as a model for mathematical
language. Of course, we cannot express all of mathematics as propositional formulae, but we can
at least study the interplay of mathematical statements (which can be true or false) with the
copula “and”, “or” and “not”.
1 Here (and elsewhere) we will use mathematics (and the language of mathematics) as a test tube for under-
standing reasoning, since mathematics has a long history of studying its own reasoning processes and assumptions.
Let us now see how these semantic properties model mathematical practice.
In mathematics we are interested in assertions that are true in all circumstances. In our model
of mathematics, we use variable assignments to stand for “circumstances”. So we are interested
in propositional formulae which are true under all variable assignments; we call them valid. We
often give examples (or show situations) which make a conjectured formula false; we call such
examples counterexamples, and such assertions falsifiable. We also often give examples for certain
formulae to show that they can indeed be made true (which is not the same as being valid yet);
such assertions we call satisfiable. Finally, if a formula cannot be made true in any circumstances
we call it unsatisfiable; such assertions naturally arise in mathematical practice in the form of
refutation proofs, where we show that an assertion (usually the negation of the theorem we want
to prove) leads to an obviously unsatisfiable conclusion, showing that the negation of the theorem
is unsatisfiable, and thus the theorem valid.
Let us finally test our intuitions about propositional logic with a “real-world example”: a logic
puzzle, as you could find it in a Sunday edition of the local newspaper.
Answer: You can solve this using PL0 , if we accept bla(S), etc. as propositional variables.
We first express what we know: For every x ∈ {S, N , J} (Stefan, Nicole, Jochen) we have
3. 1. together with 2.2a entails that ai(x) ⇒ bla(x) for every x ∈ {S, N , J},
4. thus ¬bla(S) ∧ ¬bla(J) by 2.2c and 2.2b and
5. so ¬ai(S) ∧ ¬ai(J) by 3. and 4.
6. With 2. the latter entails ai(N ).
The example shows that puzzles like that are a bit difficult to solve without writing things down.
But if we formalize the situation in PL0 , then we can solve the puzzle quite handily with inference.
Note that we have been a bit generous with the names of propositional variables; e.g. bla(x),
where x ∈ {S, N , J}, to keep the representation small enough to fit on the slide. This does not
hinder the method in any way.
[Figure: model-based agent schema (as above).]
The formal language of the logical system acts as a world description language.
Agent function:
function KB−AGENT(percept) returns an action
  persistent: KB, a knowledge base
              t, a counter, initially 0, indicating time
  TELL(KB, MAKE−PERCEPT−SENTENCE(percept,t))
  action := ASK(KB, MAKE−ACTION−QUERY(t))
  TELL(KB, MAKE−ACTION−SENTENCE(action,t))
  t := t+1
  return action
Its agent function maintains a knowledge base about the environment, which is updated with percept descriptions (formalizations of the percepts) and action descriptions. The next action is the result of a suitable inference-based query to the knowledge base.
K: P ⇒ Q ⇒ P        S: (P ⇒ Q ⇒ R) ⇒ (P ⇒ Q) ⇒ P ⇒ R
MP: from A ⇒ B and A, derive B        Subst: from A, derive [B/X](A)
This is indeed a very simple formal system, but it has all the required parts:
• A formal language: expressions built up from variables and implications.
• A semantics: given by the obvious interpretation function
• A calculus: given by the two axioms and the two inference rules.
The calculus gives us a set of rules with which we can derive new formulae from old ones. The
axioms are very simple rules, they allow us to derive these two formulae in any situation. The
proper inference rules are slightly more complicated: we read the formulae above the horizontal
line as assumptions and the (single) formula below as the conclusion. An inference rule allows us
to derive the conclusion, if we have already derived the assumptions.
Now, we can use these inference rules to perform a proof – a sequence of formulae that can be
derived from each other. The representation of the proof in the slide is slightly compactified to fit
onto the slide: We will make it more explicit here. We first start out by deriving the formula
(P ⇒ Q ⇒ R) ⇒ (P ⇒ Q) ⇒ P ⇒ R (10.1)
which we can always do, since we have an axiom for this formula, then we apply the rule Subst,
where A is this result, B is C, and X is the variable P to obtain
(C ⇒ Q ⇒ R) ⇒ (C ⇒ Q) ⇒ C ⇒ R (10.2)
Next we apply the rule Subst to this where B is C ⇒ C and X is the variable Q this time to obtain
(C ⇒ (C ⇒ C) ⇒ R) ⇒ (C ⇒ C ⇒ C) ⇒ C ⇒ R (10.3)
And again we apply the rule Subst; this time B is C and X is the variable R, yielding the first formula in our proof on the slide. To conserve space, we have combined these three steps into one
in the slide. The next steps are done in exactly the same way.
In general, formulae can be used to represent facts about the world as propositions; they have a
semantics that is a mapping of formulae into the real world (propositions are mapped to truth
values.) We have seen two relations on formulae: the entailment relation and the derivation
relation. The first one is defined purely in terms of the semantics, the second one is given by a
calculus, i.e. purely syntactically. Is there any relation between these relations?
Goal: Find calculi C, such that ⊢C A iff ⊨ A (provability and validity coincide)
To TRUTH through PROOF (CALCULEMUS [Leibniz ∼1680])
Ideally, both relations would be the same, then the calculus would allow us to infer all facts that
can be represented in the given formal language and that are true in the real world, and only
those. In other words, our representation and inference is faithful to the world.
A consequence of this is that we can rely on purely syntactical means to make predictions
about the world. Computers rely on formal representations of the world; if we want to solve a
problem on our computer, we first represent it in the computer (as data structures, which can be
seen as a formal language) and do syntactic manipulations on these structures (a form of calculus).
Now, if the provability relation induced by the calculus and the validity relation coincide (this will
be quite difficult to establish in general), then the solutions of the program will be correct, and we
will find all possible ones. Of course, the logics we have studied so far are very simple, and not
able to express interesting facts about the world, but we will study them as a simple example of
the fundamental problem of computer science: How do the formal representations correlate with
the real world?
Within the world of logics, one can derive new propositions (the conclusions, here: Socrates is
mortal) from given ones (the premises, here: Every human is mortal and Socrates is human). Such
derivations are proofs.
In particular, logics can describe the internal structure of real-life facts; e.g. individual things,
actions, properties. A famous example, which is in fact as old as it appears, is illustrated in the
slide below.
If a formal system is correct, the conclusions one can prove are true (= hold in the real world)
whenever the premises are true. This is a miraculous fact (think about it!)
We will now introduce the “natural deduction” calculus for propositional logic. The calculus
was created to model the natural mode of reasoning e.g. in everyday mathematical practice. In
particular, it was intended as a counter-approach to the well-known Hilbert style calculi, which
were mainly used as theoretical devices for studying reasoning in principle, not for modeling
particular reasoning styles. We will introduce natural deduction in two styles/notations, both of
which were invented by Gerhard Gentzen in the 1930s and are very much related. The natural
deduction style (ND) uses local hypotheses in proofs for hypothetical reasoning, while the "sequent
style" is a rationalized version and extension of the ND calculus that makes certain meta-proofs
simpler to push through by making the context of local hypotheses explicit in the notation. The
sequent notation also constitutes a more adequate data structure for implementations and user
interfaces.
Rather than using a minimal set of inference rules, we introduce a natural deduction calculus that
provides two/three inference rules for every logical constant, one “introduction rule” (an inference
rule that derives a formula with that logical constant at the head) and one “elimination rule” (an
inference rule that acts on a formula with this head and derives a set of subformulae).
Definition 10.4.1. The propositional natural deduction calculus ND0 has inference
rules for the introduction and elimination of connectives:
ND0⇒I¹: from a derivation of B that uses the local hypothesis [A]¹, infer A ⇒ B, discharging the hypothesis.
ND0⇒E: from A ⇒ B and A, infer B.
The most characteristic rule in the natural deduction calculus is the ND0⇒I rule and the hypothetical
reasoning it introduces. ND0⇒I corresponds to the mathematical way of proving an
implication A ⇒ B: we assume that A is true and show B from this local hypothesis. When we
can do this, we discharge the assumption and conclude A ⇒ B.
Note that the local hypothesis is discharged by the rule ND0⇒I, i.e. it cannot be used in any
other part of the proof. As ND0⇒I rules may be nested, we decorate both the rule and the
corresponding local hypothesis with a marker (here the number 1).
Let us now consider an example of hypothetical reasoning in action.
Left: from the local hypothesis [A ∧ B]¹ we obtain B by ND0∧Er and A by ND0∧El; ND0∧I combines
them into B ∧ A, and ND0⇒I¹ discharges the hypothesis to conclude A ∧ B ⇒ B ∧ A.
Right: from the local hypotheses [A]¹ and [B]², ND0⇒I² discharges [B]² to give B ⇒ A, and
ND0⇒I¹ discharges [A]¹ to conclude A ⇒ B ⇒ A.
Here we see hypothetical reasoning with local hypotheses at work. In the left example, we
assume the formula A ∧ B and can use it in the proof until it is discharged by the rule ND0⇒I¹ at
the bottom – therefore we decorate the hypothesis and the rule with corresponding markers (here
the label "1"). Note that the local hypothesis A ∧ B is local to the proof fragment delineated by the
corresponding (local) hypothesis and the discharging rule, i.e. even if this derivation is only a
fragment of a larger proof, then we cannot use its (local) hypothesis anywhere else.
Note also that we can use as many copies of the local hypothesis as we need; they are all
discharged at the same time.
In the right example we see that local hypotheses can be nested as long as they are kept local.
In particular, we may not use the hypothesis B after it has been discharged by ND0⇒I², e.g. to
continue with an ND0⇒E.
One of the nice things about the natural deduction calculus is that the deduction theorem is
almost trivial to prove. In a sense, the triviality of the deduction theorem is the central idea of
the calculus and the feature that makes it so natural.
Another characteristic of the natural deduction calculus is that it has inference rules (introduction
and elimination rules) for all connectives. So we extend the set of rules from ?? for disjunction,
negation and falsity.
Definition 10.4.5. ND0 has the following additional inference rules for the remaining connectives:
ND0∨Il: from A, infer A ∨ B.   ND0∨Ir: from B, infer A ∨ B.
ND0∨E¹: from A ∨ B, a derivation of C from the local hypothesis [A]¹, and a derivation of C from
the local hypothesis [B]¹, infer C, discharging both hypotheses.
ND0¬I¹: from derivations of C and ¬C from the local hypothesis [A]¹, infer ¬A, discharging the
hypothesis.
ND0¬E: from ¬¬A, infer A.
ND0FI: from ¬A and A, infer F.   ND0FE: from F, infer A.
Definition 10.4.9. The following inference rules make up the propositional sequent
style natural deduction calculus ND⊢0 :
ND⊢0 Ax: Γ, A ⊢ A.
ND⊢0 weaken: from Γ ⊢ B, infer Γ, A ⊢ B.
ND⊢0 TND: Γ ⊢ A ∨ ¬A.
ND⊢0¬I: from Γ, A ⊢ F, infer Γ ⊢ ¬A.
ND⊢0¬E: from Γ ⊢ ¬¬A, infer Γ ⊢ A.
Example: the two proofs from above in sequent style. Starting from the axiom A ∧ B ⊢ A ∧ B
(ND⊢0 Ax), the rules ND⊢0∧Er and ND⊢0∧El give A ∧ B ⊢ B and A ∧ B ⊢ A; ND⊢0∧I combines these
into A ∧ B ⊢ B ∧ A, and ND⊢0⇒I yields ⊢ A ∧ B ⇒ B ∧ A. Similarly, from the axiom A, B ⊢ A,
ND⊢0⇒I gives A ⊢ B ⇒ A, and another ND⊢0⇒I yields ⊢ A ⇒ B ⇒ A.
Each row in the table represents one inference step in the proof. It consists of a line number (for
referencing), a formula for the statement, a justification via an ND inference rule (and the rows this
one is derived from), and finally a sequence of row numbers of proof steps that are local hypotheses
in effect for the current row.
This is essentially the same as PL0 , so we can reuse the calculi. (up next)
Idea: Re-use PL0, but replace propositional variables with something more expressive!
(instead of the fancy variable name trick)
Definition 10.5.1. A first-order signature ⟨Σf, Σp⟩ consists of
Σf := ⋃k∈N Σfk of function constants, where members of Σfk denote k-ary functions on individuals,
Σp := ⋃k∈N Σpk of predicate constants, where members of Σpk denote k-ary relations among individuals,
where Σfk and Σpk are pairwise disjoint, countable sets of symbols for each k ∈ N.
A 0-ary function constant refers to a single individual, therefore we call it an individual
constant.
PLnq Semantics
Definition 10.5.3. Domains: D0 = {T, F} of truth values and Dι ≠ ∅ of individuals.
Definition 10.5.4. An interpretation I assigns values to constants, e.g.
I(¬): D0 → D0 with T ↦ F, F ↦ T, and I(∧) = . . . (as in PL0)
I: Σf0 → Dι (interpret individual constants as individuals)
I: Σfk → Dι^k → Dι (interpret function constants as functions)
I: Σpk → P(Dι^k) (interpret predicate constants as relations)
Definition 10.5.5. The value function I assigns values to formulae: (recursively)
All of the definitions above are quite abstract, we now look at them again using a very concrete –
if somewhat contrived – example: The relevant parts are a universe D with four elements, and an
interpretation that maps the signature into individuals, functions, and predicates over D, which
are given as concrete sets.
The example above also shows how we can compute meaning in a concrete model: we just
follow the evaluation rules to the letter.
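Since the concrete model itself is not reproduced in these notes, here is a minimal Python sketch of the same idea – evaluating PLnq formulae in a small, made-up model by following the evaluation rules; the universe, interpretation, and formula are illustrative assumptions, not the example from the slides.

# Minimal sketch: evaluating PLnq formulae in a concrete model (made-up example).
# Terms: ("f", t1, ..., tn) is a function constant applied to terms (0-ary = individual constant).
# Formulae: ("p", t1, ..., tn) is an atom, ("not", A) and ("and", A, B) are the connectives.

universe = {"a", "b", "c", "d"}                              # four individuals
interp_fun = {
    "j": lambda: "a",                                        # an individual constant
    "mother_of": lambda x: {"a": "b", "b": "c", "c": "d", "d": "a"}[x],
}
interp_pred = {
    "loves": {("a", "b"), ("b", "c")},                       # a binary relation as a set of pairs
}

def eval_term(t):
    """Evaluate a term to an element of the universe."""
    name, *args = t
    return interp_fun[name](*[eval_term(a) for a in args])

def eval_formula(A):
    """The value function: evaluate a PLnq formula to True/False recursively."""
    op, *args = A
    if op == "not":
        return not eval_formula(args[0])
    if op == "and":
        return eval_formula(args[0]) and eval_formula(args[1])
    return tuple(eval_term(t) for t in args) in interp_pred[op]   # atomic case

# loves(j, mother_of(j)) evaluates to True in this model.
print(eval_formula(("loves", ("j",), ("mother_of", ("j",)))))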
We now come to the central technical result about PLnq: it is essentially the same as propositional
logic (PL0). We say that the two logics are isomorphic. Technically, this means that the formulae
of PLnq can be translated to PL0 and there is a corresponding model translation from the models
of PL0 to those of PLnq such that the respective notions of evaluation are mapped to each other.
Corollary 10.5.12. PLnq is isomorphic to PL0, i.e. the following diagram commutes: the formula
translation θΣ maps PLnq(Σ) to PL0(AΣ), the model translation ψ ↦ Mψ maps PLnq models
⟨Dψ, Iψ⟩ to variable assignments VΣ → {T, F}, and evaluation with Iψ(·) on the PLnq side agrees
with evaluation with IφMψ(·) on the PL0 side.
The practical upshot of the commutative diagram from ?? is that if we have a way of computing
evaluation (or entailment for that matter) in PL0 , then we can “borrow” it for PLnq by composing
it with the language and model translations. In other words, we can reuse calculi and automated
3.2. If A = ¬B, then Iψ(A) = T, iff Iψ(B) = F, iff IφMψ(θΣ(B)) = F, iff IφMψ(θΣ(A)) = T.
3.3. If A = B ∧ C, then we argue similarly.
4. Hence Iψ(A) = IφMψ(θΣ(A)) for all PLnq formulae, and we have concluded the proof.
10.6 Conclusion
A Video Nugget covering this section can be found at https://fau.tv/clip/id/25027.
Summary
Sometimes, it pays off to think before acting.
Propositional logic formulae are built from atomic propositions, with the connectives
and, or, not.
Suggested Reading:
We will now take a more abstract view and introduce the necessary prerequisites of abstract rule
systems. We will also take the opportunity to discuss the quality criteria for calculi.
The notion of a logical system is at the basis of the field of logic. In its most abstract form, a logical
system consists of a formal language, a class of models, and a satisfaction relation between models
and expressions of the formal language. The satisfaction relation tells us when an expression is
deemed true in this model.
Logical Systems
Definition 11.0.1. A logical system (or simply a logic) is a triple L := ⟨L, K, ⊨⟩,
where the language L is a formal language, the model class K is a set, and ⊨ ⊆ K×L.
Members of L are called formulae of L, members of K models for L, and ⊨ the
satisfaction relation.
Example 11.0.2 (Propositional Logic). ⟨wff(ΣPL0, VPL0), K0, ⊨⟩ is a logical
system, if we define K0 := V0 ⇀ D0 (the set of variable assignments) and φ ⊨ A
iff Iφ(A) = T.
Let us now turn to the syntactical counterpart of the entailment relation: derivability in a cal-
culus. Again, we take care to define the concepts at the general level of logical systems.
The intuition of a calculus is that it provides a set of syntactic rules that allow us to reason by
considering the form of propositions alone. Such rules are called inference rules, and they can be
strung together into derivations — which can alternatively be viewed either as sequences of formulae
where all formulae are justified by prior formulae or as trees of inference rule applications. But we
can also define a calculus in the more general setting of logical systems as an arbitrary relation on
formulae with some general properties. That allows us to abstract away from the homomorphic
setup of logics and calculi and concentrate on the basics.
H ⊢ A, if A ∈ H (⊢ is proof reflexive),
H ⊢ A and (H′ ∪ {A}) ⊢ B imply (H ∪ H′ ) ⊢ B (⊢ is proof transitive),
H ⊢ A and H ⊆ H′ imply H′ ⊢ A (⊢ is monotonic or admits weakening).
Definition 11.0.5. Let L be a formal language, then an inference rule over L is a
decidable (n+1)-ary relation on L. Inference rules are traditionally written as

A1 … An
———————— N
   C

where A1, …, An and C are schemata for words in L and N is a name. The Ai
are called assumptions of N, and C is called its conclusion.
Any (n+1)-tuple (a1, …, an, c) in N is called an application of N and we say that we apply N to
a set M of words with a1, …, an ∈ M to obtain c.
Definition 11.0.6. An inference rule without assumptions is called an axiom.
Definition 11.0.7. A calculus (or inference system) is a formal language L equipped
with a set C of inference rules over L.
By formula schemata we mean representations of sets of formulae; we use boldface uppercase
letters as (meta-)variables for formulae. For instance, the formula schema A ⇒ B represents the set
of formulae whose head is ⇒.
Derivations
Definition 11.0.8. Let L := ⟨L, K, ⊨⟩ be a logical system and C a calculus for L,
then a C-derivation of a formula C ∈ L from a set H ⊆ L of hypotheses (write
H ⊢C C) is a sequence A1, …, Am of L-formulae, such that
Am = C, (derivation culminates in C)
for all 1 ≤ i ≤ m, either Ai ∈ H, (hypothesis)
or there is an inference rule Al1 … Alk / Ai in C with lj < i for all j ≤ k. (rule application)
We can also see a derivation as a derivation tree, where the Alj are the children of
the node Ai.
Example 11.0.9. In the propositional Hilbert calculus H0 we have the derivation
P ⊢H0 Q ⇒ P: the sequence is P ⇒ Q ⇒ P, P, Q ⇒ P, where the first formula is an instance of
the axiom K and the third is obtained from the first two by MP; the corresponding derivation tree
has Q ⇒ P at the root with children P ⇒ Q ⇒ P and P.
Inference rules are relations on formulae represented by formula schemata (where boldface, uppercase
letters are used as metavariables for formulae). For instance, in ?? the inference rule
A ⇒ B, A / B (MP) was applied in a situation where the metavariables A and B were instantiated by the
formulae P and Q ⇒ P.
As axioms do not have assumptions, they can be added to a derivation at any time. This is just
what we did with the axioms in ??.
Formal Systems
Let ⟨L, K, ⊨⟩ be a logical system and C a calculus for it, then ⊢C is a derivation relation
and thus ⟨L, K, ⊨, ⊢C⟩ a derivation system. Therefore we will sometimes also call ⟨L, C, K, ⊨⟩ a
formal system.
Observation 11.0.14. Derivable inference rules are admissible, but not the other
way around.
The notion of a formal system encapsulates the most general way we can conceptualize a logical
system with a calculus, i.e. a system in which we can do “formal reasoning”.
Chapter 12
Recall: Our knowledge of the cave entails a definite Wumpus position!(slide 316)
Problem: That was human reasoning, can we build an agent function that does
this?
Answer: As for constraint networks, we use inference, here resolution/tableaux.
Unsatisfiability Theorem
Theorem 12.1.1 (Unsatisfiability Theorem). H ⊨ A iff H ∪ {¬A} is unsatisfi-
able.
Idea: Turn the search around – using the unsatisfiability theorem (??).
Definition 12.1.5. For a given conjecture A and hypotheses H, a test calculus T
tries to derive a refutation H, ¬A ⊢T ⊥ instead of H ⊢ A, where ¬A is unsatisfiable iff
A is valid and ⊥ is an "obviously" unsatisfiable formula.
Observation: A test calculus T induces a search problem where the initial state is
H ∪ {¬A} and a state S is a goal state iff ⊥ ∈ S. (proximity of ⊥ is easier for heuristics)
Searching for ⊥ admits simple heuristics, e.g. size reduction. (⊥ is minimal)
The idea about literals is that they are atoms (the simplest formulae) that carry around their
intended truth value.
Normal Forms
There are two quintessential normal forms for propositional formulae: (there are
others as well)
Definition 12.1.12. A formula is in conjunctive normal form (CNF) if it is T or a
conjunction of disjunctions of literals, i.e. if it is of the form ⋀_{i=1}^{n} ⋁_{j=1}^{m_i} l_{i,j}.
Dually, a formula is in disjunctive normal form (DNF) if it is F or a disjunction of conjunctions of literals.
Observation 12.1.14. Every formula has equivalent formulae in CNF and DNF.
Algorithm: Fully expand all possible tableaux. (until no rule can be applied)
Satisfiable, iff there are open branches. (they correspond to models)
Tableau calculi develop a formula in a tree-shaped arrangement that represents a case analysis
on when a formula can be made true (or false). Therefore the formulae are decorated with upper
indices that hold the intended truth value.
On the left we have a refutation tableau that analyzes a negated formula (it is decorated with the
intended truth value F). Both branches contain an elementary contradiction ⊥.
On the right we have a model generation tableau, which analyzes a positive formula (it is
decorated with the intended truth value T). This tableau uses the same rules as the refutation
tableau, but makes a case analysis of when this formula can be satisfied. In this case we have a
closed branch and an open one. The latter corresponds to a model.
Now that we have seen the examples, we can write down the tableau rules formally.
Definition 12.2.2. The propositional tableau calculus T0 has two inference rules
per connective (one for each possible label)
T0∧: from (A ∧ B)^T, add A^T and B^T to the branch.
T0∨: from (A ∧ B)^F, split the branch into one extended with A^F and one extended with B^F.
T0¬T: from ¬A^T, add A^F.   T0¬F: from ¬A^F, add A^T.
T0⊥: from A^α and A^β with α ≠ β, add ⊥ (the branch is closed).
Definition 12.2.4. Call a tableau saturated, iff no rule adds new material and a
branch closed, iff it ends in ⊥, else open. A tableau is closed, iff all of its branches
are.
In analogy to the ⊥ at the end of closed branches, we sometimes decorate open
branches with a 2 symbol.
These inference rules act on tableaux and have to be read as follows: if the formulae over the line
appear in a tableau branch, then the branch can be extended by the formulae or branches below
the line. There are two rules for each primary connective, and a branch closing rule that adds the
special symbol ⊥ (for unsatisfiability) to a branch.
We use the tableau rules with the convention that they are only applied, if they contribute new
material to the branch. This ensures termination of the tableau procedure for propositional logic
(every rule eliminates one primary connective).
Definition 12.2.5. We will call a closed tableau with the labeled formula Aα at the root a
tableau refutation for Aα .
The saturated tableau represents a full case analysis of what is necessary to give A the truth
value α; since all branches are closed (contain contradictions) this is impossible.
Definition 12.2.7. We will call a tableau refutation for AF a tableau proof for A, since it refutes
the possibility of finding a model where A evaluates to F. Thus A must evaluate to T in all
models, which is just our definition of validity.
Thus the tableau procedure can be used as a calculus for propositional logic. In contrast to the
propositional Hilbert calculus it does not prove a theorem A by deriving it from a set of axioms,
but it proves it by refuting its negation. Such calculi are called negative or test calculi. Generally
negative calculi have computational advantages over positive ones, since they have a built-in sense
of direction.
We have rules for all the necessary connectives (we restrict ourselves to ∧ and ¬, since the others
can be expressed in terms of these two via the propositional identities above. For instance, we can
write A ∨ B as ¬(¬A ∧ ¬B), and A ⇒ B as ¬A ∨ B,. . . .)
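To see how the T0 rules, the saturation convention, and the notion of a tableau proof interact, here is a minimal Python sketch of a validity test for the ∧/¬ fragment; the formula encoding and function names are assumptions made for this note, not part of the course materials.

# Minimal sketch of the T0 tableau procedure for the connectives ∧ and ¬.
# Formulae: a string is an atom, ("and", A, B) a conjunction, ("not", A) a negation.
# A branch is a list of labeled formulae (formula, label) with label True/False.

def branch_closed(branch):
    """T0⊥: a branch closes if it contains A^T and A^F for the same formula A."""
    return any((f, not v) in branch for (f, v) in branch)

def expand(branch):
    """Apply one rule to a formula that still contributes new material; None if saturated."""
    for (f, v) in branch:
        if isinstance(f, str):                          # atoms carry no rule
            continue
        if f[0] == "not":                               # T0¬T / T0¬F: flip the label
            new = (f[1], not v)
            if new not in branch:
                return [branch + [new]]
        if f[0] == "and" and v:                         # T0∧: both conjuncts get label T
            new = [(f[1], True), (f[2], True)]
            if not all(n in branch for n in new):
                return [branch + [n for n in new if n not in branch]]
        if f[0] == "and" and not v:                     # T0∨: split the branch
            if (f[1], False) not in branch and (f[2], False) not in branch:
                return [branch + [(f[1], False)], branch + [(f[2], False)]]
    return None                                         # saturated: no rule adds new material

def all_branches_close(branch):
    """Fully expand the tableau below this branch; True iff every branch closes."""
    if branch_closed(branch):
        return True
    children = expand(branch)
    if children is None:                                # saturated and open: a model exists
        return False
    return all(all_branches_close(child) for child in children)

def valid(A):
    """A is valid iff the tableau with root A^F closes (a tableau proof for A)."""
    return all_branches_close([(A, False)])

# ¬(A ∧ ¬A) is valid; A ∧ B is not (the open branch corresponds to a countermodel).
print(valid(("not", ("and", "A", ("not", "A")))))       # True
print(valid(("and", "A", "B")))                         # False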
We now look at a formulation of propositional logic with fancy variable names. Note that
loves(mary, bill) is just a variable name like P or X, which we have used earlier.
Example 12.2.8. "If Mary loves Bill and John loves Mary, then John loves Mary" is valid:
  (loves(mary, bill) ∧ loves(john, mary) ⇒ loves(john, mary))^F
  ¬(¬¬(loves(mary, bill) ∧ loves(john, mary)) ∧ ¬loves(john, mary))^F
  (¬¬(loves(mary, bill) ∧ loves(john, mary)) ∧ ¬loves(john, mary))^T
  ¬¬(loves(mary, bill) ∧ loves(john, mary))^T
  ¬(loves(mary, bill) ∧ loves(john, mary))^F
  (loves(mary, bill) ∧ loves(john, mary))^T
  ¬loves(john, mary)^T
  loves(mary, bill)^T
  loves(john, mary)^T
  loves(john, mary)^F
  ⊥
We could have used the unsatisfiability theorem (??) here to show that If Mary loves Bill and John
loves Mary entails John loves Mary. But there is a better way to show entailment: we directly
use derivability in T0 .
Deriving Entailment in T0
Example 12.2.9. Mary loves Bill and John loves Mary together entail that John
loves Mary
  loves(mary, bill)^T
  loves(john, mary)^T
  loves(john, mary)^F
  ⊥
This is a closed tableau, so {loves(mary, bill), loves(john, mary)}⊢T0 loves(john, mary).
Again, as T0 is sound and complete, we have {loves(mary, bill), loves(john, mary)} ⊨ loves(john, mary).
Note: We can also use the tableau calculus to try and show entailment (and fail). The nice thing
is that from the failed proof we can see what went wrong.
Obviously, the tableau above is saturated, but not closed, so it is not a tableau proof for our initial
entailment conjecture. We have marked the literals on the open branch in green, since they allow us
to read off the conditions of the situation in which the entailment fails to hold. As we intuitively
argued above, this is the situation where Mary loves Bill. In particular, the open branch gives us
a variable assignment (marked in green) that satisfies the initial formula. In this case, Mary loves
Bill, which is a situation where the entailment fails.
Again, the derivability version is much simpler:
We have seen in the examples above that while it is possible to get by with only the connectives
∧ and ¬, it is a bit unnatural and tedious, since we need to eliminate the other connectives first.
In this section, we will make the calculus less frugal by adding rules for the other connectives,
without losing the advantage of dealing with a small calculus, which is good for making statements
about the calculus itself.
The derived rules for the remaining connectives are:
T0⇒T: from (A ⇒ B)^T, split into A^F | B^T.   T0⇒F: from (A ⇒ B)^F, add A^T and B^F.
T0∨T: from (A ∨ B)^T, split into A^T | B^T.   T0∨F: from (A ∨ B)^F, add A^F and B^F.
T0⇔T: from (A ⇔ B)^T, split into A^T, B^T | A^F, B^F.
T0⇔F: from (A ⇔ B)^F, split into A^T, B^F | A^F, B^T.
The first of these is justified by the derivation (A ⇒ B)^T, i.e. (¬A ∨ B)^T, i.e. ¬(¬¬A ∧ ¬B)^T,
which yields (¬¬A ∧ ¬B)^F; this splits into ¬¬A^F (and hence ¬A^T and A^F) and ¬B^F (and hence B^T).
With these derived rules, theorem proving becomes quite efficient; with them, the tableau from
?? takes a considerably simpler form.
Soundness (Tableau)
Idea: A test calculus is refutation sound, iff its inference rules preserve satisfiability
and the goal formulae are unsatisfiable.
Definition 12.2.15. A labeled formula Aα is valid under φ, iff I φ (A) = α.
Proof: by contradiction.
1. Suppose Φ is falsifiable, i.e. not valid.
2. Then the initial tableau is satisfiable. (Φ^F is satisfiable)
3. So T is satisfiable, by ??.
4. Thus there is a satisfiable branch, (by definition)
5. but all branches are closed (T is closed) – a contradiction.
Theorem 12.2.19 (Completeness). T0 is complete, i.e. if Φ ⊆ wff0 (V0 ) is valid,
then there is a closed tableau T for ΦF .
Proof sketch: Proof difficult/interesting; see ??
Thus we only have to prove ??; this is relatively easy to do. For instance for the first rule: if we
have a tableau that contains (A ∧ B)^T and is satisfiable, then it must have a satisfiable branch.
If (A ∧ B)^T is not on this branch, the tableau extension will not change satisfiability, so we can
assume that it is on the satisfiable branch and thus Iφ(A ∧ B) = T for some variable assignment
φ. Thus Iφ(A) = T and Iφ(B) = T, so after the extension (which adds the formulae A^T and B^T
to the branch), the branch is still satisfiable. The cases for the other rules are similar.
The next result is a very important one, it shows that there is a procedure (the tableau procedure)
that will always terminate and answer the question whether a given propositional formula is valid
or not. This is very important, since other logics (like the often-studied first-order logic) do not
enjoy this property.
Note: The proof above only works for the "base T0" because (only) there the rules do not "copy". A
rule like T0⇔T (from (A ⇔ B)^T, split into A^T, B^T | A^F, B^F) does, and in particular the number
of non-worked-off variables below the line is larger than above the line. For such rules, a more
intricate version of µ which – instead of returning a natural number – returns a more complex
object, e.g. a multiset of numbers, would work here. In our proof we are just assuming that the
defined connectives have already been eliminated.
The tableau calculus basically computes the disjunctive normal form: every branch is a disjunct that
is a conjunction of literals. The method relies on the fact that a DNF is unsatisfiable, iff each of its
disjuncts is, i.e. iff each branch contains a contradiction in form of a pair of opposite literals.
We write 2 for the "empty" disjunction (no disjuncts) and call it the empty clause. A clause
with exactly one literal is called a unit clause.
P^T ∨ A    P^F ∨ B
—————————————————— R
      A ∨ B

This rule allows us to add the resolvent (the clause below the line) to a clause set which
contains the two clauses above it. The literals P^T and P^F are called cut literals.
Definition 12.3.3 (Resolution Refutation). Let S be a clause set, then we call
an R0-derivation of 2 from S an R0-refutation and write D: S ⊢R0 2.
Definition 12.3.6. We write CNF0(A^α) for the set of all clauses derivable from
A^α via the rules above.
Note that the C-terms in the definition of the inference rules are necessary, since we assumed that
the assumptions of the inference rule must match full clauses. The C-terms are used with the
convention that they are optional, so that we can also simplify (A ∨ B)^T to A^T ∨ B^T.
Background: The background behind this notation is that A and T ∨ A are equivalent for any
A. That allows us to interpret the C-terms in the assumptions as T and thus leave them out.
The clause normal form translation as we have formulated it here is quite frugal; we have left
out rules for the connectives ∨, ⇒, and ⇔, relying on the fact that formulae containing these
connectives can be translated into ones without before CNF transformation. The advantage of
having a calculus with few inference rules is that we can prove meta properties like soundness and
completeness with less effort (these proofs usually require one case per inference rule). On the
other hand, adding specialized inference rules makes proofs shorter and more readable.
Fortunately, there is a way to have your cake and eat it. Derivable inference rules have the property
that they are formally redundant, since they do not change the expressive power of the calculus.
Therefore we can leave them out when proving meta-properties, but include them when actually
using the calculus.
Example 12.3.8. The derivation
  C ∨ (A ⇒ B)^T  ⟿  C ∨ (¬A ∨ B)^T  ⟿  C ∨ ¬A^T ∨ B^T  ⟿  C ∨ A^F ∨ B^T
justifies the derived rule
  C ∨ (A ⇒ B)^T
  ———————————————
  C ∨ A^F ∨ B^T
With these derivable rules, theorem proving becomes quite efficient. To get a better understanding
of the calculus, we look at an example: we prove an axiom of the Hilbert Calculus we have studied
above.
Result: {P^F ∨ Q^F ∨ R^T, P^F ∨ Q^T, P^T, R^F}
Example 12.3.10. Resolution Proof
1  P^F ∨ Q^F ∨ R^T   initial
2  P^F ∨ Q^T         initial
3  P^T               initial
4  R^F               initial
5  P^F ∨ Q^F         resolve 1.3 with 4.1
6  Q^F               resolve 5.1 with 3.1
7  P^F               resolve 2.2 with 6.1
8  2                 resolve 7.1 with 3.1
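As a usage illustration, here is a minimal Python sketch of a naive saturation-based R0 prover applied to the clause set above; the encoding of literals and clauses is an assumption made for this note.

# Minimal sketch: naive propositional resolution by saturation.
# A literal is a pair (name, label) with label True (P^T) or False (P^F);
# a clause is a frozenset of literals; 2 (the empty clause) is frozenset().

from itertools import combinations

def resolvents(c1, c2):
    """All clauses obtainable from c1 and c2 by one application of the rule R."""
    out = []
    for (name, label) in c1:
        if (name, not label) in c2:                    # cut literals P^T / P^F
            out.append((c1 - {(name, label)}) | (c2 - {(name, not label)}))
    return out

def refutable(clauses):
    """Saturate the clause set; True iff the empty clause 2 is derivable."""
    clauses = set(clauses)
    while True:
        new = set()
        for c1, c2 in combinations(clauses, 2):
            for r in resolvents(c1, c2):
                if r == frozenset():
                    return True
                if r not in clauses:
                    new.add(r)
        if not new:                                    # saturated without deriving 2
            return False
        clauses |= new

# The clause set from the resolution proof above:
S = [frozenset({("P", False), ("Q", False), ("R", True)}),
     frozenset({("P", False), ("Q", True)}),
     frozenset({("P", True)}),
     frozenset({("R", False)})]
print(refutable(S))   # True: the empty clause is derivable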
Before we come to the general mechanism, we will go into how we would convince ourselves that
the Wumpus is in [1, 3].
Idea: We formalize the knowledge about the Wumpus world in PL0 and use a test
calculus to check for entailment.
Simplification: We worry only about the Wumpus and stench:
S_i,j =̂ stench in [i, j], W_i,j =̂ Wumpus in [i, j].
Propositions whose value we know: ¬S_1,1, ¬W_1,1, ¬S_2,1, ¬W_2,1, S_1,2, ¬W_1,2.
Knowledge about the Wumpus and smell:
From Cell adjacent to Wumpus: Stench (else: None), we get
The first step is to compute the clause normal form of the relevant knowledge.
Given this clause normal form, we only need to generate the empty clause via repeated applications
of the resolution rule.
We’ve been to (1, 1), and there’s no Wumpus there, so it can’t be (1, 1).
Parents: W 1,1 F and W 2,2 T ∨ W 1,1 T .
Resolvent: W 2,2 T .
There is no stench in (2, 1) so it can’t be (2, 2) either, in contradiction.
Parents: S 2,1 F and S 2,1 T ∨ W 2,2 F .
Resolvent: W 2,2 F .
Parents: W 2,2 F and W 2,2 T .
Resolvent: 2.
Now that we have seen how we can use propositional inference to derive consequences of the
percepts and world knowledge, let us come back to the question of a general mechanism for agent
functions with propositional inference.
Admittedly, the search framework from ?? does not quite cover the agent function we have here,
since that assumes that the world is fully observable, which the Wumpus world is emphatically not.
But it already gives us a good impression of what would be needed for the “general mechanism”.
12.4 Conclusion
Summary
Every propositional formula can be brought into conjunctive normal form (CNF),
which can be identified with a set of clauses.
The tableau and resolution calculi are deduction procedures based on trying to
derive a contradiction from the negated theorem (a closed tableau or the empty
clause). They are refutation complete, and can be used to prove KB ⊨ A by
showing that KB ∪ {¬A} is unsatisfiable.
Excursion: A full analysis of any calculus needs a completeness proof. We will not cover this in
AI-1, but provide one for the calculi introduced so far in ??.
Chapter 13
13.1 Introduction
A Video Nugget covering this section can be found at https://fau.tv/clip/id/25019.
Definition 13.1.2. Tools addressing SAT are commonly referred to as SAT solvers.
Upshot: Anything we can do with CSP, we can (in principle) do with SAT.
2 bits x_1 and x_0; c = 2·x_1 + x_0. (FF =̂ Flip-Flop, D =̂ Data IN, CLK =̂ Clock)
To Verify: If c < 3 in the current clock cycle, then c < 3 in the next clock cycle.
Clauses: y_1^F ∨ x_0^T, y_1^T ∨ x_0^F, y_0^T ∨ x_1^T ∨ x_0^T, y_0^F ∨ x_1^F, y_0^F ∨ x_0^F, x_1^F ∨ x_0^F, y_1^T, y_0^T.
Step 3: Call a SAT solver (up next).
Why Did Unit Propagation Yield a Conflict?: How can we analyze which
mistakes were made in “dead” search branches?
Knowledge is power, see next.
Clause Learning: How can we learn from our mistakes?
One of the key concepts, perhaps the key concept, underlying the success of
SAT.
Phase Transitions – Where the Really Hard Problems Are: Are all formulas
“hard” to solve?
The answer is “no”. And in some cases we can figure out exactly when they
are/aren’t hard to solve.
function DPLL(∆, I) returns a satisfying assignment or ‘‘unsatisfiable’’
   ∆′ := a copy of ∆; I′ := I
   /∗ Unit Propagation (UP) Rule: ∗/
   while ∆′ contains a unit clause {l} do
      extend I′ with the respective truth value for the proposition underlying l
      simplify ∆′ /∗ remove false literals ∗/
   if 2 ∈ ∆′ then return ‘‘unsatisfiable’’
   if ∆′ = {} then return I′
   /∗ Splitting Rule: ∗/
   select some proposition P for which I′ is not defined
   I′′ := I′ extended with one truth value for P; ∆′′ := a copy of ∆′; simplify ∆′′
   if I′′′ := DPLL(∆′′, I′′) ̸= ‘‘unsatisfiable’’ then return I′′′
   I′′ := I′ extended with the other truth value for P; ∆′′ := ∆′; simplify ∆′′
   return DPLL(∆′′, I′′)
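For a concrete (if unoptimized) rendering of the procedure, here is a minimal Python sketch of DPLL with unit propagation and splitting; the clause encoding and helper names are assumptions made for this note, and the clause set of the following example is used as the test input.

# Minimal sketch of DPLL: unit propagation (UP) + splitting on clause sets.
# A literal is (name, value); a clause is a frozenset of literals.

def simplify(clauses, name, value):
    """Assign name := value: drop satisfied clauses, delete false literals."""
    result = []
    for c in clauses:
        if (name, value) in c:                     # clause is satisfied
            continue
        result.append(frozenset(l for l in c if l[0] != name))
    return result

def dpll(clauses, assignment):
    """Return a satisfying assignment (dict) or None (= "unsatisfiable")."""
    clauses, assignment = list(clauses), dict(assignment)
    # UP rule: while there is a unit clause {l}, make l true and simplify.
    unit = next((c for c in clauses if len(c) == 1), None)
    while unit is not None:
        name, value = next(iter(unit))
        assignment[name] = value
        clauses = simplify(clauses, name, value)
        unit = next((c for c in clauses if len(c) == 1), None)
    if frozenset() in clauses:                     # empty clause: conflict
        return None
    if not clauses:                                # empty clause set: satisfied
        return assignment
    # Splitting rule: pick some unassigned proposition and try both truth values.
    name = next(iter(next(iter(clauses))))[0]
    for value in (True, False):
        found = dpll(simplify(clauses, name, value), {**assignment, name: value})
        if found is not None:
            return found
    return None

# The clause set of the following example: P^T ∨ Q^T ∨ R^F; P^F ∨ Q^F; R^T; P^T ∨ Q^F
delta = [frozenset({("P", True), ("Q", True), ("R", False)}),
         frozenset({("P", False), ("Q", False)}),
         frozenset({("R", True)}),
         frozenset({("P", True), ("Q", False)})]
print(dpll(delta, {}))    # e.g. {'R': True, 'P': True, 'Q': False}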
Example 13.2.2 (UP and Splitting). Let ∆ := P^T ∨ Q^T ∨ R^F; P^F ∨ Q^F; R^T; P^T ∨ Q^F.
1. UP Rule: R ↦ T, leaving P^T ∨ Q^T; P^F ∨ Q^F; P^T ∨ Q^F.
2. Splitting Rule:
2a. P ↦ F, leaving Q^T; Q^F.          2b. P ↦ T, leaving Q^F.
3a. UP Rule: Q ↦ T, deriving 2,        3b. UP Rule: Q ↦ F, the clause set becomes empty,
    returning "unsatisfiable".             returning "R ↦ T, P ↦ T, Q ↦ F".

2. UP Rule: Q ↦ T
   P^F; P^T ∨ R^F; R^T
3. UP Rule: R ↦ T
   P^F; P^T
4. UP Rule: P ↦ T
   2
[DPLL search tree over the propositions P, X_1, …, X_n, Q, R: every branch ends in the empty clause 2.]
Properties of DPLL
Unsatisfiable case: What can we say if “unsatisfiable” is returned?
In this case, we know that ∆ is unsatisfiable: Unit propagation is sound, in the
sense that it does not reduce the set of solutions.
DPLL =̂ backtracking with inference, where inference =̂ unit propagation.
Unit propagation is sound: It does not reduce the set of solutions.
Running time is exponential in the worst case; good variable/value selection strategies are required.
UP =̂ Unit Resolution
Observation: The unit propagation (UP) rule corresponds to a calculus:
while ∆′ contains a unit clause {l} do
extend I ′ with the respective truth value for the proposition underlying l
simplify ∆′ /∗ remove false literals ∗/
Definition 13.3.1 (Unit Resolution). Unit resolution (UR) is the test calculus
consisting of the following inference rule:

C ∨ P^α    P^β    (α ≠ β)
————————————————————————— UR
          C
Unit propagation =̂ resolution restricted to cases where one parent is a unit clause.
Observation 13.3.2 (Soundness). UR is refutation sound. (since resolution is)
Observation 13.3.3 (Completeness). UR is not refutation complete (alone).
DPLL (without UP; leaves annotated with the clauses that became empty) and the resolution proof
read off from that DPLL tree. [search tree and resolution derivation not reproduced]
branching over it was completely unnecessary). If so, however, we can simply remove N k and
all its descendants from the tree as well. We attach C(N ) at the L(k−1) branch of N (k−1) |,
in the role of C(N (k−1) , L(k−1) ). If L(k−1) ∈ C(N ) then we have (a) for N ′ := N (k−1) and
can stop. If L(k−1) F ̸∈ C(N ), then we remove N (k−1) and so forth, until either we stop
with (a), or have removed N 1 and thus must already have derived the empty clause (because
C(N ) ⊆ {L1 , . . . , Lk }\{L1 , . . . , Lk }).
8. Unit propagation can be simulated via applications of the splitting rule, choosing a proposi-
tion that is constrained by a unit clause: One of the two truth values then immediately yields
an empty clause.
Definition 13.3.9. In a tree resolution, each derived clause C is used only once
(at its parent).
Problem: The same C must be derived anew every time it is used!
This is a fundamental weakness: There are inputs ∆ whose shortest tree reso-
lution proof is exponentially longer than their shortest (general) resolution proof.
Intuitively: DPLL makes the same mistakes over and over again.
Idea: DPLL should learn from its mistakes on one search branch, and apply the
learned knowledge to other branches.
Excursion: Practical SAT solvers use a technique called CDCL that analyzes failure and learns
from that in terms of inferred clauses. Unfortunately, we cannot cover this in AI-1; see ??.
13.4 Conclusion
A Video Nugget covering this section can be found at https://fau.tv/clip/id/25090.
Summary
SAT solvers decide satisfiability of CNF formulas. This can be used for deduction,
and is highly successful as a general problem solving technique (e.g., in verification).
DPLL = b backtracking with inference performed by unit propagation (UP), which
iteratively instantiates unit clauses and simplifies the formula.
DPLL proofs of unsatisfiability correspond to a restricted form of resolution. The
restriction forces DPLL to "make the same mistakes over and over again".
Implication graphs capture how UP derives conflicts. Their analysis enables us to
do clause learning. DPLL with clause learning is called CDCL. It corresponds to full
resolution, not tree resolution.
Local search is not as successful in SAT applications, and the underlying ideas are
very similar to those presented in ??. (not covered here)
Proof complexity: Can one resolution special case X simulate another one Y
polynomially? Or is there an exponential separation (example families where X is
exponentially less efficient than Y )?
Suggested Reading:
Let’s
Let’s Talk
Talk About
AboutBlocks,
Blocks,Baby
Baby. .... .
Question: What do you see here?
I Question: What do you see here?
A D B E C
I You say: “All blocks are red”; “All blocks are on the table”; “A is a block”.
You say: “All blocks are red”; “All blocks are on the table”; “A is a block”.
I And now: Say it in propositional logic!
And now: Say it in propositional logic!
Answer: “isRedA”,“isRedB”, . . . , “onTableA”, “onTableB”, . . . , “isBlockA”, . . .
Wait a sec!: Why don’t we just say, e.g., “AllBlocksAreRed” and “isBlockA”?
Problem: Could we conclude that A is red? (No)
These statements are atomic (just strings); their inner structure (“all blocks”, “is a
block”) is not captured.
Idea: Predicate Logic (PL1 ) extends propositional logic with the ability to explicitly
speak about objects and their properties.
How?: Variables ranging over objects, predicates describing object properties, . . .
Example 14.1.1. “∀x.block(x) ⇒ red(x)”; “block(A)”
Note: Even when we can describe the problem suitably, for the desired reasoning,
the propositional formulation typically is way too large to write (by hand).
PL1 solution: “∀x.Wumpus(x) ⇒ (∀y.adj(x, y) ⇒ stench(y))”
Example 14.1.3.
There is no surjective function from the natural numbers onto the reals.
First-Order Predicate Logic has many good properties (complete calculi,
compactness, unitary, linear unification,. . . )
But too weak for formalizing: (at least directly)
We make the deliberate, but non-standard design choice here to include Skolem constants into
the signature from the start. These are used in inference systems to give names to objects and
construct witnesses. Other than the fact that they are usually introduced by need, they work
exactly like regular constants, which makes the inclusion rather painless. As we can never predict
how many Skolem constants we are going to need, we give ourselves countably infinitely many for
every arity. Our supply of individual variables is countably infinite for the same reason.
The formulae of first-order logic are built up from the signature and variables as terms (to represent
individuals) and proposition (to represent proposition). The latter include the connectives from
PL0 , but also quantifiers.
Note: We only need e.g. conjunction, negation, and the universal quantifier; all other
logical constants can be defined from them (as we will see when we have fixed their interpretations).
Notation: here we write ∀x.A and ∃x.A; elsewhere one also finds ⋀x.A or (x)A for the universal
and ⋁x.A for the existential quantifier.
The introduction of quantifiers to first-order logic brings a new phenomenon: variables that are
in the scope of a quantifier behave very differently from the ones that are not. Therefore
we build up a vocabulary that distinguishes the two.
free(X) := {X}
free(f(A1, …, An)) := ⋃_{1≤i≤n} free(Ai)
free(p(A1, …, An)) := ⋃_{1≤i≤n} free(Ai)
free(¬A) := free(A)
free(A ∧ B) := free(A) ∪ free(B)
free(∀X.A) := free(A)\{X}
We will be mainly interested in (sets of) sentences – i.e. closed propositions – as the representations
of meaningful statements about individuals. Indeed, we will see below that free variables do
not give us additional expressivity, since they behave like constants and could be replaced by them in all
situations, except the recursive definition of quantified formulae. Indeed in all situations where
variables occur freely, they have the character of metavariables, i.e. syntactic placeholders that
can be instantiated with terms when needed in a calculus.
The semantics of first-order logic is a Tarski-style set-theoretic semantics where the atomic syn-
tactic entities are interpreted by mapping them into a well-understood structure, a first-order
universe that is just an arbitrary set.
Definition 14.2.13. We inherit the domain D0 = {T, F} of truth values from PL0
and assume an arbitrary domain Dι ̸= ∅ of individuals. (this choice is a parameter
to the semantics)
Definition 14.2.14. An interpretation I assigns values to constants, e.g.
We do not have to make the domain of truth values part of the model, since it is always the same;
we determine the model by choosing a domain and an interpretation function.
Given a first-order model, we can define the evaluation function as a homomorphism over the
construction of formulae.
The only new (and interesting) case in this definition is the quantifier case, there we define the
value of a quantified formula by the value of its scope – but with an extension of the incoming
variable assignment. Note that by passing to the scope A of ∀x.A, the occurrences of the variable
x in A that were bound in ∀x.A become free and are amenable to evaluation by the variable
assignment ψ := φ,[a/x]. Note that as an extension of φ, the assignment ψ supplies exactly the
right value for x in A. This variability of the variable assignment in the definition of the value
function justifies the somewhat complex setup of first-order evaluation, where we have the (static)
interpretation function for the symbols from the signature and the (dynamic) variable assignment
for the variables.
Note furthermore, that the value Iφ(∃x.A) of ∃x.A, which we have defined to be ¬(∀x.¬A), is
true, iff it is not the case that Iψ(¬A) = T for all a ∈ Dι and ψ := φ,[a/x]. This is
the case, iff Iψ(A) = T for some a ∈ Dι. So our definition of the existential quantifier yields the
appropriate semantics.
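Since the quantifier case is the only new one, here is a minimal Python sketch of first-order evaluation over a small finite universe (the actual definition allows arbitrary, possibly infinite Dι); the universe, interpretation, and formulae are made-up assumptions for this note.

# Minimal sketch: evaluating PL1 formulae over a finite universe.
# Terms: strings are variables, tuples ("f", t1, ..., tn) applications (0-ary = constants).
# Formulae: ("p", t1, ..., tn) atoms, ("not", A), ("and", A, B), ("forall", "X", A).

D = {0, 1, 2}                                          # a finite universe of individuals
I_fun = {"zero": lambda: 0, "succ": lambda n: min(n + 1, 2)}
I_pred = {"leq": {(a, b) for a in D for b in D if a <= b}}

def eval_term(t, phi):
    if isinstance(t, str):
        return phi[t]                                  # variables get their value from phi
    name, *args = t
    return I_fun[name](*[eval_term(a, phi) for a in args])

def eval_formula(A, phi):
    op, *args = A
    if op == "not":
        return not eval_formula(args[0], phi)
    if op == "and":
        return eval_formula(args[0], phi) and eval_formula(args[1], phi)
    if op == "forall":
        X, body = args
        # the quantifier case: evaluate the scope under phi,[a/X] for every a in D
        return all(eval_formula(body, {**phi, X: a}) for a in D)
    return tuple(eval_term(t, phi) for t in args) in I_pred[op]

# ∀X. leq(zero, X) is true in this model; ∀X. leq(succ(X), X) is not.
print(eval_formula(("forall", "X", ("leq", ("zero",), "X")), {}))        # True
print(eval_formula(("forall", "X", ("leq", ("succ", "X"), "X")), {}))    # False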
Substitutions on Terms
Intuition: If B is a term and X is a variable, then we denote the result of
systematically replacing all occurrences of X in a term A by B with [B/X](A).
Problem: What about [Z/Y ], [Y /X](X), is that Y or Z?
The extension of a substitution is an important operation, which you will run into from time
to time. Given a substitution σ, a variable x, and an expression A, σ,[A/x] extends σ with a
new value for x. The intuition is that the values right of the comma overwrite the pairs in the
substitution on the left, which already has a value for x, even though the representation of σ may
not show it.
Substitution Extension
Definition 14.2.23 (Substitution Extension). Let σ be a substitution, then we
denote the extension of σ with [A/X] by σ,[A/X] and define it as {(Y ,B) ∈
σ | Y ̸= X} ∪ {(X,A)}: σ,[A/X] coincides with σ off X, and gives the result A
there.
Note: If σ is a substitution, then σ,[A/X] is also a substitution.
We also need the dual operation: removing a variable from the support:
Note that the use of the comma notation for substitutions defined in ?? is consistent with sub-
stitution extension. We can view a substitution [a/x], [f (b)/y] as the extension of the empty
substitution (the identity function on variables) by [f (b)/y] and then by [a/x]. Note furthermore,
that substitution extension is not commutative in general.
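The following minimal Python sketch illustrates substitution application on terms and substitution extension, including the answer to the [Z/Y], [Y/X](X) question and the overwriting behaviour of extension; the term encoding is an assumption made for this note.

# Minimal sketch: substitutions on first-order terms and substitution extension.
# Terms: uppercase strings are variables, lowercase strings are constants,
# tuples ("f", t1, ..., tn) are function applications; a substitution is a dict.

def is_var(t):
    return isinstance(t, str) and t[:1].isupper()

def subst(sigma, t):
    """Apply sigma to t; all variables are replaced simultaneously."""
    if is_var(t):
        return sigma.get(t, t)                 # unbound variables stay fixed
    if isinstance(t, str):                     # constants stay fixed
        return t
    name, *args = t
    return (name, *[subst(sigma, a) for a in args])

def extend(sigma, var, term):
    """sigma,[term/var]: any previous pair for var is overwritten."""
    return {**sigma, var: term}

# [Z/Y],[Y/X](X) = Y: the whole substitution is applied in one simultaneous step,
# so X goes to Y, and that Y is *not* rewritten further to Z.
sigma = extend({"Y": "Z"}, "X", "Y")
print(subst(sigma, "X"))                       # 'Y'
print(subst(sigma, ("f", "X", "a")))           # ('f', 'Y', 'a')
# Extension is not commutative: later extensions overwrite earlier pairs.
print(extend(extend({}, "X", "a"), "X", ("f", "b")))   # {'X': ('f', 'b')}
print(extend(extend({}, "X", ("f", "b")), "X", "a"))   # {'X': 'a'}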
For first-order substitutions we need to extend the substitutions defined on terms to act on propo-
sitions. This is technically more involved, since we have to take care of bound variables.
Substitutions on Propositions
Problem: We want to extend substitutions to propositions, in particular to quan-
tified formulae: What is σ(∀X.A)?
Otherwise a free variable could be captured, i.e. become bound after instantiation, whereas it was
free before. Solution: Rename away the bound variable X in ∀X.p(X, Y) before applying the substitution.
Definition 14.2.26 (Capture-Avoiding Substitution Application). Let σ be a
substitution, A a formula, and A′ an alphabetic variant of A, such that intro(σ) ∩
BVar(A) = ∅. Then we define capture-avoiding substitution application via
σ(A) := σ(A′ ).
We now introduce a central tool for reasoning about the semantics of substitutions: the “sub-
stitution value Lemma”, which relates the process of instantiation to (semantic) evaluation. This
result will be the motor of all soundness proofs on axioms and inference rules acting on variables
via substitutions. In fact, any logic with variables and substitutions will have (to have) some form
of a substitution value Lemma to get the meta-theory going, so it is usually the first target in any
development of such a logic. We establish the substitution-value Lemma for first-order logic in
two steps, first on terms, where it is very simple, and then on propositions.
by induction hypothesis
2.2. This completes the induction step, and we have proven the assertion.
1. n = 0
1.1. then A is an atomic proposition, and we can argue like in the induction
step of the substitution value lemma for terms.
2. n > 0 and A = ¬B or A = C ◦ D
2.1. Here we argue like in the induction step of the term lemma as well.
3. n > 0 and A = ∀Y .C where (WLOG) X ̸= Y (otherwise rename)
3.1. then I ψ (A) = I ψ (∀Y .C) = T, iff I ψ,[a/Y ] (C) = T for all a ∈ Dι .
3.2. But I ψ,[a/Y ] (C) = I φ,[a/Y ] ([B/X](C)) = T, by induction hypothesis.
3.3. So I ψ (A) = I φ (∀Y .[B/X](C)) = I φ ([B/X](∀Y .C)) = I φ ([B/X](A))
To understand the proof fully, you should think about where the WLOG – it stands for "without
loss of generality" – comes from.
ND1∀I*: from A, infer ∀X.A, provided that (*) A does not depend on any hypothesis in which X is free.
ND1∀E: from ∀X.A, infer [B/X](A).
ND1∃I: from [B/X](A), infer ∃X.A.
ND1∃E¹: from ∃X.A and a derivation of C from the local hypothesis [[c/X](A)]¹, where c ∈ Σsk_0 is
new, infer C, discharging the hypothesis.
The intuition behind the rule ND1 ∀I is that a formula A with a (free) variable X can be generalized
to ∀X.A, if X stands for an arbitrary object, i.e. there are no restricting assumptions about X.
The ND1 ∀E rule is just a substitution rule that allows to instantiate arbitrary terms B for X
in A. The ND1 ∃I rule says if we have a witness B for X in A (i.e. a concrete term B that
makes A true), then we can existentially close A. The ND1 ∃E rule corresponds to the common
mathematical practice, where we give objects we know exist a new name c and continue the proof
by reasoning about this concrete object c. Anything we can prove from the assumption [c/X](A)
we can prove outright if ∃X.A is known.
Now we reformulate the classical formulation of the calculus of natural deduction as a
sequent calculus by lifting it to the “judgments level” as we did for propositional logic. We only
need provide new quantifier rules.
ND⊢1∃I: from Γ ⊢ [B/X](A), infer Γ ⊢ ∃X.A.
ND⊢1∃E: from Γ ⊢ ∃X.A and Γ, [c/X](A) ⊢ C, where c ∈ Σsk_0 is new, infer Γ ⊢ C.
=I: infer A = A (no premises).
=E: from A = B and C[A]_p, infer [B/p]C,
where C[A]_p means that the formula C has a subterm A at position p and [B/p]C is the
result of replacing that subterm with B.
In many ways equivalence behaves like equality; we will use the following rules in ND1.
Definition 14.3.5. ⇔I is derivable and ⇔E is admissible in ND1:
⇔I: infer A ⇔ A (no premises).
⇔E: from A ⇔ B and C[A]_p, infer [B/p]C.
Again, we have two rules that follow the introduction/elimination pattern of natural deduction calculi.
Definition 14.3.6. We have the canonical sequent rules that correspond to them: =I, =E, ⇔I,
and ⇔E
To make sure that we understand the constructions here, let us get back to the “replacement at
position” operation used in the equality rules.
Positions in Formulae
Idea: Formulae are (naturally) trees, so we can use tree positions to talk about
subformulae
Definition 14.3.7. A position p is a tuple of natural numbers that in each node
of an expression (tree) specifies into which child to descend. For an expression A
we denote the subexpression at p with A|p .
We will sometimes write an expression C as C[A]_p to indicate that C has the subexpression
A at position p.
If C[A]_p and A is atomic, then we speak of an occurrence of A in C.
Definition 14.3.8. Let p be a position, then [A/p]C is the expression obtained
from C by replacing the subexpression at p by A.
(Schematically: in C, the subexpression at position p is A = C|_p; in [B/p]C, the subexpression at
position p is B.)
The operation of replacing a subformula at position p is quite different from e.g. (first-order)
substitutions:
• We are replacing subformulae with subformulae instead of instantiating variables with terms.
• Substitutions replace all occurrences of a variable in a formula, whereas formula replacement
only affects the (one) subformula at position p.
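To make positions concrete, here is a minimal Python sketch of subexpression access and replacement on expression trees; the tuple encoding and the example expression are assumptions made for this note.

# Minimal sketch: positions as tuples of child indices in expression trees.
# An expression is either a string (an atom or variable) or a tuple
# (operator, child_0, ..., child_{n-1}).

def subexpr_at(C, p):
    """C|_p: the subexpression of C at position p."""
    for i in p:
        C = C[i + 1]                              # child i sits right after the operator
    return C

def replace_at(C, p, A):
    """[A/p]C: replace the subexpression of C at position p by A."""
    if not p:
        return A
    i, rest = p[0], p[1:]
    return C[:i + 1] + (replace_at(C[i + 1], rest, A),) + C[i + 2:]

# C = (X = Y) ∧ p(X): the subexpression at position (1, 0) is the second occurrence of X,
# and replacing it by Z affects only that one occurrence (unlike a substitution).
C = ("and", ("=", "X", "Y"), ("p", "X"))
print(subexpr_at(C, (1, 0)))                      # 'X'
print(replace_at(C, (1, 0), "Z"))                 # ('and', ('=', 'X', 'Y'), ('p', 'Z'))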
We conclude this section with an extended example: the proof of a classical mathematical result
in the natural deduction calculus with equality. This shows us that we can derive strong properties
about complex situations (here the real numbers; an uncountably infinite set of numbers).
ND1= Example: √2 is Irrational
3. So we know 2q² = p².
4. But 2q² has an odd number of prime factors while p² has an even number.
5. This is a contradiction (since they are equal), so we have proven the assertion.
If we want to formalize this into ND1, we have to write down all the assertions in the proof steps
in PL1 syntax and come up with justifications for them in terms of ND1 inference rules. The next
two slides show such a proof, where we write prime(n) to say that n is prime, use #(n) for the number
of prime factors of a number n, and write irr(r) if r is irrational.
ND1= Example: √2 is Irrational (the Proof)
Lines 6 and 9 are local hypotheses for the proof (they only have an implicit counterpart in the
inference rules as defined above). Finally we have abbreviated the arithmetic simplification of line
9 with the justification “arith” to avoid having to formalize elementary arithmetic.
ND1= Example: √2 is Irrational (the Proof continued)
13        prime(2)                lemma
14   6,9  #(2q²) = #(q²) + 1      ND0⇒E(13, 12)
15   6,9  #(q²) = 2#(q)           ND1∀E²(2)
16   6,9  #(2q²) = 2#(q) + 1      =E(14, 15)
17        #(p²) = #(p²)           =I
18   6,9  #(2q²) = #(p²)          =E(17, 10)
19   6,9  2#(q) + 1 = #(p²)       =E(18, 16)
20   6,9  2#(q) + 1 = 2#(p)       =E(19, 11)
21   6,9  ¬(2#(q) + 1 = 2#(p))    ND1∀E²(1)
22   6,9  F                       ND0FI(20, 21)
23   6    F                       ND1∃E⁶(22)
24        ¬¬irr(√2)               ND0¬I⁶(23)
25        irr(√2)                 ND0¬E(24)
We observe that the ND1 proof is much more detailed, and needs quite a few Lemmata about
# to go through. Furthermore, we have added a definition of irrationality (and treat definitional
equality via the equality rules). Apart from these artefacts of formalization, the two representations
of proofs correspond to each other very directly.
14.4 Conclusion
Summary (Predicate Logic)
First-order logic allows us to explicitly speak about objects and their properties. It is
thus a more natural and compact representation language than propositional logic;
it also enables us to speak about infinite sets of objects.
Logic has thousands of years of history. A major current application in AI is semantic
technology. (up soon)
First-order logic (PL1) allows universal and existential quantification over
individuals.
A PL1 model consists of a universe Dι and a function I mapping individual con-
stants/predicate constants/function constants to elements/relations/functions on
Dι .
First-order natural deduction is a sound and complete calculus for PL1 intended
and optimized for human understanding.
Recap: We can express mathematical theorems in PL1 and prove them in ND1 .
Problem: These proofs can be huge (giga-steps), how can we trust them?
Idea: If the logic can express (safety)-properties of programs, we can use proof
checkers for formal program verification. (there are extensions of PL1 that can)
Problem: These proofs can be humongous, how can humans write them?
Idea: Automate proof construction via
Suggested Reading:
• Chapter 8: First-Order Logic, Sections 8.1 and 8.2 in [RN09]
– A less formal account of what I cover in “Syntax” and “Semantics”. Contains different exam-
ples, and complementary explanations. Nice as additional background reading.
• Sections 8.3 and 8.4 provide additional material on using PL1, and on modeling in PL1, that I
don’t cover in this lecture. Nice reading, not required for exam.
• Chapter 9: Inference in First-Order Logic, Section 9.5.1 in [RN09]
– A very brief (2 pages) description of what I cover in “Normal Forms”. Much less formal; I
couldn’t find where (if at all) RN cover transformation into prenex normal form. Can serve
as additional reading, can’t replace the lecture.
• Excursion: A full analysis of any calculus needs a completeness proof. We will not cover this
in AI-1, but provide one for the calculi introduced so far in ??.
Chapter 15
In this chapter, we take up the machine-oriented calculi for propositional logic from ?? and extend
them to the first-order case. While this has been relatively easy for the natural deduction calculus
– we only had to introduce the notion of substitutions for the elimination rule for the universal
quantifier – we have to work much harder here to make the calculi effective for implementation.
Tableau calculi develop a formula in a tree-shaped arrangement that represents a case analysis
on when a formula can be made true (or false). Therefore the formulae are decorated with upper
indices that hold the intended truth value.
On the left we have a refutation tableau that analyzes a negated formula (it is decorated with the
intended truth value F). Both branches contain an elementary contradiction ⊥.
On the right we have a model generation tableau, which analyzes a positive formula (it is
decorated with the intended truth value T). This tableau uses the same rules as the refutation
tableau, but makes a case analysis of when this formula can be satisfied. In this case we have a
closed branch and an open one. The latter corresponds to a model.
Now that we have seen the examples, we can write down the tableau rules formally.
T0∧: from (A ∧ B)^T, add A^T and B^T to the branch.
T0∨: from (A ∧ B)^F, split the branch into one extended with A^F and one extended with B^F.
T0¬T: from ¬A^T, add A^F.   T0¬F: from ¬A^F, add A^T.
T0⊥: from A^α and A^β with α ≠ β, add ⊥ (the branch is closed).
These inference rules act on tableaux and have to be read as follows: if the formulae over the line
appear in a tableau branch, then the branch can be extended by the formulae or branches below
the line. There are two rules for each primary connective, and a branch closing rule that adds the
special symbol ⊥ (for unsatisfiability) to a branch.
We use the tableau rules with the convention that they are only applied, if they contribute new
material to the branch. This ensures termination of the tableau procedure for propositional logic
(every rule eliminates one primary connective).
Definition 15.1.5. We will call a closed tableau with the labeled formula Aα at the root a
tableau refutation for Aα .
The saturated tableau represents a full case analysis of what is necessary to give A the truth
value α; since all branches are closed (contain contradictions) this is impossible.
Definition 15.1.7. We will call a tableau refutation for AF a tableau proof for A, since it refutes
the possibility of finding a model where A evaluates to F. Thus A must evaluate to T in all
models, which is just our definition of validity.
Thus the tableau procedure can be used as a calculus for propositional logic. In contrast to the
propositional Hilbert calculus it does not prove a theorem A by deriving it from a set of axioms,
but it proves it by refuting its negation. Such calculi are called negative or test calculi. Generally
negative calculi have computational advantages over positive ones, since they have a built-in sense
of direction.
We have rules for all the necessary connectives (we restrict ourselves to ∧ and ¬, since the others
can be expressed in terms of these two via the propositional identities above. For instance, we can
write A ∨ B as ¬(¬A ∧ ¬B), and A ⇒ B as ¬A ∨ B,. . . .)
We will now extend the propositional tableau techniques to first-order logic. We only have to add
two new rules for the universal quantifier (in positive and negative polarity).
The rule T1 ∀ operationalizes the intuition that a universally quantified formula is true, iff all
of the instances of the scope are. To understand the T1 ∃ rule, we have to keep in mind that
∃X.A abbreviates ¬(∀X.¬A), so that we have to read (∀X.A)^F existentially — i.e. as (∃X.¬A)^T,
stating that there is an object with property ¬A. In this situation, we can simply give this
object a name: c, which we take from our (infinite) set of witness constants Σsk_0, which we have
given ourselves expressly for this purpose when we defined first-order syntax. In other words
([c/X](¬A))^T = ([c/X](A))^F holds, and this is just the conclusion of the T1∃ rule.
Note that the T1 ∀ rule is computationally extremely inefficient: we have to guess an (i.e. in a
search setting to systematically consider all) instance C ∈ wff ι (Σι , Vι ) for X. This makes the rule
infinitely branching.
In the next calculus we will try to remedy the computational inefficiency of the T1 ∀ rule. We do
this by delaying the choice in the universal rule.
Definition 15.1.9. The free variable tableau calculus (T1f ) extends T0 (proposi-
tional tableau calculus) with the quantifier rules:
T1f∀: from (∀X.A)^T, add ([Y/X](A))^T, where Y is a new metavariable.
T1f∃: from (∀X.A)^F, add ([f(X1, …, Xk)/X](A))^F, where f is a new Skolem constant and
X1, …, Xk are the metavariables occurring in A.
T1f⊥: from A^α and B^β with α ≠ β, if there is a substitution σ with σ(A) = σ(B), close the
branch with ⊥ : σ, instantiating the whole tableau with σ.
Metavariables: Instead of guessing a concrete instance for the universally quantified variable
as in the T1 ∀ rule, T1f ∀ instantiates it with a new metavariable Y , which will be instantiated by
need in the course of the derivation.
Skolem terms as witnesses: The introduction of metavariables makes it necessary to extend
the treatment of witnesses in the existential rule. Intuitively, we cannot simply invent a new name,
since the meaning of the body A may contain metavariables introduced by the T1f∀ rule. As we
do not know their values yet, the witness for the existential statement in the antecedent of the
T1f∃ rule needs to depend on them. So we represent it by a witness term, concretely by applying a
Skolem function to the metavariables in A.
Instantiating Metavariables: Finally, the T1f⊥ rule completes the treatment of metavariables,
it allows to instantiate the whole tableau in a way that the current branch closes. This leaves us
with the problem of finding substitutions that make two terms equal.
Let’s Talk
Tableau Aboutabout
Reasons Blocks, Baby . . .
Blocks
Example 15.1.11 (Reasoning about Blocks). Returing to slide 405
I Question: What do you see here?
A D B E C
I You say: “All blocks are red”; “All blocks are on the table”; “A is a block”.
Can we prove red(A) from ∀x.block(x) ⇒ red(x) and block(A)?
I And now: Say it in propositional logic!
T
(∀X.block(X) ⇒ red(X))
T
block(A)
F
red(A)
T
(block(Y ) ⇒ red(Y ))
F T
block(Y ) red(A)
⊥ : [A/Y ] ⊥
Unification (Definitions)
Definition 15.1.12. For given terms A and B, unification is the problem of finding
a substitution σ, such that σ(A) = σ(B).
Notation: We write term pairs as A=?B e.g. f (X)=?f (g(Y )).
The idea behind a most general unifier is that all other unifiers can be obtained from it by (further)
instantiation. In an automated theorem proving setting, this means that using most general
unifiers is the least committed choice — any other choice of unifiers (that would be necessary for
completeness) can later be obtained by other substitutions.
Note that there is a subtlety in the definition of the ordering on substitutions: we only compare
on a subset of the variables. The reason for this is that we have defined substitutions to be total
on (the infinite set of) variables for flexibility, but in the applications (see the definition of most
general unifiers), we are only interested in a subset of variables: the ones that occur in the initial
problem formulation. Intuitively, we do not care what the unifiers do off that set. If we did not
have the restriction to the set W of variables, the ordering relation on substitutions would become
much too fine-grained to be useful (i.e. to guarantee unique most general unifiers in our case).
Now that we have defined the problem, we can turn to the unification algorithm itself. We
will define it in a way that is very similar to logic programming: we first define a calculus that
generates “solved forms” (formulae from which we can read off the solution) and reason about
control later. In this case we will reason that control does not matter.
Unification Problems (=̂ Equational Systems)
Idea: Unification is equation solving.
Definition 15.1.16. We call a formula A1=?B1 ∧ . . . ∧ An=?Bn an unification
problem iff Ai , Bi ∈ wff ι (Σι , Vι ).
Note: We consider unification problems as sets of equations (∧ is ACI), and
equations as two-element multisets (=? is C).
In principle, unification problems are sets of equations, which we write as conjunctions, since all of
them have to be solved for finding a unifier. It is not a problem for the “logical view” that the
representation as conjunctions induces an order, since conjunction is associative, commutative and
idempotent, i.e. conjuncts have no intrinsic order or multiplicity; so we consider two unification
problems as equal if they are equivalent as propositional formulae.
In the same way, we will abstract from the order in equations, since we know that the equality
relation is symmetric. Of course we would have to deal with this somehow in the implementation
(typically, we would implement equational problems as lists of pairs), but that belongs into the
“control” aspect of the algorithm, which we are abstracting from at the moment.
Lemma 15.1.19. Solved forms are of the form X1 =? B1 ∧ . . . ∧ Xn =? Bn where
the Xi are distinct and Xi ∉ free(Bj ).
Definition 15.1.20. Any substitution σ = [B1 /X 1 ], . . . ,[Bn /X n ] induces a solved
unification problem E σ :=(X 1=?B1 ∧ . . . ∧ X n=?Bn ).
Lemma 15.1.21. If E = X 1=?B1 ∧ . . . ∧ X n=?Bn is a solved form, then E has
the unique most general unifier σ E :=[B1 /X 1 ], . . . ,[Bn /X n ].
Proof: Let θ ∈ U(E)
1. then θ(X i ) = θ(Bi ) = θ ◦ σ E (X i )
2. and thus θ=θ ◦ σ E [supp(σ)].
It is essential to our “logical” analysis of the unification algorithm that we arrive at unification
problems whose unifiers we can read off easily. Solved forms serve that need perfectly as ??
shows.
Given the idea that unification problems can be expressed as formulae, we can express the algo-
rithm in three simple rules that transform unification problems into solved forms (or unsolvable
ones).
Unification Algorithm
Definition 15.1.22. The inference system U consists of the following rules:
The decomposition rule Udec is completely straightforward, but note that it transforms one unification
pair into multiple argument pairs; this is the reason why we work with whole unification problems
(conjunctions of pairs) rather than with single pairs.
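To make the transformation rules concrete, here is a minimal Python sketch of the inference system U (it is not part of the course materials; the term representation – strings for variables, tuples for applications – and all function names are assumptions made for this illustration):

def is_var(t):
    return isinstance(t, str)

def occurs(x, t):
    # occurs check: does variable x occur in term t?
    if is_var(t):
        return x == t
    return any(occurs(x, s) for s in t[1:])

def subst(t, x, r):
    # apply the substitution [r/x] to the term t
    if is_var(t):
        return r if t == x else t
    return (t[0],) + tuple(subst(s, x, r) for s in t[1:])

def unify(eqs):
    """Transform a list of pairs (a unification problem) towards solved form.
    Returns the most general unifier as a dict, or None if unsolvable."""
    eqs = list(eqs)
    sigma = {}
    while eqs:
        a, b = eqs.pop()
        if is_var(a) or is_var(b):
            if not is_var(a):
                a, b = b, a                      # orient: variable on the left
            if a == b:
                continue                         # Utriv: drop a trivial pair X =? X
            if occurs(a, b):
                return None                      # occurs check fails: unsolvable
            # Uelim: eliminate a in the remaining pairs and in the solution so far
            eqs = [(subst(s, a, b), subst(t, a, b)) for (s, t) in eqs]
            sigma = {x: subst(t, a, b) for x, t in sigma.items()}
            sigma[a] = b
        elif a[0] == b[0] and len(a) == len(b):
            eqs.extend(zip(a[1:], b[1:]))        # Udec: decompose f(s...) =? f(t...)
        else:
            return None                          # clash of function symbols: unsolvable
    return sigma

# e.g. unify([(("f", "X"), ("f", ("g", "Y")))]) yields {"X": ("g", "Y")}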
Unification Examples
Example 15.1.27. Two similar unification problems:
We will now convince ourselves that there cannot be any infinite sequences of transformations in
U. Termination is an important property for an algorithm.
The proof we present here is very typical for termination proofs. We map unification problems
into a partially ordered set ⟨S, ≺⟩ where we know that there cannot be any infinitely descending
sequences (we think of this as measuring the unification problems). Then we show that all trans-
formations in U strictly decrease the measure of the unification problems and argue that if there
were an infinite transformation in U, then there would be an infinite descending chain in S, which
contradicts our choice of ⟨S, ≺⟩.
The crucial step in coming up with such proofs is finding the right partially ordered set.
Fortunately, there are some tools we can make use of. We know that ⟨N, <⟩ is terminating, and
there are some ways of lifting component orderings to complex structures. For instance it is well-
known that the lexicographic ordering lifts a terminating ordering to a terminating ordering on
finite dimensional Cartesian spaces. We show a similar, but less known construction with multisets
for our proof.
Unification (Termination)
Definition 15.1.28. Let S and T be multisets and ≤ a partial ordering on S ∪ T .
Then we define S ≺m T , iff S = C ⊎ S ′ and T = C ⊎ {t}, where s < t for all s ∈ S ′ .
We call ≺m the multiset ordering induced by ≤.
Definition 15.1.29. We call a variable X solved in a unification problem E, iff E
contains a solved pair X =? A.
But it is very simple to create terminating calculi, e.g. by having no inference rules. So there
is one more step to go to turn the termination result into a decidability result: we must make sure
that we have enough inference rules so that any unification problem is transformed into solved
form if it is unifiable.
Proof: We assume that E is unifiable but unsolved and show that a U rule applies.
1. There is an unsolved pair A =? B in E = E ′ ∧ A =? B.
we have two cases
2. A, B ̸∈ Vι
2.1. then A = f (A1 . . . An ) and B = f (B1 . . . Bn ), and thus Udec is appli-
cable
3. A = X ∈ free(E)
3.1. then Uelim (if B ̸= X) or Utriv (if B = X) is applicable.
Corollary 15.1.34. First-order unification is decidable in PL1 .
Proof:
1. U-irreducible unification problems can be reached in finite time by ??.
2. They are either solved or unsolvable by ??, so they provide the answer.
Complexity of Unification
Observation: Naive implementations of unification are exponential in time and
space.
Indeed, the only way to escape this combinatorial explosion is to find representations of substitu-
tions that are more space efficient.
[Figure: the terms s3 and t3 and the instantiated term σ3 (t3 ) drawn as trees over the binary function
symbol f and the variables x0 , . . ., x3 , illustrating the exponential blowup caused by copying subterms]
If we look at the unification algorithm from ?? and the considerations in the termination proof
(??) with a particular focus on the role of copying, we easily find the culprit for the exponential
blowup: Uelim, which applies solved pairs as substitutions.
We will now turn the ideas we have developed in the last couple of slides into a usable func-
tional algorithm. The starting point is treating terms as DAGs. Then we try to conduct the
transformation into solved form without adding new nodes.
Unification by DAG-chase
Idea: Extend the Input-DAGs by edges that represent unifiers.
Definition 15.1.40. Write n.a, if a is the symbol of node n.
Algorithm dag−unify
Input: symmetric pairs of nodes in DAGs
fun dag−unify(n,n) = true                              (∗ identical nodes always unify ∗)
| dag−unify(n.x,m) = if occur(n,m) then false          (∗ occurs check: x must not occur in m ∗)
                     else union(n,m)                   (∗ bind the variable node to m ∗)
| dag−unify(n.f ,m.g) =
    if g!=f then false                                 (∗ clash of function symbols ∗)
    else                                               (∗ recurse on corresponding children ∗)
      forall (i,j) => dag−unify(find(i),find(j)) (chld m,chld n)
end
Observation 15.1.41. dag−unify uses linear space, since no new nodes are created,
and at most one link per variable.
Problem: dag−unify still uses exponential time.
Example 15.1.42. Consider the terms f (sn , f (t′n , xn )) and f (tn , f (s′n , y n )), where s′n =
[y i /xi ](sn ) and t′n = [y i /xi ](tn ).
dag−unify needs exponentially many recursive calls to unify the nodes xn and y n .
(they are unified after n calls, but checking needs the time)
Algorithm uf−unify
Recall: dag−unify still uses exponential time.
Idea: Also bind the function nodes, if the arguments are unified.
uf−unify(n.f ,m.g) =
if g!=f then false
else union(n,m);
forall (i,j) => uf−unify(find(i),find(j)) (chld m,chld n)
end
This only needs linearly many recursive calls as it directly returns with true or makes
a node inaccessible for find.
Linearly many calls to linear procedures give quadratic running time.
Remark: There are versions of uf−unify that are linear in time and space, but for
most purposes, our algorithm suffices.
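For illustration, the idea behind uf−unify can be rendered as a small Python sketch with an explicit union-find structure on DAG nodes; the Node class and all names below are assumptions made for this illustration, not part of the course materials:

class Node:
    """A node in the shared term DAG."""
    def __init__(self, symbol, children=(), is_var=False):
        self.symbol = symbol            # function symbol or variable name
        self.children = list(children)  # argument nodes (empty for variables)
        self.is_var = is_var
        self.parent = self              # union-find pointer

def find(n):
    while n.parent is not n:            # walk to the class representative
        n.parent = n.parent.parent      # shorten the path on the way up
        n = n.parent
    return n

def union(n, m):
    find(n).parent = find(m)            # merge the two equivalence classes
    return True

def occurs(v, n):
    n = find(n)                         # occurs check on representatives
    return n is v or any(occurs(v, c) for c in n.children)

def uf_unify(n, m):
    n, m = find(n), find(m)
    if n is m:
        return True
    if n.is_var:
        return False if occurs(n, m) else union(n, m)
    if m.is_var:
        return uf_unify(m, n)
    if n.symbol != m.symbol or len(n.children) != len(m.children):
        return False                    # clash of function symbols or arities
    union(n, m)                         # also merge the two function nodes
    return all(uf_unify(a, b) for a, b in zip(n.children, m.children))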
After we have used up p(y) by applying [a/y] in T1f⊥, we have to get a new instance p(z) of the
universal formula, which can then be closed with the instantiation [b/z].
Proof sketch: All T1f rules reduce the number of connectives and negative ∀ or the
multiplicity of positive ∀.
Theorem 15.1.47. T1f is only complete with unbounded multiplicities.
The other thing we need to realize is that there may be multiple ways we can use T1f⊥ to close a
branch in a tableau, and – as T1f⊥ instantiates the whole tableau and not just the branch itself –
the choice of instantiation can affect whether the other branches can still be closed.
Treating T1f⊥
Example 15.1.48. Choosing which matters – this tableau does not close!
(∃x.(p(a) ∧ p(b) ⇒ p(x)) ∧ (q(b) ⇒ q(x)))^F
((p(a) ∧ p(b) ⇒ p(y)) ∧ (q(b) ⇒ q(y)))^F
(p(a) ∧ p(b) ⇒ p(y))^F          (q(b) ⇒ q(y))^F
p(a)^T                          q(b)^T
p(b)^T                          q(y)^F
p(y)^F
⊥ : [a/y]
The method of spanning matings follows the intuition that if we do not have good information
on how to decide for a pair of opposite literals on a branch to use in T1f⊥, we delay the choice by
initially disregarding the rule altogether during saturation and then – in a later phase– looking
for a configuration of cuts that have a joint overall unifier. The big advantage of this is that we
only need to know that one exists, we do not need to compute or apply it, which would lead to
exponential blow-up as we have seen above.
Observation 15.1.49. T1f without T1f⊥ is terminating and confluent for given
multiplicities.
Idea: Saturate without T1f⊥ and treat all cuts at the same time (later).
Definition 15.1.50.
Let T be a T1f tableau, then we call a unification problem E := A1 =? B1 ∧ . . . ∧
An =? Bn a mating for T , iff Ai^T and Bi^F occur in the same branch in T .
We say that E is a spanning mating, if E is unifiable and every branch B of T
contains Ai^T and Bi^F for some i.
Theorem 15.1.51. A T1f -tableau with a spanning mating induces a closed T1
tableau.
Excursion: Now that we understand basic unification theory, we can come to the meta-theoretical
properties of the tableau calculus. We delegate this discussion to ??.
Resolution: from A^T ∨ C and B^F ∨ D with σ = mgu(A, B) derive (σ(C)) ∨ (σ(D)).
Factoring: from A^α ∨ B^α ∨ C with σ = mgu(A, B) derive (σ(A))^α ∨ (σ(C)).
Excursion: Again, we relegate the meta-theoretical properties of the first-order resolution calculus
to ??.
Problem: That is only true, if we only give the theorem prover exactly the right
laws and background knowledge. If we give it all of them, it drowns in the combi-
natorial explosion.
Let us build a resolution proof for the claim above.
But first we must translate the situation into first-order logic clauses.
Convention: In what follows, for better readability we will sometimes write impli-
cations P ∧ Q ∧ R ⇒ S instead of clauses P F ∨ QF ∨ RF ∨ S T .
West is an American:
Clause: ami(West)
The country Nono is an enemy of America:
Clause: enmy(NoNo, USA)
Clause: missile(X2 )^F ∨ own(NoNo, X2 )^F ∨ sell(West, X2 , NoNo)^T
Clause: animal(X4 )^F ∨ love(jack, X4 )^T
Cats are animals:
Clause: cat(X5 )^F ∨ animal(X5 )^T
love(g(jack), jack)^T
Excursion: A full analysis of any calculus needs a completeness proof. We will not cover this in
the course, but provide one for the calculi introduced so far in ??.
Definition 15.3.4. A Horn clause is a clause with at most one positive literal.
Recall: Backchaining as search:
state = tuple of goals; goal state = empty list (of goals).
next(⟨G, R1 , . . ., Rl ⟩) := ⟨σ(B 1 ), . . ., σ(B m ), σ(R1 ), . . ., σ(Rl )⟩ if there is a
rule H:−B 1 ,. . ., B m . and a substitution σ with σ(H) = σ(G).
Note: Backchaining becomes resolution:
from P^T ∨ A and P^F ∨ B derive A ∨ B
positive, unit-resulting hyperresolution (PURR)
This observation helps us understand Prolog better, and use implementation techniques from
automated theorem proving.
Definition 15.3.6. Horn logic is the formal system whose language is the set of
Horn clauses together with the calculus H given by MP, ∧I, and Subst.
Definition 15.3.7. A logic program P entails a query Q with answer substitution
σ, iff there is a H derivation D of Q from P and σ is the combined substitution of
the Subst instances in D.
To gain an intuition for this quite abstract definition let us consider a concrete knowledge base
about cars. Instead of writing down everything we know about cars, we only write down that cars
are motor vehicles with four wheels and that a particular object c has a motor and four wheels. We
can see that the fact that c is a car can be derived from this. Given our definition of a knowledge
base as the deductive closure of the facts and rule explicitly written down, the assertion that c is
a car is in the induced knowledge base, which is what we are after.
In this very simple example car(c) is about the only fact we can derive, but in general, knowledge
bases can be infinite (we will see examples below).
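As an illustration of this deductive closure, here is a tiny propositional Horn-clause version of the cars example in Python (a minimal sketch; the fact and rule names are made up for this illustration and are not from the course materials):

# facts and a single rule: "motor vehicles with four wheels are cars"
facts = {"has_motor(c)", "has_four_wheels(c)"}
rules = [({"has_motor(c)", "has_four_wheels(c)"}, "car(c)")]   # (body, head)

def forward_closure(facts, rules):
    """Compute the deductive closure (the induced knowledge base)."""
    kb = set(facts)
    changed = True
    while changed:
        changed = False
        for body, head in rules:
            if body <= kb and head not in kb:   # rule applicable and head new
                kb.add(head)
                changed = True
    return kb

print("car(c)" in forward_closure(facts, rules))   # True: car(c) is derivable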
e.g. greek(sokrates),greek(perikles)
Question: Are there fallible greeks?
Indefinite answer: Yes, Perikles or Sokrates
Warning: how about Sokrates and Perikles?
e.g. greek(sokrates),roman(sokrates):−.
Query: Are there fallible greeks?
Answer: Yes, Sokrates, if he is not a roman
Is this abduction?????
According to an influential view of [PRR97], knowledge appears in layers. Starting with a character
set that defines a set of glyphs, we can add syntax that turns mere strings into data. Adding context
information gives information, and finally, relating the information to other information allows us
to draw conclusions, turning information into knowledge.
Note that we already have aspects of representation and function in the diagram at the top of the
slide. In this, the additional functionality added in the successive layers gives the representations
more and more functions, until we reach the knowledge level, where the function is given by infer-
encing. In the second example, we can see that representations determine possible functions.
Let us now strengthen our intuition about knowledge by contrasting knowledge representations
with “regular” data structures in computation.
As knowledge is such a central notion in artificial intelligence, it is not surprising that there are
multiple approaches to dealing with it. We will only deal with the first one and leave the others
to self-study.
When assessing the relative strengths of the respective approaches, we should evaluate them with
respect to a pre-determined set of criteria.
KR Approaches/Evaluation Criteria
Definition 16.1.1. The evaluation criteria for knowledge representation approaches
are:
Expressive adequacy: What can be represented, what distinctions are supported.
Reasoning efficiency: Can the representation support processing that generates
results in acceptable speed?
Primitives: What are the primitive elements of representation, are they intuitive,
cognitively adequate?
Meta representation: Knowledge about knowledge
Completeness: The problems of reasoning with knowledge that is known to be
incomplete.
Even though the network in ?? is very intuitive (we immediately understand the concepts de-
picted), it is unclear how we (and more importantly a machine that does not associate meaning
with the labels of the nodes and edges) can draw inferences from the “knowledge” represented.
Idea: Links labeled with “isa” and “inst” are special: they propagate properties
encoded by other links.
Definition 16.1.6. We call links labeled by
“isa” an inclusion or isa link (inclusion of concepts)
“inst” instance or inst link (concept membership)
We now make the idea of “propagating properties” rigorous by defining the notion of derived
relations, i.e. the relations that are left implicit in the network, but can be added without changing
its meaning.
[Figure: a semantic network – robin isa bird, Jack inst robin, Mary and John inst Person, bird
has_part wings, plus owner_of and loves links between Mary and John]
Slogan: Get out more knowledge from a semantic network than you put in.
Note that ?? does not quite allow us to derive that Jack is a bird (did you spot that “isa” is not a
relation that can be inferred?), even though we know it is true in the world. This shows us that
inference in semantic networks has to be very carefully defined and may not be “complete”, i.e.
there are things that are true in the real world that our inference procedure does not capture.
Dually, if we are not careful, then the inference procedure might derive properties that are not
true in the real world even if all the properties explicitly put into the network are. We call such
an inference procedure unsound or incorrect.
These are two general phenomena we have to keep an eye on.
Another problem is that semantic networks (e.g. in ??) confuse two kinds of concepts: individuals
(represented by proper names like John and Jack) and concepts (nouns like robin and bird). Even
though the isa and inst link already acknowledge this distinction, the “has_part” and “loves”
relations are at different levels entirely, but not distinguished in the networks.
[Figure: a semantic network split into a TBox (animal can move; amoeba and higher animal isa animal;
higher animal has_part legs and head; tiger and elephant isa higher animal with pattern striped and
color gray, plus eat links) and an ABox with the instances Roy, Rex, and Clyde]
In particular we have objects “Rex”, “Roy”, and “Clyde”, which have (derived) rela-
tions (e.g. Clyde is gray).
But there are severe shortcomings of semantic networks: the suggestive shape and node names
give (humans) a false sense of meaning, and the inference rules are only given in the process model
(the implementation of the semantic network processing system).
This makes it very difficult to assess the strength of the inference system and make assertions
e.g. about completeness.
Example 16.1.12. Consider a robin that has lost its wings in an accident:
[Figure: two networks – bird has_part wings, robin isa bird, jack inst robin; and the same structure
for joe, but with a “cancel” link overriding the has_part wings property]
“Cancel-links” have been proposed, but their status and process model are debatable.
To alleviate the perceived drawbacks of semantic networks, we can contemplate another notation
that is more linear and thus more easily implemented: function/argument notation.
Evaluation:
+ linear notation (equivalent, but better to implement on a computer)
+ easy to give process model by deduction (e.g. in Prolog)
– worse locality properties (networks are associative)
Indeed the function/argument notation is the immediate idea for how one would naturally represent
semantic networks for implementation.
This notation has been also characterized as subject/predicate/object triples, alluding to simple
(English) sentences. This will play a role in the “semantic web” later.
Building on the function/argument notation from above, we can now give a formal semantics for
semantic networks: we translate them into first-order logic and use the semantics of that.
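For instance, one possible such translation (one of several reasonable choices, given here for illustration) maps the links of the network above to the following first-order formulae:

isa(robin, bird) translates to ∀x.robin(x) ⇒ bird(x)
inst(Jack, robin) translates to robin(Jack)
has_part(bird, wings) translates to ∀x.bird(x) ⇒ (∃y.wings(y) ∧ has_part(x, y))
loves(John, Mary) stays as the atomic formula loves(John, Mary)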
Indeed, the semantics induced by the translation to first-order logic gives the intuitive meaning to
the semantic networks. Note that this only holds for the features of semantic networks that
are representable in this way, e.g. the “cancel links” shown above are not (and that is a feature,
not a bug).
But even more importantly, the translation to first-order logic gives a first process model: we
can use first-order inference to compute the set of inferences that can be drawn from a semantic
network.
Before we go on, let us have a look at an important application of knowledge representation
technologies: the semantic web.
Humans understand the text and combine the information to get the answer. Ma-
chines need more than just text ; semantic web technology.
The term “semantic web” was coined by Tim Berners Lee in analogy to semantic networks, only
applied to the world wide web. And as for semantic networks, where we have inference processes
that allow us the recover information that is not explicitly represented from the network (here the
world-wide-web).
To see that problems have to be solved, to arrive at the semantic web, we will now look at a
concrete example about the “semantics” in web pages. Here is one that looks typical enough.
WWW2002
The eleventh International World Wide Web Conference
Sheraton Waikiki Hotel
Honolulu, Hawaii, USA
On the 7th May Honolulu will provide the backdrop of the eleventh
International World Wide Web Conference.
Speakers confirmed
Tim Berners-Lee: Tim is the well known inventor of the Web,
Ian Foster: Ian is the pioneer of the Grid, the next generation internet.
But as for semantic networks, what you as a human can see (“understand” really) is deceptive, so
let us obfuscate the document to confuse your “semantic processor”. This gives an impression of
what the computer “sees”.
R⌉}⟩∫⊔⌉∇⌉⌈√⊣∇⊔⟩⌋⟩√⊣\⊔∫⌋≀⇕⟩\}{∇≀⇕
A⊓∫⊔∇⊣↕⟩⊣⇔C⊣\⊣⌈⊣⇔C⟨⟩↕⌉D⌉\⇕⊣∇∥⇔F∇⊣\⌋⌉⇔G⌉∇⇕⊣\†⇔G⟨⊣\⊣⇔H≀\}K≀\}⇔I\⌈⟩⊣⇔
I∇⌉↕⊣\⌈⇔I⊔⊣↕†⇔J⊣√⊣\⇔M⊣↕⊔⊣⇔N⌉⊒Z⌉⊣↕⊣\⌈⇔T⟨⌉N⌉⊔⟨⌉∇↕⊣\⌈∫⇔N≀∇⊒⊣†⇔
S⟩\}⊣√≀∇⌉⇔S⊒⟩⊔‡⌉∇↕⊣\⌈⇔⊔⟨⌉U\⟩⊔⌉⌈K⟩\}⌈≀⇕⇔⊔⟨⌉U\⟩⊔⌉⌈S⊔⊣⊔⌉∫⇔V⟩⌉⊔\⊣⇕⇔Z⊣⟩∇⌉
O\⊔⟨⌉7⊔⟨M⊣†H≀\≀↕⊓↕⊓⊒⟩↕↕√∇≀⊑⟩⌈⌉⊔⟨⌉⌊⊣⌋∥⌈∇≀√≀{⊔⟨⌉⌉↕⌉⊑⌉\⊔⟨
I\⊔⌉∇\⊣⊔⟩≀\⊣↕W≀∇↕⌈W⟩⌈⌉W⌉⌊C≀\{⌉∇⌉\⌋⌉↙
S√⌉⊣∥⌉∇∫⌋≀\{⟩∇⇕⌉⌈
T⟩⇕B⌉∇\⌉∇∫↖L⌉⌉¬T⟩⇕⟩∫⊔⟨⌉⊒⌉↕↕∥\≀⊒\⟩\⊑⌉\⊔≀∇≀{⊔⟨⌉W⌉⌊⇔
I⊣\F≀∫⊔⌉∇¬I⊣\⟩∫⊔⟨⌉√⟩≀\⌉⌉∇≀{⊔⟨⌉G∇⟩⌈⇔⊔⟨⌉\⌉§⊔}⌉\⌉∇⊣⊔⟩≀\⟩\⊔⌉∇\⌉⊔↙
Obviously, there is not much the computer understands, and as a consequence, there is not a lot
the computer can support the reader with. So we have to “help” the computer by providing some
meaning. Conventional wisdom is that we add some semantic/functional markup. Here we pick
XML without loss of generality, and characterize some fragments of text e.g. as dates.
ℜ⊔⟩⊔↕⌉⊤WWW∈′′∈
T⟨⌉⌉↕⌉⊑⌉\⊔⟨I\⊔⌉∇\⊣⊔⟩≀\⊣↕W≀∇↕⌈W⟩⌈⌉W⌉⌊C≀\{⌉∇⌉\⌋⌉ℜ∝⊔⟩⊔↕⌉⊤
ℜ√↕⊣⌋⌉⊤S⟨⌉∇⊣⊔≀\W⊣⟩∥⟩∥⟩H≀⊔⌉↕H≀\≀↕⊓↕⊓⇔H⊣⊒⊣⟩⟩⇔USAℜ∝√↕⊣⌋⌉⊤
ℜ⌈⊣⊔⌉⊤7↖∞∞M⊣†∈′′∈ℜ∝⌈⊣⊔⌉⊤
parse 7∞∞M⊣†∈′′∈ as the date 7–11 May 2002 and add this to the user’s calendar,
parse S⟨⌉∇⊣⊔≀\W⊣⟩∥⟩∥⟩H≀⊔⌉↕H≀\≀↕⊓↕⊓⇔H⊣⊒⊣⟩⟩⇔USA as a destination and find flights.
But: do not be deceived by your ability to understand English!
To understand what a machine can understand we have to obfuscate the markup as well, since it
does not carry any intrinsic meaning to the machine either.
<√↕⊣⌋⌉>S⟨⌉∇⊣⊔≀\W⊣⟩∥⟩∥⟩H≀⊔⌉↕H≀\≀↕⊓↕⊓⇔H⊣⊒⊣⟩⟩⇔USA</√↕⊣⌋⌉>
<⌈⊣⊔⌉>7↖∞∞M⊣†∈′′∈</⌈⊣⊔⌉>
<√⊣∇⊔⟩⌋⟩√⊣\⊔∫ >R⌉}⟩∫⊔⌉∇⌉⌈√⊣∇⊔⟩⌋⟩√⊣\⊔∫⌋≀⇕⟩\}{∇≀⇕
A⊓∫⊔∇⊣↕⟩⊣⇔C⊣\⊣⌈⊣⇔C⟨⟩↕⌉D⌉\⇕⊣∇∥⇔F∇⊣\⌋⌉⇔G⌉∇⇕⊣\†⇔G⟨⊣\⊣⇔H≀\}K≀\}⇔I\⌈⟩⊣⇔
I∇⌉↕⊣\⌈⇔I⊔⊣↕†⇔J⊣√⊣\⇔M⊣↕⊔⊣⇔N⌉⊒Z⌉⊣↕⊣\⌈⇔T⟨⌉N⌉⊔⟨⌉∇↕⊣\⌈∫⇔N≀∇⊒⊣†⇔
S⟩\}⊣√≀∇⌉⇔S⊒⟩⊔‡⌉∇↕⊣\⌈⇔⊔⟨⌉U\⟩⊔⌉⌈K⟩\}⌈≀⇕⇔⊔⟨⌉U\⟩⊔⌉⌈S⊔⊣⊔⌉∫⇔V⟩⌉⊔\⊣⇕⇔Z⊣⟩∇⌉
</√⊣∇⊔⟩⌋⟩√⊣\⊔∫ >
<⟩\⊔∇≀⌈⊓⌋⊔⟩≀\>O\⊔⟨⌉7⊔⟨M⊣†H≀\≀↕⊓↕⊓⊒⟩↕↕√∇≀⊑⟩⌈⌉⊔⟨⌉⌊⊣⌋∥⌈∇≀√≀{⊔⟨⌉⌉↕⌉⊑⌉\⊔⟨I\⊔⌉∇\⊣↖
⊔⟩≀\⊣↕W≀∇↕⌈W⟩⌈⌉W⌉⌊C≀\{⌉∇⌉\⌋⌉↙</⟩\⊔∇≀⌈⊓⌋⊔⟩≀\>
<√∇≀}∇⊣⇕>S√⌉⊣∥⌉∇∫⌋≀\{⟩∇⇕⌉⌈
<∫√⌉⊣∥⌉∇>T⟩⇕B⌉∇\⌉∇∫↖L⌉⌉¬T⟩⇕⟩∫⊔⟨⌉⊒⌉↕↕∥\≀⊒\⟩\⊑⌉\⊔≀∇≀{⊔⟨⌉W⌉⌊</∫√⌉⊣∥⌉∇>
<∫√⌉⊣∥⌉∇>I⊣\F≀∫⊔⌉∇¬I⊣\⟩∫⊔⟨⌉√⟩≀\⌉⌉∇≀{⊔⟨⌉G∇⟩⌈⇔⊔⟨⌉\⌉§⊔}⌉\⌉∇⊣⊔⟩≀\⟩\⊔⌉∇\⌉⊔<∫√⌉⊣∥⌉∇>
</√∇≀}∇⊣⇕>
So we have not really gained much with the markup alone; we have to give meaning to the
markup as well, and this is where techniques from the semantic web come into play.
To understand how we can make the web more semantic, let us first take stock of the current status
of (markup on) the web. It is well-known that the world wide web is a hypertext, where multimedia
documents (text, images, videos, etc. and their fragments) are connected by hyperlinks. As we
have seen, all of these are largely opaque (non-understandable), so we end up with the following
situation (from the viewpoint of a machine).
Essentially, to make the web more machine-processable, we need to classify the resources by the
concepts they represent and give the links a meaning in such a way that we can do inference with them.
The ideas presented here gave rise to a set of technologies jointly called the “semantic web”, which
we will now summarize before we return to our logical investigations of knowledge representation
techniques.
Example 16.1.24 (A script: getting your hair cut at a beauty parlor).
tell receptionist you’re here; beautician cuts hair; pay; if happy, give a big tip, if unhappy, a small tip.
props, actors as “script variables”
events in a (generalized) sequence
use script material for anaphora, bridging references
default common ground to fill in missing material into situations
But of course logic-based approaches have big drawbacks as well. The first is that we have to obtain
the symbolic representations of knowledge to do anything – a non-trivial challenge, since most
knowledge does not exist in this form in the wild, to obtain it, some agent has to experience the
world, pass it through its cognitive apparatus, conceptualize the phenomena involved, systematize
them sufficiently to form symbols, and then represent those in the respective formalism at hand.
The second drawback is that the process models induced by logic-based approaches (inference
with calculi) are quite intractable. We will see that all inferences can be reduced to satisfiability
tests in the underlying logical system, which are exponential at best, and undecidable or even
incomplete at worst.
Therefore a major thrust in logic-based knowledge representation is to investigate logical sys-
tems that are expressive enough to be able to represent most knowledge, but still have a decidable
– and maybe even tractable in practice – satisfiability problem. Such logics are called “description
logics”. We will study the basics of such logical systems and their inference procedures in the
following.
L ::= C | ⊤ | ⊥ | ¬L | L ⊓ L | L ⊔ L | L ⊑ L | L ≡ L
The main use of the set-theoretic semantics for PL0 is that we can use it to give meaning to concept
axioms, which we use to describe the “world”.
Concept Axioms
Idea: Use logical axioms to describe the world. (Axioms restrict the class of admissible models.)
[Figure: a Venn diagram over the domain with the concepts child, son, and daughter]
Concept axioms are used to restrict the set of admissible domains to the intended ones. In our
situation, we require them to be true – as usual – which here means that they denote the whole
domain D.
Let us fortify our intuition about concept axioms with a simple example about the sibling relation.
We give four concept axioms and study their effect on the admissible models by looking at the
respective Venn diagrams. In the end we see that in all admissible models, the denotations of the
concepts son and daughter are disjoint, and child is the union of the two – just as intended.
Axioms                Semantics
son ⊑ child           [[son ⊑ child]] = D iff [[¬son]] ∪ [[child]] = D iff [[son]] ⊆ [[child]]
daughter ⊑ child      [[daughter ⊑ child]] = D iff [[¬daughter]] ∪ [[child]] = D iff [[daughter]] ⊆ [[child]]
[Figure: the corresponding Venn diagrams over sons, daughters, and children]
The set-theoretic semantics introduced above is compatible with the regular semantics of proposi-
tional logic, therefore we have the same propositional identities. Their validity can be established
Propositional Identities
Name       for ⊓                                  for ⊔
Idempot.   φ ⊓ φ = φ                              φ ⊔ φ = φ
Identity   φ ⊓ ⊤ = φ                              φ ⊔ ⊥ = φ
Absorpt.   φ ⊔ ⊤ = ⊤                              φ ⊓ ⊥ = ⊥
Commut.    φ ⊓ ψ = ψ ⊓ φ                          φ ⊔ ψ = ψ ⊔ φ
Assoc.     φ ⊓ (ψ ⊓ θ) = (φ ⊓ ψ) ⊓ θ              φ ⊔ (ψ ⊔ θ) = (φ ⊔ ψ) ⊔ θ
Distrib.   φ ⊓ (ψ ⊔ θ) = (φ ⊓ ψ) ⊔ (φ ⊓ θ)        φ ⊔ (ψ ⊓ θ) = (φ ⊔ ψ) ⊓ (φ ⊔ θ)
Absorpt.   φ ⊓ (φ ⊔ θ) = φ                        φ ⊔ (φ ⊓ θ) = φ
Morgan     ¬(φ ⊓ ψ) = ¬φ ⊔ ¬ψ                     ¬(φ ⊔ ψ) = ¬φ ⊓ ¬ψ
dneg       ¬¬φ = φ
There is another way we can approach the set description interpretation of propositional logic: by
translation into a logic that can express knowledge about sets – first-order logic.
Definition                                    Comment
p^fo(x) := p(x)
(¬A)^fo(x) := ¬A^fo(x)
(A ⊓ B)^fo(x) := A^fo(x) ∧ B^fo(x)            ∧ vs. ⊓
(A ⊔ B)^fo(x) := A^fo(x) ∨ B^fo(x)            ∨ vs. ⊔
(A ⊑ B)^fo(x) := A^fo(x) ⇒ B^fo(x)            ⇒ vs. ⊑
(A = B)^fo(x) := A^fo(x) ⇔ B^fo(x)            ⇔ vs. =
A^fo := (∀x.A^fo(x) )                         for formulae
Translation Examples
Example 16.2.8. We translate the concept axioms from ?? to fortify our intuition:
(son ⊑ child)^fo = ∀x.son(x) ⇒ child(x)
(daughter ⊑ child)^fo = ∀x.daughter(x) ⇒ child(x)
(¬(son ⊓ daughter))^fo = ∀x.¬(son(x) ∧ daughter(x))
(child ⊑ son ⊔ daughter)^fo = ∀x.child(x) ⇒ (son(x) ∨ daughter(x))
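The translation ·^fo can be turned into a few lines of code; the following Python sketch (with an assumed representation of concepts as nested tuples, not from the course materials) is only meant to illustrate the table above:

def fo(c, x="x"):
    """Translate a PL0-DL concept into a first-order formula (as a string)."""
    if isinstance(c, str):                      # atomic concept p becomes p(x)
        return f"{c}({x})"
    op = c[0]
    if op == "not":
        return f"¬{fo(c[1], x)}"
    table = {"and": "∧", "or": "∨", "subsumes": "⇒", "equal": "⇔"}
    return f"({fo(c[1], x)} {table[op]} {fo(c[2], x)})"

def axiom(c):
    return f"∀x.{fo(c)}"                        # close the formula universally

print(axiom(("subsumes", "son", "child")))      # ∀x.(son(x) ⇒ child(x))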
As we will see, the situation for PL0DL is typical for formal ontologies (even though it only offers
concepts), so we state the general description logic paradigm for ontologies. The important idea
is that having a formal system as an ontology format allows us to capture, study, and implement
ontological inference.
Idea: Build a whole family of logics for describing sets and their relations. (tailor
their expressivity and computational properties)
Definition 16.2.14. A description logic is a formal system for talking about col-
lections of objects and their relations that is at least as expressive as PL0 with
set-theoretic semantics and offers individuals and relations.
A description logic has the following four components:
a formal language L with logical constants ⊓, ¬ (complement), ⊔, ⊑, and ≡,
a set-theoretic semantics ⟨D, [[·]]⟩,
a translation into first-order logic that is compatible with ⟨D, [[·]]⟩, and
a calculus for L that induces a decision procedure for L-satisfiability.
[Figure: the embeddings PL0 → DL → PL1 – propositional connectives map to ⊓, ⊔, and complement,
propositional variables map to concepts, concepts map to unary predicates; PL1 is undecidable, the
description logic fragment is decidable]
a terminology (or TBox): concepts and roles and a set of concept axioms that
describe them, and
assertions (or ABox): a set of individuals and statements about concept mem-
bership and role relationships for them.
For convenience we add concept definitions as a mechanism for defining new concepts from old
ones. The so-defined concepts inherit the properties from the concepts they are defined from.
As PL0DL does not offer any guidance on this, we will leave the discussion of ABoxes to ?? when
we have introduced our first proper description logic ALC.
Consistency Test
Definition 16.2.24. We call a concept C consistent, iff there is no concept A,
with both C ⊑ A and C ⊑ ¬A.
Or equivalently:
Even though consistency in our example seems trivial, large ontologies can make machine support
necessary. This is even more true for ontologies that change over time. Say that an ontology
initially has the concept definitions woman=person⊓long_hair and man=person⊓bearded, and then
is modernized to a more biologically correct state. In the initial version the concept hermaphrodite
is consistent, but becomes inconsistent after the renovation; the authors of the renovation should
be made aware of this by the system.
The subsumption test determines whether the sets denoted by two concepts are in a subset relation.
The main justification for this is that humans tend to be aware of concept subsumption, and tend
to think in taxonomic hierarchies. To cater to this, the subsumption test is useful.
Subsumption Test
Example 16.2.27. In this case trivial
The good news is that we can reduce the subsumption test to the consistency test, so we can
re-use our existing implementation.
The main user-visible service of the subsumption test is to compute the actual taxonomy induced
by an ontology.
Classification
The subsumption relation among all concepts (subsumption graph)
Visualization of the subsumption graph for inspection (plausibility)
Definition 16.2.29. Classification is the computation of the subsumption graph.
[Figure: a subsumption graph (taxonomy) with top concept object and subconcepts such as person]
Remark: This only works in the presence of concept definitions, not in a purely
descriptive framework like semantic networks:
[Figure: the semantic network from above, again split into a TBox (animal, amoeba, higher animal with
has_part legs/head, tiger with pattern striped, elephant with color gray, plus can-move and eat links)
and an ABox with the instances Roy, Rex, and Clyde]
If we take stock of what we have developed so far, then we can see PL0DL as a rational reconstruction
of semantic networks restricted to the “isa” relation. We relegate the “instance” relation to ??.
This reconstruction can now be used as a basis on which we can extend the expressivity and
inference procedures without running into problems.
Reason: There are no quantifiers in PL0 (existential (∃) and universal (∀))
Idea: Use first-order predicate logic (PL1 )
ALC extends the concept operators of PL0DL with binary relations (called “roles” in ALC). This
gives ALC the expressive power we had for the basic semantic networks from ??.
Syntax of ALC
Definition 16.3.2 (Concepts). (aka. “predicates” in PL1 or “propositional
variables” in PL0DL )
Concepts in DLs represent collections of objects.
Definition 16.3.7 (Grammar). The formulae of ALC are given by the following
grammar: FALC ::= C | ⊤ | ⊥ | ¬FALC | FALC ⊓ FALC | FALC ⊔ FALC | ∃R.FALC | ∀R.FALC
ALC restricts the quantification to range over all individuals reachable as role successors. The
distinction between universal and existential quantifiers clarifies an implicit ambiguity in semantic
networks.
As before we allow concept definitions so that we can express new concepts from old ones, and
obtain more concise descriptions.
Example 16.3.18.
Definition rec?
man = person ⊓ ∃has_chrom.Y_chrom -
woman = person ⊓ ∀has_chrom.¬Y_chrom -
mother = woman ⊓ ∃has_child.person -
father = man ⊓ ∃has_child.person -
grandparent = person ⊓ ∃has_child.(mother ⊔ father) -
german = person ⊓ ∃has_parents.german +
number_list = empty_list ⊔ ∃is_first.number ⊓ ∃is_rest.number_list +
As before, we can normalize a TBox by definition expansion if it is acyclic. With the introduction
of roles and quantification, concept definitions in ALC have a more “interesting” way to be cyclic
as ?? shows.
Now that we have motivated and fixed the syntax of ALC, we will give it a formal semantics.
The semantics of ALC is an extension of the set-theoretic semantics for PL0 , thus the interpretation
[[·]] assigns subsets of the domain of discourse to concepts and binary relations over the domain
of discourse to roles.
Semantics of ALC
ALC semantics is an extension of the set-semantics of propositional logic.
Definition 16.3.25. A model for ALC is a pair ⟨D, [[·]]⟩, where D is a non-empty
set called the domain of discourse and [[·]] a mapping called the interpretation, such
that
Definition 16.3.26. The translation of ALC into PL1 extends the one from ?? by
the following quantifier rules:
(∀R.φ)^fo(x) := (∀y.R(x, y) ⇒ φ^fo(y) )        (∃R.φ)^fo(x) := (∃y.R(x, y) ∧ φ^fo(y) )
We can now use the ALC identities above to establish a useful normal form for ALC. This will
play a role in the inference procedures we study next.
The following identities will be useful later on. They can be proven directly with the settings from
??; we carry this out for one of them below.
ALC Identities
1. ¬(∃R.φ) = ∀R.¬φ                      3. ¬(∀R.φ) = ∃R.¬φ
2. ∀R.(φ ⊓ ψ) = ∀R.φ ⊓ ∀R.ψ             4. ∃R.(φ ⊔ ψ) = ∃R.φ ⊔ ∃R.ψ
Proof of 1
[[¬(∃R.φ)]] = D\[[∃R.φ]] = D\{x ∈ D | ∃y.(⟨x, y⟩ ∈ [[R]]) and (y ∈ [[φ]])}
= {x ∈ D | not ∃y.(⟨x, y⟩ ∈ [[R]]) and (y ∈ [[φ]])}
= {x ∈ D | ∀y.if (⟨x, y⟩ ∈ [[R]]) then (y ∉ [[φ]])}
= {x ∈ D | ∀y.if (⟨x, y⟩ ∈ [[R]]) then (y ∈ (D\[[φ]]))}
= {x ∈ D | ∀y.if (⟨x, y⟩ ∈ [[R]]) then (y ∈ [[¬φ]])}
= [[∀R.¬φ]]
The form of the identities (interchanging quantification with connectives) is reminiscent of the
corresponding identities for quantifiers in first-order logic.
example                                          by rule
¬(∃R.(∀S.e ⊓ ∀S.¬d))
7→ ∀R.¬(∀S.e ⊓ ∀S.¬d)                            ¬(∃R.φ) 7→ ∀R.¬φ
7→ ∀R.(¬∀S.e ⊔ ¬∀S.¬d)                           ¬(φ ⊓ ψ) 7→ ¬φ ⊔ ¬ψ
7→ ∀R.(∃S.¬e ⊔ ∃S.¬¬d)                           ¬(∀R.φ) 7→ ∃R.¬φ
7→ ∀R.(∃S.¬e ⊔ ∃S.d)                             ¬¬φ 7→ φ
Finally, we extend ALC with an ABox component. This mainly means that we define two new
assertions in ALC and specify their semantics and PL1 translation.
If we take stock of what we have developed so far, then we can see ALC as a rational recon-
struction of semantic networks restricted to the “isa” and “instance” relations – which are the only
ones that can really be given a denotational and operational semantics.
In this subsection we make good on the motivation from ?? that description logics enjoy tractable
inference procedures: We present a tableau calculus for ALC, show that it is a decision procedure,
and study its complexity.
where φ is a normalized ALC concept in negation normal form with the following
rules:
T⊥ : from x:c and x:¬c derive ⊥
T⊓ : from x:φ ⊓ ψ derive x:φ and x:ψ
T⊔ : from x:φ ⊔ ψ branch into x:φ and x:ψ
T∀ : from x:∀R.φ and x R y derive y:φ
T∃ : from x:∃R.φ derive x R y and y:φ for a new individual y
In contrast to the tableau calculi for theorem proving we have studied earlier, TALC is run
in “model generation mode”. Instead of initializing the tableau with the axioms and the negated
conjecture and hoping that all branches will close, we initialize the TALC tableau with axioms and
the “membership conjecture” that a given concept φ is satisfiable – i.e. φ has a member x, and
hope for branches that are open, i.e. that make the conjecture true (and at the same time give a
model).
Let us now work through two very simple examples; one unsatisfiable, and a satisfiable one.
TALC Examples
Example 16.3.34 (Tableau Proofs). We have two similar conjectures about
children.
x:∀has_child.man ⊓ ∃has_child.¬man              (all sons, but a daughter)
x:∀has_child.man ⊓ ∃has_child.¬man              initial
x:∀has_child.man                                T⊓
x:∃has_child.¬man                               T⊓
x has_child y                                   T∃
y:¬man                                          T∃
y:man                                           T∀
⊥                                               T⊥
inconsistent
x:∀has_child.man ⊓ ∃has_child.man               (only sons, and at least one)
Another example: this one is more complex, but the concept is satisfiable.
7 y:ugrad | y:grad      T⊔
8 ⊥       | open
The left branch is closed, the right one represents a model: y is a child of x, y
is a graduate student, x has exactly one child: y.
After we got an intuition about TALC , we can now study the properties of the calculus to determine
that it is a decision procedure for ALC.
The soundness result for TALC is as usual: we start with a model of x:φ and show that a TALC
tableau must have an open branch.
Correctness
Lemma 16.3.36. If φ satisfiable, then TALC terminates on x:φ with open branch.
Proof: Let M := ⟨D, [[·]]⟩ be a model for φ and w ∈ [[φ]].
We use the satisfaction conditions M|=(x:ψ) iff [[x]] ∈ [[ψ]], M|=x R y iff ⟨[[x]], [[y]]⟩ ∈ [[R]], and
M|=S iff M|=c for all c ∈ S.
1. We define [[x]] := w.
2. This gives us M|=(x:φ). (base case)
3. If the branch is satisfiable, then either
no rule is applicable to the leaf, (open branch)
or a rule is applicable and one new branch is satisfiable. (inductive case: next)
4. There must be an open branch. (by termination)
We complete the proof by looking at all the TALC inference rules in turn.
For the completeness result for TALC we have to start with an open tableau branch and construct a
model that satisfies all judgments in the branch. We proceed by building a Herbrand model, whose
domain consists of all the individuals mentioned in the branch and which interprets all concepts
and roles as specified in the branch. Not surprisingly, the model thus constructed satisfies (all
judgments on) the branch.
we define
D : = {x | x:ψ ∈ B or z R x ∈ B}
[ c]] : = {x | x:c ∈ B}
[ R]] : = {⟨x, y⟩ | x R y ∈ B}
We complete the proof by looking at all the TALC inference rules in turn.
case y:ψ = y:∃R.θ then {y R z, z:θ} ⊆ B (z new variable) (T∃ -rules, saturation)
so M|=(z:θ) and M|=y R z, thus M|=(y:∃R.θ). (IH, Definition)
case y:ψ = y:∀R.θ Let ⟨[[y]], v⟩ ∈ [[R]] for some v ∈ D
then v = z for some variable z with y R z ∈ B (construction of [ R]])
So z:θ ∈ B and M|=(z:θ). (T∀ -rule, saturation, Def)
As v was arbitrary we have M|=(y:∀R.θ).
Termination
Theorem 16.3.38. TALC terminates.
To prove termination of a tableau algorithm, find a well-founded measure (function)
that is decreased by all rules
T⊥ : from x:c and x:¬c derive ⊥
T⊓ : from x:φ ⊓ ψ derive x:φ and x:ψ
T⊔ : from x:φ ⊔ ψ branch into x:φ and x:ψ
T∀ : from x:∀R.φ and x R y derive y:φ
T∃ : from x:∃R.φ derive x R y and y:φ for a new individual y
We can turn the termination result into a worst-case complexity result by examining the sizes of
branches.
Complexity of TALC
Idea: Work off tableau branches one after the other. (branch size ≙ space complexity)
Observation 16.3.39. The size of the branches is polynomial in the size of the
input formula:
Proof sketch: Re-examine the termination proof and count: the first summand
comes from ??, the second one from ?? and ??
Theorem 16.3.40. The satisfiability problem for ALC is in PSPACE.
Theorem 16.3.41. The satisfiability problem for ALC is PSPACE-Complete.
In summary, the theoretical complexity of ALC is the same as that for PL0 , but in practice ALC is
much more expressive. So this is a clear win.
But the description of the tableau algorithm TALC is still quite abstract, so we look at an exemplary
implementation in a functional programming language.
consistent(S) =
if {c, ¬c} ⊆ S then false                         (∗ clash ∗)
elif ‘φ ⊓ ψ ′ ∈ S and (‘φ′ ∉ S or ‘ψ ′ ∉ S)
then consistent(S ∪ {φ, ψ})
elif ‘φ ⊔ ψ ′ ∈ S and {φ, ψ} ∩ S = ∅
then consistent(S ∪ {φ}) or consistent(S ∪ {ψ})
elif forall ‘∃R.ψ ′ ∈ S                           (∗ one role successor per existential ∗)
consistent({ψ} ∪ {θ | ‘∀R.θ′ ∈ S})
else true
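The same procedure can be written as a small runnable Python sketch; the representation of NNF concepts as nested tuples and all names below are assumptions made for this illustration, not part of the course materials:

def neg(c):
    # complement of an NNF concept (only used to detect clashes)
    return c[1] if isinstance(c, tuple) and c[0] == "not" else ("not", c)

def consistent(S):
    S = set(S)
    for c in S:                                   # clash: c and ¬c both present
        if neg(c) in S:
            return False
    for c in S:                                   # conjunction rule
        if isinstance(c, tuple) and c[0] == "and" and not {c[1], c[2]} <= S:
            return consistent(S | {c[1], c[2]})
    for c in S:                                   # disjunction rule (branching)
        if isinstance(c, tuple) and c[0] == "or" and not ({c[1], c[2]} & S):
            return consistent(S | {c[1]}) or consistent(S | {c[2]})
    for c in S:                                   # one successor per existential
        if isinstance(c, tuple) and c[0] == "exists":
            R, phi = c[1], c[2]
            succ = {phi} | {d[2] for d in S
                            if isinstance(d, tuple) and d[0] == "forall" and d[1] == R}
            if not consistent(succ):
                return False
    return True

# Example 16.3.34 revisited: ∀has_child.man ⊓ ∃has_child.¬man is inconsistent
S = {("forall", "has_child", "man"), ("exists", "has_child", ("not", "man"))}
print(consistent(S))   # False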
Note that we have (so far) only considered an empty TBox: we have initialized the tableau
with a normalized concept; so we did not need to include the concept definitions. To cover “real”
ontologies, we need to consider the case of concept axioms as well.
We now extend TALC with concept axioms. The key idea here is to realize that the concept axioms
apply to all individuals. As the individuals are generated by the T∃ rule, we can simply extend
that rule to apply all the concept axioms to the newly introduced individual.
Idea: Whenever a new variable y is introduced (by T∃ -rule) add the information
that axioms hold for y.
Initialize tableau with {x:φ} ∪ CA (CA : = set of concept axioms)
New rule for ∃: T∃CA (instead of T∃ ): from x:∃R.φ with CA = {α1 , . . ., αn } derive
x R y, y:φ, y:α1 , . . ., y:αn for a new individual y.
The problem of this approach is that it spoils termination, since we cannot control the number of
rule applications by (fixed) properties of the input formulae. The example shows this very nicely.
We only sketch a path towards a solution.
x:d              start
x:∃R.c           in CA
x R y1           T∃
y1 :c            T∃
y1 :∃R.c         T∃CA
y1 R y2          T∃
y2 :c            T∃
y2 :∃R.c         T∃CA
. . .
Solution: Loop check: instead of a new variable y take an old variable z, if we can
guarantee that whatever holds for y already holds for z. We can only do this, iff the
T∀ -rule has been exhaustively applied.
Theorem 16.3.44. The consistency problem of ALC with concept axioms is decid-
able.
Proof sketch: TALC with a suitable loop check terminates.
If we combine classification with the instance test, then we get the full picture of how concepts
and individuals relate to each other. We see that we get the full expressivity of semantic networks
in ALC.
Realization
Definition 16.3.47. Realization is the computation of all instance relations be-
tween ABox objects and TBox concepts.
Observation: It is sufficient to remember the lowest concepts in the subsumption graph.
[Figure: a subsumption graph with top concept object and subconcepts such as person, with the
ABox objects attached to the lowest concepts they instantiate]
Let us now get an intuition on what kinds of interactions can occur between the various parts of an ontology.
property                                    example
internally inconsistent                     ABox: tony:student, tony:¬student
inconsistent with a TBox                    TBox: ¬(student ⊓ prof); ABox: tony:student, tony:prof
implicit info that is not explicit          ABox: tony:∀has_grad.genius, tony has_grad mary |= mary:genius
information that can be combined            TBox: happy_prof = prof ⊓ ∀has_grad.genius;
with TBox info                              ABox: tony:happy_prof, tony has_grad mary |= mary:genius
Algorithm: Test a:φ for consistency with ABox and TBox. (use our tableau
algorithm)
Necessary changes: (no big deal)
Normalize ABox wrt. TBox. (definition expansion)
Initialize the tableau with ABox in NNF. (so it can be used)
Example 16.3.50.
Idea: Extend to more complex ABox queries. (e.g. give me all instances of φ)
This completes our investigation of inference for ALC. We summarize that ALC is a logic-based on-
tology language where the inference problems are all decidable/computable via TALC . But of course,
while we have reached the expressivity of basic semantic networks, there are still things that we
cannot express in ALC, so we will try to extend ALC without losing decidability/computability.
Note that all these examples have in common that they are about “objects on the Web”, which is
an aspect we will come to now.
“Objects on the Web” are traditionally called “resources”. Rather than defining them by their
intrinsic properties – which would be ambitious and prone to change – we take an external property
to define them: everything that has a URI is a web resource. This has repercussions on the design
of RDF.
Definition 16.4.3. A resource is anything that can have a URI, such as http://www.fau.de.
Definition 16.4.4. A property is a resource that has a name, such as author
or homepage, and a property value is the value of a property, such as Michael
Kohlhase or http://kwarc.info/kohlhase. (a property value can be another
resource)
Definition 16.4.5. An RDF statement s (also known as a triple) consists of a
resource (the subject of s), a property (the predicate of s), and a property value
(the object of s). A set of RDF triples is called an RDF graph.
The crucial observation here is that if we map “subjects” and “objects” to “individuals”, and
“predicates” to “relations”, the RDF triples are just relational ABox statements of description
logics. As a consequence, the techniques we developed apply.
Note: Actually, an RDF graph is technically a labeled multigraph, which allows multiple edges
between any two nodes (the resources) and where nodes and edges are labeled by URIs.
We now come to the concrete syntax of RDF. This is a relatively conventional XML syntax that
combines RDF statements with a common subject into a single “description” of that resource.
Note that XML namespaces play a crucial role in using element names to encode the predicate URIs.
Recall that an element name is a qualified name that consists of a namespace URI and a proper
element name (without a colon character). Concatenating them gives a URI; in our example the
predicate URI induced by the dc:creator element is http://purl.org/dc/elements/1.1/creator.
Note that as URIs go RDF URIs do not have to be URLs, but this one is and it references (is
redirected to) the relevant part of the Dublin Core elements specification [DCM12].
RDF was deliberately designed as a standoff markup format, where URIs are used to annotate
web resources by pointing to them, so that it can be used to give information about web resources
without having to change them. But this also creates maintenance problems, since web resources
may change or be deleted without warning.
RDFa gives authors a way to embed RDF triples into web resources and makes it easier to keep
the RDF statements about them in sync.
Example 16.4.9.
<div xmlns:dc="http://purl.org/dc/elements/1.1/" id="address">
<h2 about="#address" property="dc:title">RDFa as an Inline RDF Markup Format</h2>
<h3 about="#address" property="dc:creator">Michael Kohlhase</h3>
<em about="#address" property="dc:date" datatype="xsd:date"
content="2009−11−11">November 11., 2009</em>
</div>
[Figure: the RDF triples extracted from the RDFa above – the subject is the enclosing document (here
a slide source file on svn.kwarc.info), the predicates are http://purl.org/dc/elements/1.1/title,
.../creator, and .../date, and the objects are “RDFa as an Inline RDF Markup Format”,
“Michael Kohlhase”, and 2009−11−11 (xsd:date)]
In the example above, the about and property attributes are reserved by RDFa and specify the
subject and predicate of the RDF statement. The object consists of the body of the element,
unless otherwise specified, e.g. by the content and datatype attributes for literal content.
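The triples encoded by the RDFa snippet above can also be built programmatically; the following is a hedged sketch using the rdflib Python library (not part of the course materials), with a hypothetical subject URI standing in for the enclosing document:

from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import XSD

DC = Namespace("http://purl.org/dc/elements/1.1/")
doc = URIRef("http://example.org/rdfa-page#address")   # hypothetical subject URI

g = Graph()
g.add((doc, DC.title, Literal("RDFa as an Inline RDF Markup Format")))
g.add((doc, DC.creator, Literal("Michael Kohlhase")))
g.add((doc, DC.date, Literal("2009-11-11", datatype=XSD.date)))

print(g.serialize(format="turtle"))   # the same three ABox-style triples in Turtle syntax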
Let us now come back to the fact that RDF is just an XML syntax for ABox statements.
In this situation, we want a standardized representation language for TBox information; OWL
does just that: it standardizes a set of knowledge representation primitives and specifies a variety
of concrete syntaxes for them. OWL is designed to be compatible with RDF, so that the two
together can form an ontology language for the web.
But there are also other syntaxes in regular use. We show the functional syntax which is inspired
by the mathematical notation of relations.
Example 16.4.13. The semantic network from ?? can be expressed in OWL (in
functional syntax)
We have introduced the ideas behind using description logics as the basis of a “machine-oriented
web of data”. While the first OWL specification (2004) had three sublanguages “OWL Lite”, “OWL
DL” and “OWL Full”, of which only the middle was based on description logics, with the OWL2
Recommendation from 2009, the foundation in description logics was nearly universally accepted.
The semantic web hype is by now nearly over, the technology has reached the “plateau of
productivity” with many applications being pursued in academia and industry. We will not go
into these, but briefly introduce one of the tools that make this work.
SPARQL end-points can be used to build interesting applications, if fed with the appropriate data.
An interesting – and by now paradigmatic – example is the DBPedia project, which builds a large
ontology by analyzing Wikipedia fact boxes. These are in a standard HTML form which can be
analyzed e.g. by regular expressions, and their entries are essentially already in triple form: The
subject is the Wikipedia page they are on, the predicate is the key, and the object is either the
URI of the object value (if it carries a link) or the value itself.
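As a rough illustration (assuming the public DBpedia SPARQL endpoint at https://dbpedia.org/sparql is reachable; the resource chosen is arbitrary), a few such triples can be retrieved programmatically:

import requests

query = """
SELECT ?p ?o WHERE { <http://dbpedia.org/resource/Nuremberg> ?p ?o } LIMIT 5
"""
resp = requests.get("https://dbpedia.org/sparql",
                    params={"query": query,
                            "format": "application/sparql-results+json"})
for row in resp.json()["results"]["bindings"]:      # standard SPARQL JSON results
    print(row["p"]["value"], row["o"]["value"])     # predicate URI and object value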
We conclude our survey of the semantic web technology stack with the notion of a triple store:
the database component that stores vast collections of ABox triples.
This part covers the AI subfield of “planning”, i.e. search-based problem solving with a structured
representation language for environment state and actions — in planning, the focus is on the latter.
We first introduce the framework of planning (structured representation languages for problems
and actions) and then present algorithms and complexity results. Finally, we lift some of the
simplifying assumptions – deterministic, fully observable environments – we made in the previous
parts of the course.
Chapter 17
Planning I: Framework
Planning
Ambition: Write one program that can solve all classical search problems.
Idea: For CSP, going from “state/action-level search” to “problem-description level
search” did the trick.
Definition 17.0.2. Let Π be a search problem (see ??)
Let us recall the agent-based setting we were using for the inference procedures from ??. We will
elaborate this further in this section.
[Figure: a model-based reflex agent (AIMA Figure 2.12) – sensors feed percepts into an internal state,
which is updated using models of “how the world evolves” and “what my actions do”; the agent then
chooses an action and passes it to the actuators acting on the environment]
Still Unspecified: (up next)
MAKE−PERCEPT−SENTENCE: the effects of percepts.
MAKE−ACTION−QUERY: what is the best next action?
MAKE−ACTION−SENTENCE: the effects of that action.
In particular, we will look at the effect of time/change. (neglected so far)
Now that we have the notion of fluents to represent the percepts at a given time point, let us try
to model how they influence the agent’s world model.
Axioms like these model the agent’s sensors – here that they are totally reliable:
there is a breeze, iff the agent feels a draft.
Definition 17.1.5. We call fluents that describe the agent’s sensors sensor axioms.
You may have noticed that for the sensor axioms we have only used first-order logic. There is a
general story to tell here: If we have finite domains (as we do in the Wumpus cave) we can always
“compile first-order logic into propositional logic”; if domains are infinite, we usually cannot.
We will develop this here before we go on with the Wumpus models.
We now continue to our logic-based agent models: Now we focus on effect axioms to model the
effects of an agent’s actions.
Example 17.1.6. The action of “going forward” at time t is captured by the fluent
forw(t).
Definition 17.1.7. Effect axioms describe how the environment changes under an
agent’s actions.
Example 17.1.8. If the agent is in cell [1, 1] facing east at time 0 and goes forward,
she is in cell [2, 1] and no longer in [1, 1]:
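In the fluent notation used here, such an effect axiom could be written as follows (one possible formalization; the predicate names are chosen for illustration):

At([1, 1], 0) ∧ FacingEast(0) ∧ forw(0) ⇒ At([2, 1], 1) ∧ ¬At([1, 1], 1)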
Unfortunately, the percept fluents, sensor axioms, and effect axioms are not enough, as we will
show in ??. We will see that this is a more general problem – the famous frame problem.
then some special code for action selection, and then (up next)
action := POP(plan)
TELL(KB, MAKE−ACTION−SENTENCE(action,t))
t := t + 1
return action
Note that OK, wumpus, and glitter are fluents, since the Wumpus might have died
or the gold might have been grabbed.
And finally the route planning part of the code. This is essentially just A∗ search.
Evaluation: Even though this works for the Wumpus world, it is not the “universal,
logic-based problem solver” we dreamed of!
Planning tries to solve this with another representation of actions. (up next)
Application: Business Process Templates at SAP
[Figure: a business process template at SAP – Create CQ, Submit CQ, Check CQ Completeness,
Check CQ Consistency, Decide CQ Approval (approval necessary or not necessary), Mark CQ as
Accepted, Create Follow-Up for CQ, Check CQ Approval Status, Archive CQ]
[Figure (repeated over four slides): a network security scenario – an attacker on the Internet, behind
a router and firewall a DMZ with a web server and an application server, and an internal network
with workstations, a DB server, and sensitive users]
Quick: Rapid prototyping: 10s lines of problem description vs. 1000s lines of C++
code. (E.g. language generation)
Flexible: Adapt/maintain the description. (E.g. network security)
Intelligent: Determines automatically how to solve a complex problem efficiently!
(The ultimate goal, no?!)
Efficiency loss: Without any domain-specific knowledge about chess, you don’t
beat Kasparov . . .
Trade-off between “automatic and general” vs. “manual work but efficient”.
Research Question: How to make fully automatic algorithms efficient?
Search Planning
States Lisp data structures Logical sentences
Actions Lisp code Preconditions/outcomes
Goal Lisp code Logical sentence (conjunction)
Plan Sequence from S0 Constraints on actions
Recall: Our heuristic search algorithms (duplicate pruning omitted for simplicity)
function Greedy_Best−First_Search (problem)
returns a solution, or failure
For A∗ :
order frontier by g + h instead of h (line 4)
insert g(n′ ) + h(n′ ) instead of h(n′ ) into frontier (last line)
Observation 17.2.4. State spaces typically are huge even for simple problems.
In other words: Even solving “simple problems” automatically (without help from
a human) requires a form of intelligence.
With blind search, even the largest super computer in the world won’t scale beyond
20 blocks!
Focussing on heuristic search as the solution method, this is the main question
that needs to be answered.
3. The PDDL Language: What do the input files for off-the-shelf planning software
look like?
So you can actually play around with such software. (Exercises!)
4. Planning Complexity: How complex is planning?
The price of generality is complexity, and here’s what that “price” is, exactly.
Prerequisite/Result:
[Figure: the Sussman anomaly – initial state: C on A, A and B on the table; goal state: A on B on C]
Simple planners that split the goal into subgoals on(A, B) and on(B, C) fail:
STRIPS Planning
Definition 17.4.1. STRIPS = Stanford Research Institute Problem Solver.
I’ll outline some extensions beyond STRIPS later on, when we discuss PDDL.
Historical note: STRIPS [FN71] was originally a planner (cf. Shakey), whose
language actually wasn’t quite that simple.
We will often give each action a ∈ A a name (a string), and identify a with that
name.
Note: We assume, for simplicity, that every action has cost 1. (Unit costs, cf.
??)
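To make the STRIPS formalization concrete, here is a minimal Python sketch (not part of the course materials; the encoding of facts as strings and the simple breadth-first planner are illustrative assumptions), using the simplified trip task from the following example as data:

from collections import namedtuple

Action = namedtuple("Action", "name pre add delete")

actions = [
    Action("drv(Sy,Ad)", {"at(Sy)"}, {"at(Ad)", "vis(Ad)"}, {"at(Sy)"}),
    Action("drv(Ad,Sy)", {"at(Ad)"}, {"at(Sy)"}, {"at(Ad)"}),
    Action("drv(Sy,Br)", {"at(Sy)"}, {"at(Br)", "vis(Br)"}, {"at(Sy)"}),
    Action("drv(Br,Sy)", {"at(Br)"}, {"at(Sy)"}, {"at(Br)"}),
]
init = frozenset({"at(Sy)", "vis(Sy)"})
goal = {"vis(Sy)", "vis(Ad)", "vis(Br)"}

def successors(state):
    for a in actions:
        if a.pre <= state:                          # action applicable?
            yield a.name, frozenset((state - a.delete) | a.add)

def bfs_plan(state, goal):
    """Breadth-first search over the induced state space."""
    frontier, seen = [(state, [])], {state}
    while frontier:
        s, plan = frontier.pop(0)
        if goal <= s:
            return plan
        for name, t in successors(s):
            if t not in seen:
                seen.add(t)
                frontier.append((t, plan + [name]))

print(bfs_plan(init, goal))   # e.g. ['drv(Sy,Ad)', 'drv(Ad,Sy)', 'drv(Sy,Br)']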
“TSP” in Australia
Example 17.4.3 (Salesman Travelling in Australia).
[Figure: the state space of the simplified task – states are sets of facts over at(Sy), at(Ad), at(Br),
vis(Sy), vis(Ad), vis(Br); the actions drv(Sy, Ad), drv(Ad, Sy), drv(Sy, Br), and drv(Br, Sy) move
between them]
Answer: Yes, two – plans for TSP− are solutions for ΘTSP− , dashed node ≙ I,
thick nodes ≙ G:
drv(Sy, Br), drv(Br, Sy), drv(Sy, Ad) (upper path)
drv(Sy, Ad), drv(Ad, Sy), drv(Sy, Br). (lower path)
The Blocksworld
Definition 17.4.8. The blocks world is a simple planning domain: a set of wooden
blocks of various shapes and colors sit on a table. The goal is to build one or more
vertical stacks of blocks. Only one block may be moved at a time: it may either be
placed on the table or placed atop another block.
Example 17.4.9.
[Figure: an example blocks world planning task – an initial configuration and a goal configuration of
the blocks A–E]
The next example for a planning task is not obvious at first sight, but has been quite influential,
showing that many industry problems can be specified declaratively by formalizing the domain
and the particular planning tasks in PDDL and then using off-the-shelf planners to solve them.
[KS00] reports that this has significantly reduced labor costs and increased maintainability of the
implementation.
[Figure: passenger types in the elevator (Miconic-10) domain – P: normal passenger, VIP, D, NA: never-alone, AT: attendant.]
[Figure: the blocks world task from above – initial state C on A with B on the table, goal A on B on C.]
Simple planners that split the goal into subgoals on(A, B) and on(B, C) fail:
Before we go into the details, let us try to understand the main ideas of partial order planning.
We now make the ideas discussed above concrete by giving a mathematical formulation. It is
advantageous to cast a partially ordered plan as a labeled DAG rather than a partial ordering
since it draws attention to the difference between actions and steps.
Notation: A causal link S −p→ T can also be denoted by a direct arrow between the effect p of S and the precondition p of T in the STRIPS action notation above. Temporal constraints are shown as dashed arrows.
Planning Process
Definition 17.5.10. Partial order planning is search in the space of partial plans
via the following operations:
add link from an existing action to an open precondition,
add step (an action with links to other steps) to fulfil an open precondition,
order one step wrt. another (by adding temporal constraints) to remove possible
conflicts.
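The following minimal Python sketch shows one way the space of partial plans and the three refinement operations could be represented; the class and method names are illustrative assumptions, not the notes' formalization.

from dataclasses import dataclass, field

@dataclass
class PartialPlan:
    steps: dict = field(default_factory=dict)       # step id -> action (incl. Start/Finish)
    orderings: set = field(default_factory=set)     # temporal constraints: (before, after)
    causal_links: set = field(default_factory=set)  # (producer, fact p, consumer)
    open_preconds: set = field(default_factory=set) # (fact p, step that still needs p)

    # The three refinement operations of Definition 17.5.10:
    def add_link(self, producer, p, consumer):
        self.causal_links.add((producer, p, consumer))
        self.orderings.add((producer, consumer))
        self.open_preconds.discard((p, consumer))

    def add_step(self, step_id, action, p, consumer):
        self.steps[step_id] = action
        for q in action.pre:                        # a new step brings new open preconditions
            self.open_preconds.add((q, step_id))
        self.add_link(step_id, p, consumer)

    def order(self, before, after):                 # resolve a possible conflict by ordering
        self.orderings.add((before, after))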
[Figure: step-by-step construction of a partially ordered plan for the shopping domain – the Start step provides At(Home), Sell(HWS, Drill), Sell(SM, Milk); the steps Go(HWS), Go(SM), and Go(Home) are successively added, with causal links for At(Home), At(HWS), and At(SM), to fulfill the open preconditions of the Finish step.]
Here we show a successful search for a partially ordered plan. We start out by initializing the plan
with the respective start and finish steps. Then we consecutively add steps to fulfill the open
preconditions – marked in red – starting with those of the finish step.
In the end we add three temporal constraints that complete the partially ordered plan.
The search process for the links and steps is relatively plausible and standard in this example, but
we do not have any idea where the temporal constraints should systematically come from. We
look at this next.
[Figure: resolving a threat – the step Go(Home) conflicts with the causal link Go(SM) –At(SM)→ Buy(Milk); demotion ≙ put the conflicting step before the link, promotion ≙ put it after the link.]
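Assuming a partial-plan representation like the sketch above, threat resolution by promotion/demotion can be phrased as a tiny helper that returns the two candidate ordering constraints; this is an illustrative sketch, not the notes' algorithm.

def threat_resolutions(clobberer, producer, consumer):
    """Candidate orderings protecting the causal link producer --p--> consumer
    against a step that deletes p."""
    return [
        (clobberer, producer),   # demotion: put the clobbering step before the link
        (consumer, clobberer),   # promotion: put the clobbering step after the link
    ]
    # A POP planner tries each candidate (plan.order(a, b)) and backtracks if it
    # creates a cycle in the ordering constraints.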
Properties of POP
Nondeterministic algorithm: backtracks at choice points on failure:
Initializing the partial order plan with Start and Finish.
[Figure: successive snapshots of the partial order plan for the blocks world task – Start provides On(C, A), On(A, T), Cl(B), On(B, T), Cl(C); the steps Move(B, C) (preconditions Cl(B), Cl(C); effects ¬Cl(C), On(B, C)) and Move(A, B) (preconditions Cl(A), Cl(B); effects ¬Cl(B), On(A, B)) are added with causal links to fulfill the open preconditions.]
17.7 Conclusion
A Video Nugget covering this section can be found at https://fau.tv/clip/id/26900.
Summary
General problem solving attempts to develop solvers that perform well across a large
class of problems.
Planning, as considered here, is a form of general problem solving dedicated to the
class of classical search problems. (Actually, we also address inaccessible, stochastic,
dynamic, continuous, and multi-agent settings.)
Suggested Reading:
• Chapters 10: Classical Planning and 11: Planning and Acting in the Real World in [RN09].
– Although the book is named “A Modern Approach”, the planning section was written long
before the IPC was even dreamt of, before PDDL was conceived, and several years before
heuristic search hit the scene. As such, what we have right now is the attempt of two outsiders
trying in vain to catch up with the dramatic changes in planning since 1995.
– Chapter 10 is Ok as a background read. Some issues are, imho, misrepresented, and it’s far
from being an up-to-date account. But it’s Ok to get some additional intuitions in words
different from my own.
– Chapter 11 is useful in our context here because we don’t cover any of it. If you’re interested
in extended/alternative planning paradigms, do read it.
• A good source for modern information (some of which we covered in the course) is Jörg
Hoffmann’s Everything You Always Wanted to Know About Planning (But Were Afraid to
Ask) [Hof11] which is available online at http://fai.cs.uni-saarland.de/hoffmann/papers/ki11.pdf
Chapter 18
Planning II: Algorithms
18.1 Introduction
A Video Nugget covering this section can be found at https://fau.tv/clip/id/26901.
In planning, this is referred to as forward search, or forward state-space search.
[Figure: heuristic search – a map from init to goal where the expanded states carry cost estimates h.]
Heuristic function h estimates the cost of an optimal path from a state s to the
goal state; search prefers to expand states s with small h(s).
Live Demo vs. Breadth-First Search:
http://qiao.github.io/PathFinding.js/visual/
Exactly like our definition from ??, except that, because we assume unit costs here, we use N instead of R+.
Definition 18.1.2. Let Π be a STRIPS task with states S. The perfect heuristic
h∗ assigns every s ∈ S the length of a shortest path from s to a goal state, or ∞
if no such path exists. A heuristic h for Π is admissible if, for all s ∈ S, we have
h(s) ≤ h∗ (s).
Exactly like our definition from ??, except for path length instead of path cost (cf.
above).
In all cases, we attempt to approximate h∗ (s), the length of an optimal plan for s.
Some algorithms guarantee to lower bound h∗ (s).
The delete relaxation is the most successful method for the automatic generation
of heuristic functions. It is a key ingredient to almost all IPC winners of the last
decade. It relaxes STRIPS tasks by ignoring the delete lists.
The h+ Heuristic: What is the resulting heuristic function?
How to Relax
[Diagram: relaxation – the perfect heuristic h∗P maps P to N ∪ {∞}; the relaxation mapping R maps P into the simpler class P′, whose perfect heuristic h∗P′ is used as the estimate.]
1. You have a class P of problems, whose perfect heuristic h∗P you wish to estimate.
2. You define a class P ′ of simpler problems, whose perfect heuristic h∗P ′ can be
used to estimate h∗P .
3. You define a transformation – the relaxation mapping R – that maps instances
Π ∈ P into instances Π′ ∈ P ′ .
4. Given Π ∈ P, you let Π′ := R(Π), and estimate h∗P(Π) by h∗P′(Π′).
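The recipe translates almost literally into code; a minimal sketch, assuming a relaxation mapping relax (the R above) and a solver solve_optimally for the simpler class P′ are provided, and that tasks support restarting from a given state.

def relaxation_heuristic(task, state, relax, solve_optimally):
    """Estimate h*(s) in the original task by optimally solving the relaxed
    instance: h(s) := h*_{P'}(R(Pi_s))."""
    task_s = task.with_initial_state(state)   # Pi_s: the original task started in s
    relaxed = relax(task_s)                   # step 3: apply the relaxation mapping R
    return solve_optimally(relaxed)           # step 4: perfect heuristic of the simpler class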
Relaxation in Route-Finding
We will start with a very simple relaxation, which could be termed “positive thinking”: we do not consider preconditions of actions and leave out the delete lists as well.
[Figure series: greedy best-first search (tie-breaking: alphabetic) on the transportation example, guided by the “only-adds” relaxation. States are written XY for “truck at X, package at Y” (T ≙ in the truck); the initial state is AC and the goal is AD. At each expansion the real problem (actions with preconditions, add and delete lists) generates successors via drive (drXY), load (loX), and unload (ulX) actions, while the relaxed problem (add lists only) yields the heuristic value, e.g. hR(AC) = 1 via ⟨ulD⟩ and hR(BC) = hR(CC) = hR(DC) = hR(CT) = 2 via ⟨drBA, ulD⟩. The only-adds heuristic values stay between 1 and 2 throughout, so they give the search hardly any guidance.]
[Diagram: the delete relaxation is a native relaxation – P′ ⊆ P, and h∗P′ (with values in N ∪ {∞}) is used to estimate h∗P.]
“When the world changes, its previous state remains true as well.”
[Figure: real world vs. relaxed world, before and after an action – in the relaxed world everything that was true before remains true after the action.]
In other words, the class of simpler problems P ′ is the set of all STRIPS tasks with
empty delete lists, and the relaxation mapping R drops the delete lists.
Definition 18.3.2 (Relaxed Plan). Let Π := ⟨P , A, I , G⟩ be a STRIPS task, and
let s be a state. A relaxed plan for s is a plan for ⟨P , A, s, G⟩+ . A relaxed plan for
I is called a relaxed plan for Π.
A relaxed plan for s is an action sequence that solves s when pretending that all
delete lists are empty.
Also called delete-relaxed plan: “relaxation” is often used to mean delete relaxation
by default.
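Over a list-based STRIPS representation, the relaxation mapping itself is a one-liner; a sketch, assuming actions are given as (name, pre, add, delete) tuples of fact sets.

def delete_relax(task):
    """Return the delete relaxation Pi^+ of a STRIPS task: identical to Pi,
    except that every action's delete list is emptied."""
    facts, actions, init, goal = task
    relaxed_actions = [(name, pre, add, frozenset())
                       for (name, pre, add, dele) in actions]
    return (facts, relaxed_actions, init, goal)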
load(x)+ : “truck(x), pack(x) ⇒ pack(T)”.
unload(x)+ : “truck(x), pack(T) ⇒ pack(x)”.
Relaxed plan:
⟨drive(A, B)+, drive(B, C)+, load(C)+, drive(C, D)+, unload(D)+⟩
We don’t need to drive the truck back, because “it is still at A”.
PlanEx+
Definition 18.3.3 (Relaxed Plan Existence Problem). By PlanEx+ , we denote
the problem of deciding, given a STRIPS task Π := ⟨P , A, I , G⟩, whether or not
there exists a relaxed plan for Π.
Iterations on F :
1. {at(Sy), vis(Sy)}
2. ∪ {at(Ad), vis(Ad), at(Br), vis(Br)}
3. ∪ {at(Da), vis(Da), at(Pe), vis(Pe)}
2. ∪{truck(B)}
3. ∪{truck(C)}
4. ∪{truck(D), pack(T )}
5. ∪{pack(A), pack(B), pack(D)}
2. ∪{truck(B)}
3. ∪{truck(C)}
4. ∪{pack(T )}
5. ∪{pack(A), pack(B)}
6. ∪∅
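The iterations on F above are exactly the fixpoint that decides PlanEx+ in polynomial time: keep adding the add lists of all actions whose preconditions are already contained in F until nothing changes, and check whether the goal ends up inside F. A sketch in Python, assuming actions are given as (pre, add) pairs of fact sets:

def relaxed_reachable_facts(init, actions):
    """Fixpoint of F := I; F := F plus the add lists of all actions applicable in F."""
    F = set(init)
    changed = True
    while changed:
        changed = False
        for pre, add in actions:
            if pre <= F and not add <= F:
                F |= add
                changed = True
    return F

def plan_ex_plus(init, actions, goal):
    """Decide PlanEx+: does a relaxed plan exist?"""
    return goal <= relaxed_reachable_facts(init, actions)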
h+ is Admissible
Lemma 18.4.3. Let Π := ⟨P, A, I, G⟩ be a STRIPS task, and let s be a state. If ⟨a1, . . ., an⟩ is a plan for Πs := ⟨P, A, {s}, G⟩, then ⟨a1+, . . ., an+⟩ is a plan for Πs+.
If we ignore deletes, the states along the plan can only get bigger.
Theorem 18.4.4. h+ is admissible.
Proof:
1. Let Π := ⟨P, A, I, G⟩ be a STRIPS task with states S, and let s ∈ S.
2. h+(s) is defined as the length of an optimal plan for Πs+.
3. By the lemma above, any plan for Πs also constitutes a plan for Πs+.
4. Thus the optimal plan length in Πs+ can only be shorter than that in Πs, and the claim follows.
[Figure series: greedy best-first search (tie-breaking: alphabetic) on the same transportation example, now guided by h+ (the relaxed problems keep preconditions and add lists, but ignore delete lists). Along the search the h+ values decrease towards the goal: h+ = 5 for the early states AC, BC, CC, DC, then h+(CT) = 4 with relaxed plan e.g. ⟨drCB, drBA, drCD, ulD⟩, and 4, 3, 2, 1, 0 for DT, DD, CD, BD, and the goal state AD along the path loC, drCD, ulD, drDC, drCB, drBA. In contrast to the only-adds relaxation, h+ provides useful guidance here.]
h+ in the Blocksworld
[Figure: blocks world task – initial state: A is held by the arm, B is on D, C and D are on the table; goal: A on B on C.]
Optimal plan: ⟨putdown(A), unstack(B, D), stack(B, C), pickup(A), stack(A, B)⟩.
Optimal relaxed plan: ⟨stack(A, B), unstack(B, D), stack(B, C)⟩.
Observation: What can we say about the “search space surface” at the initial
state here?
The initial state lies on a local minimum under h+ , together with the successor
state s where we stacked A onto B. All other direct neighbors of these two states
have a strictly higher h+ value.
18.5 Conclusion
A Video Nugget covering this section can be found at https://fau.tv/clip/id/26906.
Summary
Heuristic search on classical search problems relies on a function h mapping states
s to an estimate h(s) of their goal state distance. Such functions h are derived by
solving relaxed problems.
In planning, the relaxed problems are generated and solved automatically. There
are four known families of suitable relaxation methods: abstractions, landmarks,
critical paths, and ignoring deletes (aka delete relaxation).
The delete relaxation consists in dropping the deletes from STRIPS tasks. A relaxed
plan is a plan for such a relaxed task. h+ (s) is the length of an optimal relaxed plan
for state s. h+ is NP-hard to compute.
hFF approximates h+ by computing some, not necessarily optimal, relaxed plan.
That is done by a forward pass (building a relaxed planning graph), followed by a
backward pass (extracting a relaxed plan).
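As an illustration of the two passes just described, here is a much-simplified Python sketch of an hFF-style computation: a forward pass builds relaxed reachability with "first achievers", and a backward pass chains back from the goal, counting the selected actions. This is not Hoffmann's actual FF implementation, and the action format (name, pre, add) is an assumption.

def h_ff(state, actions, goal):
    """Approximate h+ by extracting some (not necessarily optimal) relaxed plan."""
    # Forward pass: grow the set of reachable facts, remembering a first achiever.
    achiever, reached = {}, set(state)
    progress = True
    while progress and not goal <= reached:
        progress = False
        for name, pre, add in actions:
            if pre <= reached:
                for p in add - reached:
                    achiever[p] = (name, pre)
                    reached.add(p)
                    progress = True
    if not goal <= reached:
        return float("inf")             # not even a relaxed plan exists
    # Backward pass: chain back from the goal facts through the achievers.
    plan, closed = set(), set()
    agenda = [p for p in goal if p not in state]
    while agenda:
        p = agenda.pop()
        if p in closed:
            continue
        closed.add(p)
        name, pre = achiever[p]
        plan.add(name)
        agenda.extend(q for q in pre if q not in state)
    return len(plan)                    # length of the extracted relaxed plan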
Hand-tailored planning: Automatic planning is the extreme case where the com-
puter is given no domain knowledge other than “physics”. We can instead allow the
user to provide search control knowledge, trading off modeling effort against search
performance.
Numeric planning, temporal planning, planning under uncertainty . . .
Outline
So Far: we made idealizing/simplifying assumptions:
The environment is fully observable and deterministic.
Chapter 19
Searching, Planning, and Acting in the Real World
19.1 Introduction
A Video Nugget covering this section can be found at https://fau.tv/clip/id/26908.
Definition 19.1.4. The qualification problem in planning is that we can never finish
listing all the required preconditions and possible conditional effects of actions.
Root Cause: The environment is partially observable and/or non-deterministic.
Technical Problem: We cannot know the “current state of the world”, but search/planning algorithms are based on this assumption.
We formalize the example in PDDL for simplicity. Note that the :percept scheme is not part of
the official PDDL, but fits in well with the design.
The PDDL problem file has a “free” variable ?c for the (undetermined) joint
color.
(define (problem tc−coloring)
(:domain furniture−objects)
(:objects table chair c1 c2)
(:init (object table) (object chair) (can c1) (can c2) (inview table))
(:goal (color chair ?c) (color table ?c)))
Two action schemata: remove can lid to open and paint with open can
(:action remove−lid
:parameters (?x)
:precondition (can ?x)
:effect (open ?x))
(:action paint
:parameters (?x ?y)
:precondition (and (object ?x) (can ?y) (color ?y ?c) (open ?y))
:effect (color ?x ?c))
has a universal variable ?c for the paint action ⇝ we cannot just give paint a
color argument in a partially observable environment.
Sensorless Plan: Open one can, paint chair and table in its color.
Note: Contingent planning can create better plans, but needs perception
Two percept schemata: color of an object and color in a can
(:percept color
:parameters (?x ?c)
:precondition (and (object ?x) (inview ?x)))
(:percept can−color
:parameters (?x ?c)
:precondition (and (can ?x) (inview ?x) (open ?x)))
To perceive the color of an object, it must be in view; to perceive the color in a can, the can must also be open.
Note: In a fully observable world, the percepts would not have preconditions.
An action schema: looking at an object causes it to come into view.
(:action lookat
:parameters (?x)
:precondition (and (inview ?y) (notequal ?x ?y))
:effect (and (inview ?x) (not (inview ?y))))
Contingent Plan:
1. look at furniture to determine color, if same ; done.
2. else, look at open and look at paint in cans
3. if paint in one can is the same as an object, paint the other with this color
4. else paint both in any color
Conditional Plans
Definition 19.3.1. Conditional plans extend the possible actions in plans by conditional steps that execute sub plans depending on whether K + P ⊨ C holds, where K + P is the current knowledge base plus the percepts.
Definition 19.3.3. If the possible percepts are limited to determining the current
state in a conditional plan, then we speak of a contingency plan.
Note: Need some plan for every possible percept! Compare to
game playing: some response for every opponent move.
backchaining: some rule such that every premise satisfied.
Idea: Use an AND–OR tree search (very similar to backward chaining algorithm)
[Figure: part of the AND-OR search tree for the vacuum world – OR choices Suck and Right, with leaves marked GOAL and LOOP.]
Idea: Use AND-OR trees as data structures for representing problems (or goals) that can be reduced to conjunctions and disjunctions of subproblems (or subgoals).
Definition 19.3.5. An AND-OR graph is a graph whose non-terminal nodes are partitioned into AND nodes and OR nodes. A valuation of an AND-OR graph T is an assignment of T or F to the nodes of T. A valuation of the terminal nodes of T can be extended to all nodes recursively: assign T to an
OR node, iff at least one of its children is T,
AND node, iff all of its children are T.
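A minimal Python sketch of the recursive AND-OR search over such problems, in the spirit of AIMA's AND-OR-GRAPH-SEARCH but simplified; the problem interface (is_goal, actions, results) is an assumption for illustration.

def and_or_search(state, problem, path=()):
    """Return a conditional plan (nested lists/dicts) or None on failure."""
    if problem.is_goal(state):
        return []                                  # empty plan: goal reached
    if state in path:
        return None                                # loop: fail on this branch
    for action in problem.actions(state):
        # AND node: we need a sub plan for *every* possible outcome of the action.
        subplans = {}
        for outcome in problem.results(state, action):
            sub = and_or_search(outcome, problem, path + (state,))
            if sub is None:
                break                              # one outcome unsolved -> try the next action
            subplans[outcome] = sub
        else:
            return [action, subplans]              # OR node: one working action suffices
    return None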
[L1 : left, if AtR then L1 else [if CleanL then ∅ else suck fi] fi] or
[while AtR do [left] done, if CleanL then ∅ else suck fi]
We have an infinite loop, but the plan eventually works unless the action always fails.
a belief state that has information about the possible states the world may be
in, and
a sensor model that updates the belief state based on sensor information
a transition model that updates the belief state based on actions.
Idea: The agent environment determines what the world model can be.
That is exactly what we have been doing until now: we have been studying methods that
build on descriptions of the “actual” world, and have been concentrating on the progression from
atomic to factored and ultimately structured representations. Tellingly, we spoke of “world states”
instead of “belief states”; we have now justified this practice in the brave new belief-based world
models by the (re-) definition of “world states” above. To fortify our intuitions, let us recap from
a belief-state-model perspective.
Let us now see what happens when we lift the restrictions of total observability and determin-
ism.
mix the ideas from the last two. (sensor model + transition relation)
Conformant/Sensorless Planning
Definition 19.5.1. Conformant or sensorless planning tries to find plans that work without any sensing.
Observation 19.5.3. In a sensorless world we do not know the initial state. (or
any state after)
Observation 19.5.4. Sensorless planning must search in the space of belief states
(sets of possible actual states).
Let us see if we can understand the options for T b(a, S) a bit better. The first question is when we want an action a to be applicable to a belief state S (a set of world states), i.e. when T b(a, S) should be non-empty.
In the first case, a^b would be applicable iff a is applicable to some s ∈ S; in the second case, iff a is applicable to all s ∈ S. So we only want to choose the first case if actions are harmless.
The second question we ask ourselves is what the result of applying a to S should be. Again, if actions are harmless, we can just collect the results; otherwise, we need to make sure that all members of the result a^b are reached for all possible states in S.
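The two readings discussed here can be put side by side in code; a hedged sketch, assuming applicable(a, s) and apply(a, s) from the underlying physical transition model, and treating the “applicable to some s” case by leaving the other states unchanged (one possible choice, not the notes' definition).

def tb_some(a, S, applicable, apply):
    """Option 1: a is applicable to the belief state S if it is applicable
    to *some* s in S; states where a is inapplicable stay as they are."""
    if not any(applicable(a, s) for s in S):
        return frozenset()                         # not applicable at all
    return frozenset(apply(a, s) if applicable(a, s) else s for s in S)

def tb_all(a, S, applicable, apply):
    """Option 2: a is applicable only if it is applicable to *all* s in S."""
    if not all(applicable(a, s) for s in S):
        return frozenset()
    return frozenset(apply(a, s) for s in S)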
[Figure 3.3 (AIMA): The state space for the vacuum world. Links denote actions: L = Left, R = Right, S = Suck.]
Problem: Belief states are HUGE; e.g. the initial belief state for the 10 × 10 vacuum world already contains 100 · 2^100 ≈ 10^32 physical states.
In ??, since the environment was observable and deterministic we could just use
offline planning.
In ?? because we chose to.
Note: If the world is nondeterministic or partially observable, then percepts usually provide information, i.e., split up the belief state.
The prediction stage computes the belief state b̂ := PRED(b, a) resulting from the action a.
The update stage determines, for each possible percept, the resulting belief state: UPDATE(b̂, o) := {s | o = PERC(s) and s ∈ b̂}.
The functions PRED and PERC are the main parameters of this model. We define RESULT(b, a) := {UPDATE(PRED(b, a), o) | o ∈ PossPERC(PRED(b, a))}.
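For deterministic actions, the prediction and update stages can be written out directly; a sketch, assuming a physical transition function T(s, a) and a percept function PERC(s) as illustrative parameters.

def PRED(b, a, T):
    """Prediction stage: the belief state after doing a in belief state b."""
    return frozenset(T(s, a) for s in b)

def UPDATE(b_hat, o, PERC):
    """Update stage: keep only the states consistent with the observed percept o."""
    return frozenset(s for s in b_hat if PERC(s) == o)

def RESULT(b, a, T, PERC):
    """All belief states that can result from doing a and then perceiving."""
    b_hat = PRED(b, a, T)
    possible_percepts = {PERC(s) for s in b_hat}
    return {o: UPDATE(b_hat, o, PERC) for o in possible_percepts}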
The action Right is deterministic; sensing disambiguates the resulting belief state into singletons. In the slippery world, Right is nondeterministic.
[Figure 4.14 (AIMA): Two examples of transitions in local-sensing vacuum worlds. (a) In the deterministic world, Right is applied in the initial belief state, resulting in a new predicted belief state with two possible physical states; for those states, the possible percepts are [R, Dirty] and [R, Clean], leading to two belief states, each of which is a singleton. (b) In the slippery world, Right is applied in the initial belief state, giving a new belief state with four physical states; for those states, the possible percepts are [L, Dirty], [R, Dirty], and [R, Clean], leading to three belief states as shown.]
Belief-State Search with Percepts
Observation: The belief-state transition model induces an AND-OR graph.
Idea: Use AND-OR search as in nondeterministic environments.
[Figure 4.15 (AIMA): The first level of the AND-OR search tree for a problem in the local-sensing vacuum world; Suck is the first action in the solution.]
Solution: [Suck, Right, if Bstate = {6} then Suck else [] fi]
Note: Belief-state problem ⇝ the conditional step tests on the belief-state percept. (The plan would not be executable in a partially observable environment otherwise.)
Contingent Planning
Definition 19.6.7. The generation of plan with conditional branching based on
percepts is called contingent planning, solutions are called contingent plans.
Appropriate for partially observable or non-deterministic environments.
Example 19.6.8. Continuing ??.
One of the possible contingent plans is
((lookat table) (lookat chair)
(if (and (color table c) (color chair c)) (noop)
((removelid c1) (lookat c1) (removelid c2) (lookat c2)
(if (and (color table c) (color can c)) ((paint chair can))
(if (and (color chair c) (color can c)) ((paint table can))
((paint chair c1) (paint table c1)))))))
Note: Variables in this plan are existential; e.g. in
line 2: If there is some joint color c of the table and chair ⇝ done.
line 4/5: The condition can be satisfied by [c1/can] or [c2/can] ⇝ instantiate accordingly.
Definition 19.6.9. During plan execution the agent maintains the belief state b,
chooses the branch depending on whether b ⊨ c for the condition c.
Note: The planner must make sure b ⊨ c can always be decided.
Here:
Given an action a and percepts p = p1 ∧ . . . ∧ pn , we have
Idea: Given such a mechanism for generating (exact or approximate) updated belief
states, we can generate contingent plans with an extension of AND-OR search over
belief states.
Extension: This also works for non-deterministic actions: we extend the represen-
tation of effects to disjunctions.
Questions about how ALeA is used, what it is like using ALeA, and questions about demographics.
A token is generated at the end of the survey (SAVE THIS CODE!)
A completed survey counts as a successful prepquiz in AI-1!
Look for Quiz 15 in the usual place (single question)
just submit the token to get full points
The token can also be used to exercise the rights of the GDPR.
The survey has no time limit, is free and anonymous, can be paused and continued later on, and can be cancelled.
https://ddi-survey.cs.fau.de/limesurvey/index.php/667123?lang=en
This URL will also be posted on the forum tonight.
[Figure 4.18 (AIMA): A simple maze problem. The agent starts at S and must reach G but knows nothing of the environment.]
Observation 19.7.5. No online algorithm can avoid dead ends in all state spaces.
Example 19.7.6. Two state spaces that lead an online agent into dead ends:
[Figure 4.19 (AIMA): (a) Two state spaces that might lead an online search agent into a dead end. Any given agent will fail in at least one of these spaces. (b) A two-dimensional environment that can cause an online search agent to follow an arbitrarily inefficient route to the goal. Whichever choice the agent makes, the adversary blocks that route with another long, thin wall, so that the path followed is much longer than the best possible path.]
Any given agent will fail in at least one of the spaces.
Definition 19.7.7. We call ?? an adversary argument.
Example 19.7.8. Forcing an online agent into an arbitrarily inefficient route:
Whichever choice the agent makes, the adversary can block it with a long, thin wall.
Observation: Dead ends are a real problem for robots: ramps, stairs, cliffs, . . .
Definition 19.7.9. A state space is called safely explorable, iff a goal state is reachable from every reachable state.
Idea: Depth first search seems a good fit. (must only travel for backtracking)
Replanning for Plan Repair
Generally: Replanning when the agent’s model of the world is incorrect.
Example 19.8.4 (Plan Repair by Replanning). Given a plan from S to G.
[Figure 11.12 (AIMA): At first, the sequence “whole plan” is expected to get the agent from S to G. The agent executes steps of the plan until it expects to be in state E, but observes that it is actually in O. The agent then replans for the minimal repair plus continuation to reach G.]
The agent executes wholeplan step by step, monitoring the rest of the plan.
After a few steps the agent expects to be in E, but observes state O.
Replanning: by calling the planner recursively
find state P in wholeplan and a plan repair from O to P . (P may be G)
minimize the cost of repair + continuation
Definition 19.8.5. There are three levels of execution monitoring: before executing
an action
action monitoring checks whether all preconditions still hold.
plan monitoring checks that the remaining plan will still succeed.
goal monitoring checks whether there is a better set of goals it could try to
achieve.
Note: ?? was a case of action monitoring leading to replanning.
Idea: On failure, resume planning (e.g. by POP) to achieve open conditions from
current state.
Definition 19.8.6. IPEM (Integrated Planning, Execution, and Monitoring):
Semester Change-Over
Planning Frameworks
Planning Algorithms
Planning and Acting in the real world
[Figure 2.1 (AIMA): Agents interact with environments through sensors and actuators.]
Simple Reflex Agents
[Figure 2.9 (AIMA): Schematic diagram of a simple reflex agent – sensors report what the world is like now, condition–action rules determine what action to do now, actuators execute it.]
function SIMPLE-REFLEX-AGENT(percept) returns an action
  persistent: rules, a set of condition–action rules
  state ← INTERPRET-INPUT(percept)
  rule ← RULE-MATCH(state, rules)
  action ← rule.ACTION
  return action
[Figure 2.10 (AIMA): A simple reflex agent. It acts according to a rule whose condition matches the current state, as defined by the percept.]
Reflex Agents with State
[Figure: model-based reflex agent – the agent maintains an internal state using models of how the world evolves and what its actions do; a goal-based variant additionally asks what the world will be like if it does action A and what action it should do now to achieve its goals.]
function MODEL-BASED-REFLEX-AGENT(percept) returns an action
  state ← UPDATE-STATE(state, action, percept, model)
  rule ← RULE-MATCH(state, rules)
  action ← rule.ACTION
  return action
[Figure 2.12 (AIMA): A model-based reflex agent. It keeps track of the current state of the world, using an internal model. It then chooses an action in the same way as the reflex agent.]
[Figure: learning agent – a performance element selects actions; a critic compares the agent’s behavior against a performance standard and gives feedback; a learning element uses this feedback to make improvements and sets learning goals; a problem generator suggests exploratory actions.]
Rational Agent
Idea: Try to design agents that are successful. (do the right thing)
Definition 20.1.1. An agent is called rational, if it chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date. This is called the MEU principle.
Note: A rational agent need not be perfect:
it only needs to maximize expected value (rational ≠ omniscient)
need not predict e.g. very unlikely but catastrophic events in the future
percepts may not supply all relevant information (rational ≠ clairvoyant)
if we cannot perceive things we do not need to react to them.
but we may need to try to find out about hidden dangers (exploration)
action outcomes may not be as expected (rational ≠ successful)
but we may need to take action to ensure that they do (more often) (learning)
Rational ⇝ exploration, learning, autonomy
20.2 Administrativa
We will now go through the ground rules for the course. This is a kind of a social contract
between the instructor and the students. Both have to keep their side of the deal to make learning
as efficient and painless as possible. If you have questions please make sure you discuss them
with the instructor, the teaching assistants, or your fellow students. There are three sensible
venues for such discussions: online in the lectures, in the tutorials, which we discuss now, or in
the course forum – see below. Finally, it is always a very good idea to form study groups with
your friends.
Goal 2: Allow you to ask any question you have in a protected environment.
Instructor/Lead TA: Florian Rabe (KWARC Postdoc)
Room: 11.137 @ Händler building, [email protected]
Now we come to a topic that is always interesting to the students: the grading scheme.
Assessment, Grades
Overall (Module) Grade:
It is very well-established experience that without doing the homework assignments (or something
similar) on your own, you will not master the concepts, you will not even be able to ask sensible
questions, and take very little home from the course. Just sitting in the course and nodding is not
enough!
Start early! (many assignments need more than one evening’s work)
Don’t start by sitting at a blank screen (talking & study groups help)
Humans will be trying to understand the text/code/math when grading it.
Go to the tutorials, discuss with your TA! (they are there for you!)
One special case of academic rules that affects students is the question of cheating, which we will
cover next.
There is no need to cheat in this course!! (hard work will usually do)
Note: Cheating prevents you from learning (you are cutting into your own flesh)
We expect you to know what is useful collaboration and what is cheating.
You have to hand in your own original code/text/math for all assignments
You may discuss your homework assignments with others, but if doing so impairs
your ability to write truly original code/text/math, you will be cheating
Copying from peers, books or the Internet is plagiarism unless properly attributed
(even if you change most of the actual words)
I am aware that there may have been different standards about this at your previous
university! (these are the ground rules here)
There are data mining tools that monitor the originality of text/code.
Procedure: If we catch you at cheating. . . (correction: if we suspect cheating)
We will confront you with the allegation and impose a grade sanction.
If you have a reasonable explanation we lift that. (you have to convince us)
Note: Both active (copying from others) and passive cheating (allowing others to
copy) are penalized equally.
We are fully aware that the border between cheating and useful and legitimate collaboration is
difficult to find and will depend on the special case. Therefore it is very difficult to put this into
firm rules. We expect you to develop a firm intuition about behavior with integrity over the course of your stay at FAU. Do use the opportunity to discuss the AI-2 topics with others. After all, one
of the non-trivial skills you want to learn in the course is how to talk about Artificial Intelligence
topics. And that takes practice, practice, and practice. Due to the current AI hype, the course
Artificial Intelligence is very popular and thus many degree programs at FAU have adopted it for
their curricula. Sometimes the course setup that fits for the CS program does not fit the other’s
very well; therefore there are some special conditions, which I want to state here.
In “Wirtschafts-Informatik” you can only take AI-1 and AI-2 together in the “Wahlpflicht-
bereich”.
ECTS credits need to be divisible by five ⇝ 7.5 + 7.5 = 15.
I can only warn of what I am aware, so if your degree program lets you jump through extra hoops,
please tell me and then I can mention them here.
Maybe we can get around the problems of defining “what artificial intelligence is”, by just describing
the necessary components of AI (and how they interact). Let’s have a try to see whether that is
more informative.
Inference
Perception
Language understanding
Emotion
Note that the list of components is controversial as well. Some say that it lumps together cognitive
capacities that should be distinguished or forgets others, . . . . We state it here much more to get
AI-2 students to think about the issues than to make it normative.
in outer space: systems need autonomous control; remote control is impossible due to the time lag.
in artificial limbs: the user controls the prosthesis via existing nerves, and can e.g. grip a sheet of paper.
in household appliances: the iRobot Roomba vacuums, mops, and sweeps in corners, . . . , parks, charges, and discharges; general robotic household help is on the horizon.
in hospitals: in the USA 90% of the prostate operations are carried out by RoboDoc; Paro is a cuddly robot that eases solitude in nursing homes.
The AI Conundrum
Observation: Reserving the term “Artificial Intelligence” has been quite a land
grab!
But: researchers at the Dartmouth Conference (1956) really thought they would
solve/reach AI in two/three decades.
Consequence: AI still asks the big questions. (and still promises answers soon)
Another Consequence: AI as a field is an incubator for many innovative tech-
nologies.
AI Conundrum: Once AI solves a subfield it is called “computer science”.
(becomes a separate subfield of CS)
Example 20.3.4. Functional/Logic Programming, automated theorem proving,
Planning, machine learning, Knowledge Representation, . . .
Still Consequence: AI research was alternatingly flooded with money and cut off
brutally.
All of these phenomena can be seen in the growth of AI as an academic discipline over the course
of its now over 70 year long history.
[Figure: the ups and downs of AI as a field – Turing test, Dartmouth Conference, Lighthill report, AI Winter 1 (1974–1980), AI Winter 2 (1987–1994), the WWW ⇝ data/computing explosion, AI becomes scarily effective and ubiquitous; possibly ahead: excitement fades while some applications profit a lot, AI consequences/biases/regulation, the AI bubble bursts, the next AI winter comes.]
Of course, the future of AI is still unclear, we are currently in a massive hype caused by the advent
of deep neural networks being trained on all the data of the Internet, using the computational
power of huge compute farms owned by an oligopoly of massive technology companies – we are
definitely in an AI summer.
But AI as an academic community and the tech industry also make outrageous promises, and the media pick them up and distort them out of proportion, . . . So public opinion could flip again, sending
AI into the next winter.
interact with it via sensors and actuators. Here, the main method for realizing
intelligent behavior is by learning from the world.
As a consequence, the field of Artificial Intelligence (AI) is an engineering field at the intersection
of computer science (logic, programming, applied statistics), Cognitive Science (psychology, neu-
roscience), philosophy (can machines think, what does that mean?), linguistics (natural language
understanding), and mechatronics (robot hardware, sensors).
Subsymbolic AI and in particular machine learning is currently hyped to such an extent, that
many people take it to be synonymous with “Artificial Intelligence”. It is one of the goals of this
course to show students that this is a very impoverished view.
We combine the topics in this way in this course, not only because this reproduces the histor-
ical development but also as the methods of statistical and subsymbolic AI share a common
basis.
It is important to notice that all approaches to AI have their application domains and strong points.
We will now see that exactly the two areas where symbolic AI and statistical/subsymbolic AI have their respective fortes correspond to natural application areas.
Consumer tasks: consumer grade applications have tasks that must be fully
generic and wide coverage. ( e.g. machine translation like Google Translate)
Producer tasks: producer grade applications must be high-precision, but can be domain-specific.
[Figure: a precision/coverage diagram locating producer tasks near 100% precision.]
General Rule: Subsymbolic AI is well suited for consumer tasks, while symbolic
AI is better suited for producer tasks.
A domain of producer tasks I am interested in: mathematical/technical documents.
An example of a producer task – indeed this is where the name comes from – is the case of a
machine tool manufacturer T , which produces digitally programmed machine tools worth multiple
million Euro and sells them into dozens of countries. Thus T must also provide comprehensive
machine operation manuals, a non-trivial undertaking, since no two machines are identical and
they must be translated into many languages, leading to hundreds of documents. As those manual
share a lot of semantic content, their management should be supported by AI techniques. It is
critical that these methods maintain a high precision, operation errors can easily lead to very
costly machine damage and loss of production. On the other hand, the domain of these manuals is
quite restricted. A machine tool has only a couple of hundred components, which can be described by a couple of thousand attributes.
Indeed companies like T employ high-precision AI techniques like the ones we will cover in this
course successfully; they are just not so much in the public eye as the consumer tasks.
Thus: reasoning components of some form are at the heart of many AI systems.
KWARC Angle: Scaling up (web-coverage) without dumbing down (too much)
Content markup instead of full formalization (too tedious)
User support and quality control instead of “The Truth” (elusive anyway)
use Mathematics as a test tube (Mathematics ≙ Anything Formal)
care more about applications than about philosophy (we cannot help getting
this right anyway as logicians)
The KWARC group was established at Jacobs Univ. in 2004, moved to FAU Erlan-
gen in 2016
One possible objection to this is that the agent and the environment are conceptualized as separate
entities; in particular, that the image suggests that the agent itself is not part of the environment.
Indeed that is intended, since it makes thinking about agents and environments easier and is of
little consequence in practice. In particular, the offending separation is relatively easily fixed if
needed.
[Figure 2.1 (AIMA): Agents interact with environments through sensors and actuators.]
Different agents differ on the contents of the white box in the center.
Rationality
Idea: Try to design agents that are successful! (aka. “do the right thing”)
Problem: What do we mean by “successful”, how do we measure “success”?
Definition 20.3.13. A performance measure is a function that evaluates a sequence of environments.
Example 20.3.14. A performance measure for a vacuum cleaner could
award one point per “square” cleaned up in time T?
award one point per clean “square” per time step, minus one per move?
penalize for > k dirty squares?
An agent is called rational, if it chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date.
Critical Observation: We only need to maximize the expected value, not the actual value of the performance measure!
Let us see how the observation that we only need to maximize the expected value, not the actual
value of the performance measure affects the consequences.
For the design of an agent for a specific task – i.e. to choose an agent architecture and design an agent program – we have to take into account the performance measure, the environment, and the characteristics of the agent itself, in particular its actions and sensors.
The PEAS criteria are essentially a laundry list of what an agent design task description should
include.
Environment types
[Figure 2.10 (AIMA): A simple reflex agent. It acts according to a rule whose condition matches the current state, as defined by the percept.]
[Figure (AIMA): A model-based reflex agent with internal state; its components are Sensors, "How the world evolves", "What the world is like now", "What my actions do", and Actuators.]
Non-deterministic actions:
Unreliable Sensors
Robot Localization: Suppose we want to support localization using landmarks
to narrow down the area.
Example 20.3.25. If you see the Eiffel tower, then you’re in Paris.
We are now ready to proceed to environments which can only be partially observed and where actions are non-deterministic. Both sources of uncertainty conspire to allow us only partial knowledge about the world, so that we can only optimize the "expected utility" instead of the "actual utility" of our actions.
a belief state that has information about the possible states the world may be
in, and
a sensor model that updates the belief state based on sensor information
a transition model that updates the belief state based on actions.
Idea: The agent environment determines what the world model can be.
That is exactly what we have been doing until now: we have been studying methods that
build on descriptions of the “actual” world, and have been concentrating on the progression from
atomic to factored and ultimately structured representations. Tellingly, we spoke of “world states”
instead of “belief states”; we have now justified this practice in the brave new belief-based world
models by the (re-) definition of “world states” above. To fortify our intuitions, let us recap from
a belief-state-model perspective.
Let us now see what happens when we lift the restrictions of total observability and determin-
ism.
mix the ideas from the last two. (sensor model + transition relation)
Overview: AI2
Basics of probability theory (probability spaces, random variables, conditional
probabilities, independence,...)
Probabilistic reasoning (Bayesian networks,...)
Probabilistic Reasoning over time (Markov chains, Hidden Markov models,...)
⇒ We can update our world model episodically based on observations (i.e. sensor
data)
Decision theory: Making decisions under uncertainty (Preferences, Utilities,
Decision networks, Markov Decision Procedures,...)
⇒ We can choose the right action based on our world model and the likely outcomes
of our actions
This part of the lecture notes addresses inference and agent decision making in partially observable
environments, i.e. where we only know probabilities instead of certainties whether propositions
are true/false. We cover basic probability theory and – based on that – Bayesian Networks and
simple decision making in such environments. Finally we extend this to probabilistic temporal
models and their decision theory.
Chapter 21
Quantifying Uncertainty
Probabilistic Models
Definition 21.1.1 (Mathematically (slightly simplified)). A probability space
(or probability model) is a pair ⟨Ω, P⟩ such that:
Ω is a set of outcomes (the sample space), and
P assigns to every subset A ⊆ Ω a probability P(A) ∈ [0,1], such that P(Ω) = 1 and P(A ∪ B) = P(A) + P(B) for disjoint A and B.
Example 21.1.2 (Dice throws). Assume we throw a (fair) die two times. Then
the sample space is {(i, j) | 1 ≤ i, j ≤ 6}. We define P by letting P({A}) = 1/36 for
every A ∈ Ω.
Since the probability of any outcome is the same, we say P is uniformly distributed.
The definition is simplified in two places: Firstly, we assume that P is defined on the full power
set. This is not always possible, especially if Ω is uncountable. In that case we need an additional
set of “events” instead, and lots of mathematical machinery to make sure that we can safely take
unions, intersections, complements etc. of these events.
Secondly, we would technically only demand that P is additive on countably many disjoint
sets.
In this course we will assume that our sample space is at most countable anyway; usually even
finite.
Random Variables
In practice, we are rarely interested in the specific outcome of an experiment, but
rather in some property of the outcome. This is especially true in the very common
situation where we don’t even know the precise probabilities of the individual outcomes.
Example 21.1.3. The probability that the sum of our two dice throws is 7 is
P({(i, j) ∈ Ω | i + j = 7}) = P({(6, 1), (1, 6), (5, 2), (2, 5), (4, 3), (3, 4)}) = 6/36 = 1/6.
Definition 21.1.5. We say that a random variable X is finite domain, iff its domain
dom(X) is finite and Boolean, iff dom(X) = {T, F}.
For a Boolean random variable, we will simply write P (X) for P (X = T) and
P (¬X) for P (X = F).
Note that a random variable, according to the formal definition, is neither random nor a variable:
It is a function with clearly defined domain and codomain – and what we call the domain of the
“variable” is actually its codomain... are you confused yet? ,
This confusion is a side-effect of the mathematical formalism. In practice, a random variable is
some indeterminate value that results from some statistical experiment – i.e. it is random, because
the result is not predetermined, and it is a variable, because it can take on different values.
It just so happens that if we want to model this scenario mathematically, a function is the most
natural way to do so.
Some Examples
Example 21.1.6. Summing up our two dice throws is a random variable S : Ω → [2,12] with S((i, j)) = i + j. The probability that they sum up to 7 is written as P(S = 7) = 1/6.
Example 21.1.7. The first and second of our two dice throws are random variables
First, Second : Ω → [1,6] with First((i, j)) = i and Second((i, j)) = j.
Propositions
This is nice and all, but in practice we are interested in “compound” probabilities
like:
“What is the probability that the sum of our two dice throws is 7, but neither of the
two dice is a 3?”
Idea: Reuse the syntax of propositional logic and define the logical connectives for
random variables!
Example 21.1.11. We can express the above as: P (¬(First = 3) ∧ ¬(Second =
3) ∧ (S = 7))
Definition 21.1.12. Let X1, X2 be random variables, x1 ∈ dom(X1) and x2 ∈ dom(X2). We define:
1. P(X1 ≠ x1) := P(¬(X1 = x1)) := P({ω ∈ Ω | X1(ω) ≠ x1}) = 1 − P(X1 = x1),
2. P((X1 = x1) ∧ (X2 = x2)) := P({ω ∈ Ω | X1(ω) = x1 and X2(ω) = x2}), and analogously for the other logical connectives.
Events
Definition 21.1.14 (Again slightly simplified). Let ⟨Ω, P ⟩ be a probability space.
An event is a subset of Ω.
Definition 21.1.15 (Convention). We call an event (by extension) anything that
represents a subset of Ω: any statement formed from the logical connectives and values
of random variables, on which P (·) is defined.
Problem 1.1
Remember: We can define A ∨ B := ¬(¬A ∧ ¬B), T := A ∨ ¬A and F := ¬T
– is this compatible with the definition of probabilities on propositional formulae? And
why is P (X1 ̸= x1 ) = 1 − P (X1 = x1 )?
Problem 1.3
Show that P (A) = P (A ∧ B) + P (A ∧ ¬B)
Conditional Probabilities
As we gather new information, our beliefs (should ) change, and thus our probabil-
ities!
Example 21.1.16. Your “probability of missing the connection train” increases
when you are informed that your current train has 30 minutes delay.
Example 21.1.17. The “probability of cavity” increases when the doctor is in-
formed that the patient has a toothache.
Example 21.1.18. The probability that S = 3 is clearly higher if I know that
First = 1 than otherwise – or if I know that First = 6!
Definition 21.1.19 (Conditional probability). For events A and B with P(B) > 0, we define
P(A|B) := P(A ∧ B)/P(B)
We also call P (A) the prior probability of A, and P (A|B) the posterior probability.
Examples
Example 21.1.20. If we assume First = 1, then P(S = 3|First = 1) should be
precisely P(Second = 2) = 1/6. We check: P(S = 3|First = 1) = P((S = 3) ∧ (First = 1))/P(First = 1) = (1/36)/(1/6) = 1/6. ✓
Example 21.1.21. Assume the prior probability P (cavity) is 0.122. The probability
that a patient has both a cavity and a toothache is P (cavity ∧toothache) = 0.067.
The probability that a patient has a toothache is P (toothache) = 0.15.
If the patient complains about a toothache, we can update our estimation by computing the posterior probability:
P(cavity|toothache) = P(cavity ∧ toothache)/P(toothache) = 0.067/0.15 ≈ 0.45
Note: We just computed the probability of some underlying disease based on the
presence of a symptom!
Or more generally: We computed the probability of a cause from observing its effect.
Some Rules
Equations on unconditional probabilities have direct analogues for conditional proba-
bilities.
Problem 1.4
Convince yourself of the following:
P (A|C) = 1 − P (¬A|C).
Bayes' Rule
P(A|B) = P(B|A) · P(A) / P(B)
Proof:
1. P(A|B) = P(A ∧ B)/P(B) = (P(B|A) · P(A))/P(B)
...okay, that was straightforward... what’s the big deal?
(Somewhat Dubious) Claim: Bayes’ Rule is the entire scientific method con-
densed into a single equation!
...if I keep gathering evidence and update, ultimately the impact of the prior belief
will diminish.
“You’re entitled to your own priors, but not your own likelihoods”
Independence
Question: What is the probability that S = 7 and the patient has a toothache?
Or less contrived: What is the probability that the patient has a gingivitis and a
cavity?
(Recall: A and B are independent, iff P(A ∧ B) = P(A) · P(B); equivalently, iff P(A|B) = P(A).)
Proof:
1. ⇒ By definition, P(A|B) = P(A ∧ B)/P(B) = (P(A) · P(B))/P(B) = P(A),
2. ⇐ Assume P(A|B) = P(A). Then P(A ∧ B) = P(A|B) · P(B) = P(A) · P(B).
Note: Independence asserts that two events are “not related” – the probability of
one does not depend on the other.
Mathematically, we can determine independence by checking whether P (A ∧ B) =
P (A) · P (B).
In practice, this is impossible to check. Instead, we assume independence based on
domain knowledge, and then exploit this to compute P (A ∧ B).
Independence (Examples)
Example 21.1.25.
First = 2 and Second = 3 are independent – more generally, First and Second
are independent (The outcome of the first die does not affect the outcome of
the second die)
Quick check: P((First = a) ∧ (Second = b)) = 1/36 = P(First = a) · P(Second = b) ✓
First and S are not independent.
(The outcome of the first die affects the sum of the two dice.) Counterexample:
P((First = 1) ∧ (S = 4)) = 1/36 ≠ P(First = 1) · P(S = 4) = 1/6 · 1/12 = 1/72
Example 21.1.26.
Are cavity and toothache independent?
...since cavities can cause a toothache, that would probably be a bad design
decision...
Are cavity and gingivitis independent? Cavities do not cause gingivitis, and
gingivitis does not cause cavities, so... yes... right? (...as far as I know. I’m
not a dentist.)
Probably not! A patient who has cavities has probably worse dental hygiene
than those who don’t, and is thus more likely to have gingivitis as well.
⇒ cavity may be evidence that raises the probability of gingivitis, even if they are not directly causally related.
Example 21.1.28. Let's assume toothache and catch are conditionally independent given cavity/¬cavity. Then we can finally compute:
P(cavity|toothache ∧ catch) = (P(toothache ∧ catch|cavity) · P(cavity)) / P(toothache ∧ catch)
= (P(toothache|cavity) · P(catch|cavity) · P(cavity)) / (P(toothache|cavity) · P(catch|cavity) · P(cavity) + P(toothache|¬cavity) · P(catch|¬cavity) · P(¬cavity))
= (0.6 · 0.9 · 0.2) / (0.6 · 0.9 · 0.2 + 0.1 · 0.2 · 0.8) ≈ 0.87
Conditional Independence
Lemma 21.1.29. If A and B are conditionally independent given C, then P(A|B ∧ C) = P(A|C).
Proof: (using that conditional independence means P(A ∧ B|C) = P(A|C) · P(B|C))
P(A|B ∧ C) = P(A ∧ B ∧ C)/P(B ∧ C) = (P(A ∧ B|C) · P(C))/P(B ∧ C) = (P(A|C) · P(B|C) · P(C))/P(B ∧ C) = (P(A|C) · P(B ∧ C))/P(B ∧ C) = P(A|C)
Question: If A and B are conditionally independent given C, does this imply that
A and B are independent? No. See previous slides for a counterexample.
Question: If A and B are independent, does this imply that A and B are also
conditionally independent given C? No. For example: First and Second are inde-
pendent, but not conditionally independent given S = 4.
Summary
Probability spaces serve as a mathematical model (and hence justification) for
everything related to probabilities.
The “atoms” of any statement of probability are the random variables. (Important
special cases: Boolean and finite domain)
We can define probabilities on compound (propositional logical) statements, with (outcomes of) random variables as "propositional variables".
Conditional probabilities represent posterior probabilities given some observed out-
comes.
independence and conditional independence are strong assumptions that allow us
to simplify computations of probabilities
Bayes’ Theorem
Pragmatics
Pragmatically, both interpretations amount to the same thing: I should act as if
I’m 30% confident that it will rain tomorrow. (Whether by fiat, or because in 30% of
comparable cases, it rained.)
Objection: Still: why should I? And why should my beliefs follow the seemingly
arbitrary Kolmogorov axioms?
[DF31]: If an agent has a belief that violates the Kolmogorov axioms, then there
exists a combination of “bets” on propositions so that the agent always loses money.
In other words: If your beliefs are not consistent with the mathematics, and you
act in accordance with your beliefs, there is a way to exploit this inconsistency to
your disadvantage.
Instead, we use probability distributions, which are just arrays (of arrays of...) of probabilities. And then we represent those as sparsely as possible, by exploiting independence, conditional independence, ...
Probability Distributions
Definition 21.2.1. The probability distribution for a random variable X, written
P(X), is the vector of probabilities for the (ordered) domain of X.
Note: The values in a probability distribution are all positive and sum to 1.
(Why?)
Example 21.2.2. P(First) = P(Second) = ⟨1/6, 1/6, 1/6, 1/6, 1/6, 1/6⟩. (Both First and Second are uniformly distributed)
Example 21.2.3. The probability distribution P(S) is ⟨1/36, 1/18, 1/12, 1/9, 5/36, 1/6, 5/36, 1/9, 1/12, 1/18, 1/36⟩.
Note the symmetry, with a “peak” at 7 – the random variable is (approximately,
because our domain is discrete rather than continuous) normally distributed (or
gaussian distributed, or follows a bell-curve,...).
Example 21.2.4. Probability distributions for Boolean random variables are natu-
rally pairs (probabilities for T and F), e.g.:
Example 21.2.7. P(cavity, toothache, gingivitis) could look something like this:
toothache ¬toothache
gingivitis ¬gingivitis gingivitis ¬gingivitis
cavity 0.007 0.06 0.005 0.05
¬cavity 0.08 0.003 0.045 0.75
The full joint probability distribution P(First, S):
First \ S    2     3     4     5     6     7     8     9     10    11    12
1          1/36  1/36  1/36  1/36  1/36  1/36   0     0     0     0     0
2           0    1/36  1/36  1/36  1/36  1/36  1/36   0     0     0     0
3           0     0    1/36  1/36  1/36  1/36  1/36  1/36   0     0     0
4           0     0     0    1/36  1/36  1/36  1/36  1/36  1/36   0     0
5           0     0     0     0    1/36  1/36  1/36  1/36  1/36  1/36   0
6           0     0     0     0     0    1/36  1/36  1/36  1/36  1/36  1/36
Note that if we know the value of First, the value of S is completely determined by
the value of Second.
toothache ¬toothache
cavity P (cavity|toothache) = 0.45 P (cavity|¬toothache) = 0.065
¬cavity P (¬cavity|toothache) = 0.55 P (¬cavity|¬toothache) = 0.935
The conditional probability distribution P(First|S):
First \ S    2     3     4     5     6     7     8     9     10    11    12
1           1    1/2   1/3   1/4   1/5   1/6    0     0     0     0     0
2           0    1/2   1/3   1/4   1/5   1/6   1/5    0     0     0     0
3           0     0    1/3   1/4   1/5   1/6   1/5   1/4    0     0     0
4           0     0     0    1/4   1/5   1/6   1/5   1/4   1/3    0     0
5           0     0     0     0    1/5   1/6   1/5   1/4   1/3   1/2    0
6           0     0     0     0     0    1/6   1/5   1/4   1/3   1/2    1
Convention
We now “lift” multiplication and division to the level of whole probability distribu-
tions:
P(X|Y) := P(X,Y)/P(Y) represents the system of equations P(X = x|Y = y) := P((X = x) ∧ (Y = y))/P(Y = y)
Bayes' Theorem: P(X|Y) = P(Y|X) · P(X)/P(Y) represents the system of equations P(X = x|Y = y) = (P(Y = y|X = x) · P(X = x))/P(Y = y)
Example 21.2.14. We can read off the probability P (toothache) from the full
joint probability distribution as 0.007+0.06+0.08+0.003=0.15, and the probability
P (toothache ∧ cavity) as 0.007 + 0.06 = 0.067
We can actually implement this! (They’re just (nested) arrays)
But just as we often don’t have a fully specified probability space to work in, we often
don’t have a full joint probability distribution for our random variables either.
Also: Given random variables X1, ..., Xn, the full joint probability distribution has ∏_{i=1}^{n} |dom(Xi)| entries! (P(First, S) already has 6 · 11 = 66 entries!)
⇒ The rest of this section deals with keeping things small, by computing probabilities
instead of storing them all.
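To make the "nested arrays" remark concrete, here is a small Python sketch (our own illustration, not part of the original slides): it stores the full joint probability distribution of Example 21.2.7 as a nested dictionary and recovers the probabilities of Example 21.2.14 by summing the relevant entries; the helper name p is ours.

# full joint probability distribution P(cavity, toothache, gingivitis) of Example 21.2.7
joint = {
    True:  {True:  {True: 0.007, False: 0.06},
            False: {True: 0.005, False: 0.05}},
    False: {True:  {True: 0.08,  False: 0.003},
            False: {True: 0.045, False: 0.75}},
}   # indexed as joint[cavity][toothache][gingivitis]

def p(event):
    # sum the entries of all worlds (c, t, g) that satisfy the event
    return sum(joint[c][t][g]
               for c in (True, False) for t in (True, False) for g in (True, False)
               if event(c, t, g))

print(p(lambda c, t, g: t))                                # P(toothache) = 0.15
print(p(lambda c, t, g: t and c))                          # P(toothache ∧ cavity) = 0.067
print(p(lambda c, t, g: t and c) / p(lambda c, t, g: t))   # P(cavity|toothache) ≈ 0.45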
Probabilistic Reasoning
Probabilistic reasoning refers to inferring probabilities of events from the proba-
bilities of other events
as opposed to determining the probabilities e.g. empirically, by gathering (sufficient
amounts of representative) data and counting.
Note: In practice, we are primarily interested in, and have access to, conditional
probabilities rather than the unconditional probabilities of conjunctions of events:
We don’t reason in a vacuum: Usually, we have some evidence and want to infer
the posterior probability of some related event. (e.g. infer a plausible cause
given some symptom)
⇒ we are interested in the conditional probability P (hypothesis|observation).
“80% of patients with a cavity complain about a toothache” (i.e. P (toothache|cavity))
is more the kind of data people actually collect and publish than “1.2% of the gen-
eral population have both a cavity and a toothache” (i.e. P (cavity∧toothache)).
Consider the probe catching in a cavity. The probe is a diagnostic tool, which
is usually evaluated in terms of its sensitivity P (catch|cavity) and specificity
P (¬catch|¬cavity). (You have probably heard these words a lot since 2020...)
[Network diagram: Cavity as the common cause of the two effects Toothache and Catch.]
Definition 21.2.15. A naive Bayes model (or, less accurately, Bayesian classifier, or,
derogatorily, idiot Bayes model) consists of:
1. random variables C, E 1 , . . ., E n such that all the E 1 , . . ., E n are conditionally inde-
pendent given C,
2. the probability distribution P(C), and
3. the conditional probability distributions P(Ei|C) for 1 ≤ i ≤ n.
Can we compute the full joint probability distribution P(cavity, toothache, catch)
from this information?
We can generalize this to more than two variables, by repeatedly applying the prod-
uct rule:
Lemma 21.2.17 (Chain rule). For any sequence of random variables X1, ..., Xn:
P(X1, ..., Xn) = P(Xn|Xn−1, ..., X1) · P(Xn−1|Xn−2, ..., X1) · ... · P(X1).
Hence:
Theorem 21.2.18. Given a naive Bayes model with effects E1, ..., En and cause C, we have
P(C, E1, ..., En) = P(C) · ∏_{i=1}^{n} P(Ei|C).
Marginalization
Great, so now we can compute P(C|E1, ..., En) = P(C, E1, ..., En)/P(E1, ..., En) ...
...except that we don't know P(E1, ..., En) :-/
...except that we can compute the full joint probability distribution, so we can recover it:
Lemma 21.2.19 (Marginalization). Given random variables X1, ..., Xn and Y1, ..., Ym, we have
P(X1, ..., Xn) = Σ_{y1∈dom(Y1), ..., ym∈dom(Ym)} P(X1, ..., Xn, Y1 = y1, ..., Ym = ym).
(This is just a fancy way of saying “we can add the relevant entries of the full joint
probability distribution”)
Example 21.2.20. Say we observed toothache = T and catch = T. Using marginalization, we can compute P(toothache = T, catch = T) = Σ_{c∈dom(cavity)} P(toothache = T, catch = T, cavity = c).
Unknowns
What if we don't know catch? (I'm not a dentist, I don't have a probe...)
We split our effects into {E1, ..., En} = {O1, ..., OnO} ∪ {U1, ..., UnU} – the observed and unknown random variables.
Let DU := dom(U1) × ... × dom(UnU). Then
P(C|O1, ..., OnO) = P(C, O1, ..., OnO)/P(O1, ..., OnO)
= (Σ_{u∈DU} P(C, O1, ..., OnO, U1 = u1, ..., UnU = unU)) / (Σ_{c∈dom(C)} Σ_{u∈DU} P(O1, ..., OnO, C = c, U1 = u1, ..., UnU = unU))
= (Σ_{u∈DU} P(C) · (∏_{i=1}^{nO} P(Oi|C)) · (∏_{j=1}^{nU} P(Uj = uj|C))) / (Σ_{c∈dom(C)} Σ_{u∈DU} P(C = c) · (∏_{i=1}^{nO} P(Oi|C = c)) · (∏_{j=1}^{nU} P(Uj = uj|C = c)))
= (P(C) · (∏_{i=1}^{nO} P(Oi|C)) · (Σ_{u∈DU} ∏_{j=1}^{nU} P(Uj = uj|C))) / (Σ_{c∈dom(C)} P(C = c) · (∏_{i=1}^{nO} P(Oi|C = c)) · (Σ_{u∈DU} ∏_{j=1}^{nU} P(Uj = uj|C = c)))
...oof...
Unknowns
P(C|O1, ..., OnO) = (P(C) · (∏_{i=1}^{nO} P(Oi|C)) · (Σ_{u∈DU} ∏_{j=1}^{nU} P(Uj = uj|C))) / (Σ_{c∈dom(C)} P(C = c) · (∏_{i=1}^{nO} P(Oi|C = c)) · (Σ_{u∈DU} ∏_{j=1}^{nU} P(Uj = uj|C = c)))
First, note that Σ_{u∈DU} ∏_{j=1}^{nU} P(Uj = uj|C = c) = 1. (We're summing over all possible events on the (conditionally independent) U1, ..., UnU given C = c)
Hence:
P(C|O1, ..., OnO) = (P(C) · ∏_{i=1}^{nO} P(Oi|C)) / (Σ_{c∈dom(C)} P(C = c) · ∏_{i=1}^{nO} P(Oi|C = c))
That is: The denominator only serves to scale what is almost already the distribution
P(C|O1 , . . ., OnO ) to sum up to 1.
Normalization
Definition 21.2.21 (Normalization). Given a vector w := ⟨w1, ..., wk⟩ of numbers in [0,1] where Σ_{i=1}^{k} wi ≤ 1, the normalized vector α(w) is defined (component-wise) as
(α(w))i := wi / Σ_{j=1}^{k} wj.
Note that Σ_{i=1}^{k} (α(w))i = 1, i.e. α(w) is a probability distribution.
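As a small illustration (ours, not from the slides) of how normalization and the naive Bayes simplification above combine, here is a Python sketch using the numbers assumed in Example 21.1.28 (P(cavity) = 0.2, P(toothache|cavity) = 0.6, P(catch|cavity) = 0.9, P(toothache|¬cavity) = 0.1, P(catch|¬cavity) = 0.2):

def normalize(w):
    # α(w): scale a vector of non-negative numbers so that it sums to 1
    z = sum(w)
    return [wi / z for wi in w]

p_cavity    = {True: 0.2, False: 0.8}
p_toothache = {True: 0.6, False: 0.1}   # P(toothache | cavity = c)
p_catch     = {True: 0.9, False: 0.2}   # P(catch     | cavity = c)

# unnormalized P(cavity = c) * P(toothache|c) * P(catch|c) for c = T, F
scores = [p_cavity[c] * p_toothache[c] * p_catch[c] for c in (True, False)]
print(normalize(scores))                # ≈ [0.87, 0.13], i.e. P(cavity|toothache ∧ catch)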
Dentistry Example
Putting things together for the dentistry example, we compute P(cavity|toothache, catch) = α(P(cavity) · P(toothache|cavity) · P(catch|cavity)), as in the sketch above.
The same recipe works for, e.g., text classification: given a new article, we just count the occurrences ki of the words in it and compute
P(category|word1 = k1, ..., wordn = kn) = α(P(category) · ∏_{i=1}^{n} P(wordi = ki|category))
Inference by Enumeration
The rules we established for naive Bayes models, i.e. Bayes’s theorem, the prod-
uct rule and chain rule, marginalization and normalization, are general techniques for
probabilistic reasoning, and their usefulness is not limited to the naive Bayes models.
More generally:
Theorem 21.2.23. Let Q, E1, ..., EnE, U1, ..., UnU be random variables and D := dom(U1) × ... × dom(UnU). Then
P(Q|E1 = e1, ..., EnE = enE) = α(Σ_{u∈D} P(Q, E1 = e1, ..., EnE = enE, U1 = u1, ..., UnU = unU)).
We call Q the query variable, E1, ..., EnE the evidence, and U1, ..., UnU the unknown (or hidden) variables; computing a conditional probability this way is called inference by enumeration.
Note that this is just a “mathy” way of saying we
1. sum over all relevant entries of the full joint probability distribution of the variables,
and
2. normalize the result to yield a probability distribution.
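The following Python sketch (again ours; the function name enumerate_query and the variable order are arbitrary) implements Theorem 21.2.23 literally on the full joint probability distribution of Example 21.2.7: sum out the hidden variables, then normalize.

import itertools

domains = {"cavity": (True, False), "toothache": (True, False), "gingivitis": (True, False)}
order = ("cavity", "toothache", "gingivitis")     # index order of the tuples below
joint = {
    (True,  True,  True):  0.007, (True,  True,  False): 0.06,
    (True,  False, True):  0.005, (True,  False, False): 0.05,
    (False, True,  True):  0.08,  (False, True,  False): 0.003,
    (False, False, True):  0.045, (False, False, False): 0.75,
}

def enumerate_query(query, evidence):
    # P(query | evidence): sum over all values of the hidden variables, then normalize
    hidden = [v for v in order if v != query and v not in evidence]
    scores = []
    for q in domains[query]:
        total = 0.0
        for hvals in itertools.product(*(domains[h] for h in hidden)):
            world = {query: q, **evidence, **dict(zip(hidden, hvals))}
            total += joint[tuple(world[v] for v in order)]
        scores.append(total)
    z = sum(scores)
    return [s / z for s in scores]

print(enumerate_query("cavity", {"toothache": True}))    # ≈ [0.45, 0.55]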
We will fortify our intuition about naive Bayes models with a variant of the Wumpus world we
looked at ?? to understand whether logic was up to the job of guiding an agent in the Wumpus
cave.
We use the Boolean random variables
Pi,j for i, j ∈ {1, 2, 3, 4}, stating there is a pit at square [i, j], and
Bi,j for (i, j) ∈ {(1, 1), (1, 2), (2, 1)}, stating there is a breeze at square [i, j].
⇒ let’s apply our machinery!
Wumpus Continued
Problem: We only know Pi,j for three fields. If we want to compute e.g. P1,3 via enumeration, that leaves 2^(4²−4) = 4096 terms to sum over!
Let’s do better.
Let b := ¬B 1,1 ∧ B 1,2 ∧ B 2,1 (All the breezes we know
about)
Let p := ¬P 1,1 ∧ ¬P 1,2 ∧ ¬P 2,1 . (All the pits we know
about)
Let F := {P3,1 ∧ P2,2, ¬P3,1 ∧ P2,2, P3,1 ∧ ¬P2,2, ¬P3,1 ∧ ¬P2,2} (the current "frontier")
Let O be (the set of assignments for) all the other variables
P i,j . (i.e. except p, F and our query P 1,3 )
Optimized Wumpus
P(P1,3|p, b) = α(Σ_{o∈O, f∈F} P(P1,3, b, p, f, o)) = α(Σ_{o∈O, f∈F} P(b|P1,3, p, o, f) · P(P1,3, p, f, o))
= α(Σ_{f∈F} Σ_{o∈O} P(b|P1,3, p, f) · P(P1,3, p, f, o)) = α(Σ_{f∈F} P(b|P1,3, p, f) · (Σ_{o∈O} P(P1,3, p, f, o)))
= α(Σ_{f∈F} P(b|P1,3, p, f) · (Σ_{o∈O} P(P1,3) · P(p) · P(f) · P(o)))
= α(P(P1,3) · P(p) · (Σ_{f∈F} P(b|P1,3, p, f) · P(f) · (Σ_{o∈O} P(o))))
where P(b|P1,3, p, f) ∈ {0, 1} and Σ_{o∈O} P(o) = 1.
Cooking Recipe
In general, when you want to reason probabilistically, a good heuristic is:
1. Try to frame the full joint probability distribution in terms of the probabilities you
know. Exploit product rule/chain rule, independence, conditional independence,
marginalization and domain knowledge (as e.g. P(b|p, f ) ∈ {0, 1})
2. Frame the probability you actually want to compute in terms of the full joint probability distribution (e.g. via enumeration).
3. Substitute by the result of 1., and again, exploit all of our machinery
4. Implement the resulting (system of) equation(s)
5. ???
6. Profit
Summary
Probability distributions and conditional probability distributions allow us to represent random variables as convenient data structures in an implementation. (Assuming they are finite domain...)
The full joint probability distribution allows us to compute all probabilities of statements about the random variables it contains. (But possibly inefficiently)
Marginalization and normalization are the specific techniques for extracting the
specific probabilities we are interested in from the full joint probability distribution.
The product and chain rule, exploiting (conditional) independence, Bayes’ Theorem,
and of course domain specific knowledge allow us to do so much more efficiently.
Naive Bayes models are one example where all these techniques come together.
Chapter 22
Probabilistic Reasoning: Bayesian Networks
22.1 Introduction
John, Mary, and My Brand-New Alarm
Example 22.1.1 (From Russell/Norvig).
I got very valuable stuff at home. So I bought an alarm. Unfortunately, the alarm
just rings at home, doesn’t call me on my mobile.
I’ve got two neighbors, Mary and John, who’ll call me if they hear the alarm.
The problem is that, sometimes, the alarm is caused by an earthquake.
Also, John might confuse the alarm with his telephone, and Mary might miss the
alarm altogether because she typically listens to loud music.
⇒ Random variables: Burglary, Earthquake, Alarm, John, Mary.
Given that both John and Mary call me, what is the probability of a burglary?
⇒ This is almost a naive Bayes model, but with multiple causes (Burglary and
Earthquake) for the Alarm, which in turn may cause John and/or Mary.
We assume:
We (should) know P(Alarm|Burglary, Earthquake), P(John|Alarm), and P(Mary|Alarm).
Burglary and Earthquake are independent.
John and Mary are conditionally independent given Alarm.
[Network: Burglary → Alarm ← Earthquake, with Alarm → John and Alarm → Mary.]
Some Applications
A ubiquitous problem: Observe “symptoms”, need to infer “causes”.
Medical diagnosis, face recognition, ...
Note: size(B) := the total number of entries in the conditional probability distributions of B.
Note: Smaller BN ⇝ we need to assess fewer probabilities and get more efficient inference.
Observation 22.2.2. The explicit full joint probability distribution has size ∏_{i=1}^{n} |Di|.
Observation 22.2.3. If |Parents(Xi)| ≤ k for every Xi, and Dmax is the largest random variable domain, then size(B) ≤ n · |Dmax|^{k+1}.
Example 22.2.4. For |Dmax| = 2, n = 20, k = 4 we have 2^20 = 1048576 probabilities, but a Bayesian network of size ≤ 20 · 2^5 = 640!
In the worst case, size(B) = n · ∏_{i=1}^{n} |Di|, namely if every variable depends on all its predecessors in the chosen variable ordering.
Intuition: BNs are compact – i.e. of small size – if each variable is directly
influenced only by few of its predecessor variables.
Thus: The size of the resulting BN depends on the chosen variable ordering
X 1 , . . ., X n .
In Particular: The size of a Bayesian network is not a fixed property of the domain.
It depends on the skill of the designer.
Note: For ?? we try to determine whether – given different value assignments to potential parents
– the probability of Xi being true differs? If yes, we include these parents. In the particular case:
1. M to J yes because the common cause may be the alarm.
Again: Given different value assignments to potential parents, does the probability of Xi being
true differ? If yes, include these parents.
1. M to J as before.
2. M, J to E as probability of E is higher if M/J is true.
3. Same for B; E to B because, given M and J are true, if E is true as well then prob of B is
lower than if E is false.
4. M /J/B/E to A because if M /J/B/E is true (even when changing the value of just one of
these) then probability of A is higher.
Example 22.2.9. The sum of two dice throws S is entirely determined by the values of the two dice First and Second.
Example 22.2.10. In the Wumpus example, the breezes are entirely determined by
the pits
If we model Fever as a noisy disjunction node, then the general rule P(Xi|Parents(Xi)) = ∏_{{j | Xj = T}} qj for the CPT gives the following table:
Let’s do better!
P(b|j, m) = α(Σ_{ba,be∈{T,F}} P(j|a = ba) · P(m|a = ba) · P(a = ba|e = be, b) · P(e = be) · P(b))
Let's "optimize":
P(b|j, m) = α(P(b) · (Σ_{be∈{T,F}} P(e = be) · (Σ_{ba∈{T,F}} P(a = ba|e = be, b) · P(j|a = ba) · P(m|a = ba))))
Enumeration: Example
Variable order: b, e, a, j, m
P0 := P(b) · ( P(e) · ( P(a|b, e) · P(j|a) · P(m|a) · 1.0 + P(¬a|b, e) · P(j|¬a) · P(m|¬a) · 1.0 )
            + P(¬e) · ( P(a|b, ¬e) · P(j|a) · P(m|a) · 1.0 + P(¬a|b, ¬e) · P(j|¬a) · P(m|¬a) · 1.0 ) )
P1 := P(¬b) · ( P(e) · ( P(a|¬b, e) · P(j|a) · P(m|a) · 1.0 + P(¬a|¬b, e) · P(j|¬a) · P(m|¬a) · 1.0 )
             + P(¬e) · ( P(a|¬b, ¬e) · P(j|a) · P(m|a) · 1.0 + P(¬a|¬b, ¬e) · P(j|¬a) · P(m|¬a) · 1.0 ) )
⇒ P(B|j, m) = ⟨P0/(P0 + P1), P1/(P0 + P1)⟩
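As a sanity check, the nested sum above can be typed in directly. The CPT numbers are not given in these notes, so the Python sketch below (ours) uses the values of the Russell/Norvig alarm example as an assumption; with them, the enumeration yields P(Burglary|j, m) ≈ ⟨0.284, 0.716⟩.

P_b = 0.001                       # P(Burglary)        -- assumed, as in AIMA
P_e = 0.002                       # P(Earthquake)      -- assumed
P_a = {(True, True): 0.95, (True, False): 0.94,       # P(Alarm | Burglary, Earthquake)
       (False, True): 0.29, (False, False): 0.001}
P_j = {True: 0.90, False: 0.05}   # P(John | Alarm)
P_m = {True: 0.70, False: 0.01}   # P(Mary | Alarm)

def score(b):
    # unnormalized P(Burglary = b, j, m): sum over Earthquake and Alarm
    total = 0.0
    for e in (True, False):
        for a in (True, False):
            p_a = P_a[(b, e)] if a else 1 - P_a[(b, e)]
            total += ((P_b if b else 1 - P_b) * (P_e if e else 1 - P_e)
                      * p_a * P_j[a] * P_m[a])
    return total

p0, p1 = score(True), score(False)
print(p0 / (p0 + p1), p1 / (p0 + p1))    # ≈ 0.284 0.716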
Variable Elimination
P(b|j, m) = α(P(b) · (Σ_{be∈{T,F}} P(e = be) · (Σ_{ba∈{T,F}} P(a = ba|e = be, b) · P(j|a = ba) · P(m|a = ba))))
The last two factors P (j|a = ba ), P (m|a = ba ) only depend on a, but are “trapped”
behind the summation over e, hence computed twice in two distinct recursive calls to
EnumAll
Idea: Instead of left-to-right (top-down DFS), operate right-to-left (bottom-up) and store intermediate results (factors), so that repeated subcomputations are evaluated only once.
⇒ can speed things up by a factor of 1000! (or more, depending on the order of
variables!)
So?: Life goes on . . . In the hard cases, if need be we can throw exactitude to
the winds and approximate.
Example 22.3.7. Sampling techniques as in MCTS.
22.4 Conclusion
A Video Nugget covering this section can be found at https://fau.tv/clip/id/29228.
Summary
Bayesian networks (BN) are a wide-spread tool to model uncertainty, and to reason
about it. A BN represents conditional independence relations between random vari-
ables. It consists of a graph encoding the variable dependencies, and of conditional
probability tables (CPTs).
Given a variable ordering, the BN is small if every variable depends on only a few
of its predecessors.
Probabilistic inference requires to compute the probability distribution of a set
of query variables, given a set of evidence variables whose values we know. The
remaining variables are hidden.
Inference by enumeration takes a BN as input, then applies Normalization+Marginalization,
the chain rule, and exploits conditional independence. This can be viewed as a tree
search that branches over all values of the hidden variables.
Reading:
• Chapter 14: Probabilistic Reasoning of [RN03].
– Section 14.1 roughly corresponds to my “What is a Bayesian Network?”.
– Section 14.2 roughly corresponds to my “What is the Meaning of a Bayesian Network?” and
“Constructing Bayesian Networks”. The main change I made here is to define the semantics
of the BN in terms of the conditional independence relations, which I find clearer than RN’s
definition that uses the reconstructed full joint probability distribution instead.
– Section 14.4 roughly corresponds to my “Inference in Bayesian Networks”. RN give full details
on variable elimination, which makes for nice ongoing reading.
– Section 14.3 discusses how CPTs are specified in practice.
– Section 14.5 covers approximate sampling-based inference.
– Section 14.6 briefly discusses relational and first-order BNs.
– Section 14.7 briefly discusses other approaches to reasoning about uncertainty.
Chapter 23
Making Simple Decisions Rationally
23.1 Introduction
A Video Nugget covering this section can be found at https://fau.tv/clip/id/30338.
Overview
We now know how to update our world model, represented as (a set of) random
variables, given observations. Now we need to act.
For that we need to answer two questions:
Questions:
Given a world model and a set of actions, what will the likely consequences of each
action be?
How “good” are these consequences?
Idea:
Represent actions as “special random variables”:
Given disjoint actions a1 , . . ., an , introduce a random variable A with domain {a1 , . . ., an }.
Then we can model/query P(X|A = ai ).
Assign numerical values to the possible outcomes of actions (i.e. a function
u : dom(X) → R) indicating their desirability.
Definition 23.1.1. Decision theory investigates decision problems, i.e. how a model-
based agent a deals with choosing among actions based on the desirability of their
outcomes given by a real-valued utility function u on states s ∈ S: i.e. u : S → R.
Decision Theory
If our states are random variables, then we obtain a random variable for the utility
function:
Observation: Let Xi : Ω → Di be random variables on a probability model ⟨Ω, P⟩ and f : D1 × ... × Dn → E. Then F(ω) := f(X1(ω), ..., Xn(ω)) is a random variable Ω → E.
Definition 23.1.2. Given a probability model ⟨Ω, P⟩ and a random variable X : Ω → D with D ⊆ R, then E(X) := Σ_{x∈D} P(X = x) · x is called the expected value (or expectation) of X. (Assuming the sum/series is actually defined!)
Analogously, let e1, ..., en be a sequence of events. Then the expected value of X given e1, ..., en is defined as E(X|e1, ..., en) := Σ_{x∈D} P(X = x|e1, ..., en) · x.
Putting things together:
Definition 23.1.3. Let A : Ω → D be a random variable (where D is a set of actions), Xi : Ω → Di random variables (the state), and u : D1 × ... × Dn → R a utility function. Then the expected utility of the action a ∈ D is the expected value of u (interpreted as a random variable) given A = a; i.e.
EU(a) := Σ_{⟨x1,...,xn⟩∈D1×...×Dn} P(X1 = x1, ..., Xn = xn|A = a) · u(x1, ..., xn)
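A minimal sketch (ours) of what an MEU agent does with this definition: compute EU(a) for every action and pick the maximizer. The world model and utility below are made-up toy values, purely for illustration.

def expected_utility(action, outcome_dist, utility):
    # EU(a) = sum over outcomes x of P(x | A = a) * u(x)
    return sum(p * utility[x] for x, p in outcome_dist[action].items())

outcome_dist = {                    # toy P(X | A = a) for two hypothetical actions
    "take_umbrella":  {"dry": 0.95, "wet": 0.05},
    "leave_umbrella": {"dry": 0.70, "wet": 0.30},
}
utility = {"dry": 10, "wet": -20}   # toy utility function on outcomes

best = max(outcome_dist, key=lambda a: expected_utility(a, outcome_dist, utility))
print(best, expected_utility(best, outcome_dist, utility))   # take_umbrella 8.5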
Utility-based Agents
Definition 23.1.4. A utility-based agent uses a world model along with a utility
function that models its preferences among the states of that world. It chooses the
action that leads to the best expected utility.
Agent Schema:
[Figure 2.14 (AIMA): A model-based, utility-based agent. It uses a model of the world, along with a utility function that measures its preferences among states of the world. Then it chooses the action that leads to the best expected utility, where expected utility is computed by averaging over all possible outcome states, weighted by the probability of the outcome.]
Decision networks
Definition 23.2.1. A decision network is a Bayesian network with two additional kinds of nodes: action nodes, representing the agent's choice among its actions, and a utility node, representing the agent's utility function on states.
Note the sheer number of summands in the sum above in the general case! (⇒ We will simplify where possible later)
Rational Preferences
Note: Preferences of a rational agent must obey certain constraints – An agent with
rational preferences can be described as an MEU-agent.
Definition 23.3.6. We call a set ≻ of preferences rational, iff the following constraints
hold:
Orderability A≻B ∨ B≻A ∨ A∼B
Transitivity A≻B ∧ B≻C ⇒ A≻C
Continuity A≻B≻C ⇒ (∃p.[p,A;1−p,C]∼B)
Substitutability A∼B ⇒ [p,A;1−p,C]∼[p,B;1−p,C]
Monotonicity A≻B ⇒ ((p > q) ⇔ [p,A;1−p,B]≻[q,A;1−q,B])
Decomposability [p,A;1−p,[q,B;1−q,C]]∼[p,A ; ((1 − p)q),B ; ((1 − p)(1 − q)),C]
[Diagram: the compound lottery [p,A; 1−p,[q,B; 1−q,C]] is equivalent to the flat lottery with outcomes A, B, C and probabilities p, (1−p)q, and (1−p)(1−q).]
23.4 Utilities
Ramsey's Theorem and Value Functions
Theorem 23.4.1 (Ramsey, 1931; von Neumann and Morgenstern, 1944). Given a rational set of preferences, there exists a real-valued function U such that U(A) ≥ U(B) iff A ⪰ B, and U([p1,S1; ...; pn,Sn]) = Σ_i pi · U(Si).
Observation: With deterministic prizes only (no lottery choices), only a total
ordering on prizes can be determined.
Definition 23.4.2. We call a total ordering on states a value function or ordinal
utility function. (If we don’t need to care about relative utilities of states, e.g. to
compute non-trivial expected utilities, that’s all we need anyway!)
Utilities
Intuition: Utilities map states to real numbers.
Question: Which numbers exactly?
Definition 23.4.3 (Standard approach to assessment of human utilities).
Compare a given state A to a standard lottery Lp that has
“best possible prize” u⊤ with probability p
“worst possible catastrophe” u⊥ with probability 1 − p
adjust lottery probability p until A∼Lp . Then U (A) = p.
Comparing Utilities
Problem: What is the monetary value of a micromort?
Just ask people: What would you pay to avoid playing Russian roulette with a million-
barrelled revolver? (Usually: quite a lot!)
People appear to be willing to pay about 10,000€ more for a safer car that halves the risk of death. (⇝ 25€ per micromort)
This figure has been confirmed across many individuals and risk types.
Of course, this argument holds only for small risks. Most people won’t agree to kill
themselves for 25M€. (Also: People are pretty bad at estimating and comparing
risks, especially if they are small.) (Various cognitive biases and heuristics are at work
here!)
Given a lottery L with expected monetary value EMV(L), usually U (L) < U (EMV(L)),
i.e., people are risk averse.
Utility curve: For what probability p am I indifferent between a prize x and a lottery [p,M$; 1−p,0$] for large numbers M?
Typical empirical data, extrapolated with risk-prone behavior for debtors:
Strict Dominance
First Assumption: U is often monotone in each argument. (wlog. growing)
Definition 23.5.3. (Informally) An action B strictly dominates an action A, iff every
possible outcome of B is at least as good as every possible outcome of A,
Observation: Strict dominance seldom holds in practice (life is difficult) but is useful
for narrowing down the field of contenders.
Stochastic Dominance
Definition 23.5.4. Let X1 , X2 distributions with domains ⊆ R.
X1 stochastically dominates X2 iff for all t ∈ R, we have P (X1 ≥ t) ≥ P (X2 ≥ t),
and for some t, we have P (X1 ≥ t) > P (X2 ≥ t).
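For finite-domain distributions the condition of Definition 23.5.4 is easy to check mechanically; here is a small Python sketch (ours, with made-up example distributions over negated costs so that "larger is better"):

def stochastically_dominates(p1, p2):
    # p1, p2: dicts value -> probability; check P1(X >= t) >= P2(X >= t) for all t,
    # with strict inequality for at least one t
    thresholds = sorted(set(p1) | set(p2))
    tail = lambda p, t: sum(prob for v, prob in p.items() if v >= t)
    diffs = [tail(p1, t) - tail(p2, t) for t in thresholds]
    return all(d >= -1e-12 for d in diffs) and any(d > 1e-12 for d in diffs)

site1 = {-3: 0.2, -4: 0.6, -5: 0.2}    # negated cost distribution of airport site S1 (toy)
site2 = {-4: 0.2, -5: 0.6, -6: 0.2}    # negated cost distribution of airport site S2 (toy)
print(stochastically_dominates(site1, site2))   # True: S1 dominates S2 on (negated) cost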
Observation 23.5.5. If U is monotone in X1 , and P(X1 |a) stochastically dominates
P(X1 |b) for actions a, b, then a is always the better choice than b, with all other
attributes Xi being equal.
⇒ If some action P(Xi |a) stochastically dominates P(Xi |b) for all attributes Xi ,
we can ignore b.
Observation: Stochastic dominance can often be determined without exact distribu-
tions using qualitative reasoning.
Example 23.5.6 (Construction cost increases with distance). If airport location
S 1 is closer to the city than S 2 ; S 1 stochastically dominates S 2 on cost.q
We have seen how we can do inference with attribute-based utility functions, let us consider the
computational implications. We observe that we have just replaced one evil – exponentially many
states (in terms of the attributes) – by another – exponentially many parameters of the utility
functions.
So we do what we always do in AI-2: we look for structure in the domain, and do more theory to be able to turn such structures into computationally improved representations.
...that there is an additive value function: V(S) = Σ_i Vi(Xi(S)), where Vi is a value function referencing just one variable Xi.
Hence assess n single-attribute functions. (often a good approximation)
Example 23.5.11. The value function for the airport decision might be
U = Σ_{{X1,...,Xk}⊆X} ∏_{i=1}^{k} Ui(Xi = xi)
So far we have tacitly been concentrating on actions that directly affect the environment. We
will now come to a type of action we have hypothesized in the beginning of the course, but have
completely ignored up to now: information gathering actions.
Definition 23.6.2. Information value theory is concerned with agents making rational decisions about information gathering.
Solution: Compute the expected value of the best action given the information, minus
the expected value of the best action without information.
Example 23.6.4 (Oil Drilling Rights contd.).
The survey may say "oil in block 3" with probability 1/n; we then buy block 3 for k/n € and make a profit of (k − k/n)€.
So, we should pay up to k/n € for the information. (as much as block 3 is worth!)
Intuition: The VPI is the expected gain from knowing the value of F relative to
the current expected utility, and considering the relative probabilities of the possible
outcomes of F .
Properties of VPI
Observation 23.6.6 (VPI is Non-negative). VPI_E(F) ≥ 0 for all F and E (in expectation, not post hoc)
Observation 23.6.7 (VPI is Non-additive). VPI_E(F, G) ≠ VPI_E(F) + VPI_E(G) in general (consider, e.g., obtaining F twice)
Observation 23.6.8 (VPI is Order-independent). VPI_E(F, G) = VPI_E(F) + VPI_{E,F}(G) = VPI_E(G) + VPI_{E,G}(F)
We will now use information value theory to specialize our utility-based agent from above.
Summary
An MEU agent maximizes expected utility.
Decision theory provides a framework for rational decision making.
Decision networks augment Bayesian networks with action nodes and a utility node.
rational preferences allow us to obtain a utility function (orderability, transitivity,
continuity, substitutability, monotonicity, decomposability)
multi-attribute utility functions can usually be “destructured” to allow for better
inference and representation (can be monotone, attributes may dominate others,
actions may dominate others, may be multiplicative,...)
information value theory tells us when to explore rather than exploit, using
VPI (value of perfect information) to determine how much to “pay” for information.
Chapter 24
Temporal Probability Models
Stochastic Processes
The world changes in stochastically predictable ways.
Example 24.1.1.
The weather changes, but the weather tomorrow is somewhat predictable given
today’s weather and other factors, (which in turn (somewhat) depends on
yesterday’s weather, which in turn...)
the stock market changes, but the stock price tomorrow is probably related to
today’s price,
A patient’s blood sugar changes, but their blood sugar is related to their blood
sugar 10 minutes ago (in particular if they didn’t eat anything in between)
Markov Processes
Idea: Construct a Bayesian network from these variables (parents?)
...without everything exploding in size...?
Definition 24.1.6. Let (X t )t∈S a stochastic process. X has the (nth order) Markov
property iff X t only depends on a bounded subset of X0:t−1 – i.e. for all t ∈ S we have
P(X t |X 0 , . . .X t−1 ) = P(X t |X t−n , . . .X t−1 ) for some n ∈ S.
A stochastic process with the Markov property for some n is called a (nth order)
Markov process.
Important special cases:
Definition 24.1.7. A Markov process of order 1 is called a Markov chain; it satisfies P(Xt|X0, ..., Xt−1) = P(Xt|Xt−1). Analogously, a second-order Markov process satisfies P(Xt|X0, ..., Xt−1) = P(Xt|Xt−2, Xt−1).
Problem: This network does not actually have the First-order Markov property...
Possible fixes: We have two ways to fix this:
1. Increase the order of the Markov process. (more dependencies ⇒ more complex
inference)
2. Add more state variables, e.g., Tempt , Pressuret . (more information sources)
[Diagram: state variables Xt−1, Xt, Xt+1 in a chain, each emitting an evidence variable Zt−1, Zt, Zt+1.]
Example 24.1.10 (Battery Powered Robot). If the robot has a battery, the Markov
property is violated!
Battery exhaustion has a systematic effect on the change in velocity.
This depends on how much power was used by all previous manoeuvres.
[Diagram: the extended model with battery variables Bt−1, Bt, Bt+1 and velocity variables Vt−1, Vt, Vt+1 added to the state chain Xt−1, Xt, Xt+1 with observations Zt−1, Zt, Zt+1.]
Definition 24.1.15. We say that a sensor model has the sensor Markov property, iff
P(E t |X0:t , E1:t−1 ) = P(E t |X t ) – i.e., the sensor model depends only on the current
state.
Assumptions on Sensor Models: We usually assume the sensor Markov property and
make it stationary as well: P(E t |X t ) is fixed for all t.
Definition 24.1.16 (Note).
If a Markov chain X is stationary and discrete, we can represent the transition
model as a matrix Tij := P (X t = j|X t−1 = i).
If a sensor model has the sensor Markov property, we can represent each observation
E t = et at time t as the diagonal matrix Ot with Otii := P (E t = et |X t = i).
A pair ⟨X, E⟩ where X is a (stationary) Markov chain, Ei only depends on Xi, and E has the sensor Markov property is called a (stationary) hidden Markov model (HMM). (X and E are single variables)
Inference tasks
Definition 24.2.1. Given a Markov process with state variables Xt and evidence variables Et, we are interested in the following Markov inference tasks:
Filtering (or monitoring) P(Xt|E1:t = e1:t): Given the sequence of observations up until time t, compute the likely state of the world at current time t.
Prediction (or state estimation) P(Xt+k|E1:t = e1:t) for k > 0: Given the sequence of observations up until time t, compute the likely future state of the world at time t + k.
Smoothing (or hindsight) P(Xt−k|E1:t = e1:t) for 0 < k < t: Given the sequence of observations up until time t, compute the likely past state of the world at time t − k.
Most likely explanation argmax_{x1:t} P(X1:t = x1:t|E1:t = e1:t): Given the sequence of observations up until time t, compute the most likely sequence of states that led to these observations.
Note: The most likely sequence of states is not (necessarily) the sequence of most likely states ;-)
In this section, we assume X and E to represent multiple variables, where X jointly
forms a Markov chain and the E jointly have the sensor Markov property.
In the case where X and E are stationary single variables, we have a stationary
hidden Markov model and can use the matrix forms.
Using the full joint probability distribution, we can compute any conditional prob-
ability we want, but not necessarily efficiently.
We want to use filtering to update our "world model" P(Xt) based on a new observation Et = et and our previous world model P(Xt−1).
Spoiler: F(et, P(Xt−1|E1:t−1 = e1:t−1)) = α(Ot · Tᵀ · P(Xt−1|E1:t−1 = e1:t−1))
Filtering Derivation
P(Xt|E1:t = e1:t) = P(Xt|Et = et, E1:t−1 = e1:t−1)   (dividing up evidence)
= α(P(Et = et|Xt, E1:t−1 = e1:t−1) · P(Xt|E1:t−1 = e1:t−1))   (using Bayes' rule)
= α(P(Et = et|Xt) · P(Xt|E1:t−1 = e1:t−1))   (sensor Markov property)
= α(P(Et = et|Xt) · (Σ_{x∈dom(X)} P(Xt|Xt−1 = x, E1:t−1 = e1:t−1) · P(Xt−1 = x|E1:t−1 = e1:t−1)))   (marginalization)
= α(P(Et = et|Xt) · (Σ_{x∈dom(X)} P(Xt|Xt−1 = x) · P(Xt−1 = x|E1:t−1 = e1:t−1)))   (conditional independence)
where the three factors are the sensor model, the transition model, and the recursive call, respectively.
Definition 24.2.2. We call the inner part of the above expression the forward algorithm, i.e. P(Xt|E1:t = e1:t) = α(FORWARD(et, P(Xt−1|E1:t−1 = e1:t−1))) =: f1:t.
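In code, one filtering step is only a few lines. The Python sketch below (ours) uses the matrix form f1:t = α(Ot · Tᵀ · f1:t−1) with the umbrella numbers that appear in Example 24.2.9 later in this section:

T = [[0.6, 0.4],      # T[i][j] = P(X_t = j | X_{t-1} = i); state 0 = rain, state 1 = no rain
     [0.2, 0.8]]

def O(umbrella):      # diagonal of the sensor matrix O_t for the given observation
    return [0.9, 0.15] if umbrella else [0.1, 0.85]

def forward(f_prev, umbrella):
    # one step of the forward algorithm: f_{1:t} = α(O_t · Tᵀ · f_{1:t-1})
    predicted = [sum(T[i][j] * f_prev[i] for i in range(2)) for j in range(2)]
    unnorm = [O(umbrella)[j] * predicted[j] for j in range(2)]
    z = sum(unnorm)
    return [x / z for x in unnorm]

f = [0.5, 0.5]                        # P(R_0)
for e in (True, True, False):         # umbrella on days 1 and 2, not on day 3
    f = forward(f, e)
    print(f)                          # ⟨0.8, 0.2⟩, ⟨0.87, 0.13⟩, ⟨0.12, 0.88⟩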
Observation 24.2.6. As k → ∞, P(Xt+k|E1:t = e1:t) converges towards a fixed point called the stationary distribution of the Markov chain. (which we can compute from the equation S = Tᵀ · S)
⇒ the impact of the evidence vanishes.
⇒ The stationary distribution only depends on the transition model.
⇒ There is a small window of time (depending on the transition model) where the evidence has enough impact to allow for prediction beyond the mere stationary distribution.
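The fixed point S = Tᵀ · S can be found, e.g., by power iteration. A small sketch (ours), continuing the filtering example above; for its transition matrix the stationary distribution works out to ⟨1/3, 2/3⟩:

T = [[0.6, 0.4],
     [0.2, 0.8]]
S = [0.5, 0.5]
for _ in range(100):                  # iterate S ← Tᵀ · S until (numerical) convergence
    S = [sum(T[i][j] * S[i] for i in range(2)) for j in range(2)]
print(S)                              # ≈ [0.333, 0.667]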
Smoothing
Smoothing: P(Xt−k|E1:t = e1:t) for k > 0.
Intuition: Use filtering to compute P(Xt−k|E1:t−k = e1:t−k), then recurse backwards from t until t − k.
P(Xt−k|E1:t = e1:t) = P(Xt−k|Et−(k−1):t = et−(k−1):t, E1:t−k = e1:t−k)   (Divide the evidence)
= α(P(Et−(k−1):t = et−(k−1):t|Xt−k, E1:t−k = e1:t−k) · P(Xt−k|E1:t−k = e1:t−k))   (Bayes' rule)
= α(P(Et−(k−1):t = et−(k−1):t|Xt−k) · P(Xt−k|E1:t−k = e1:t−k))   (cond. independence)
= α(f1:t−k × bt−(k−1):t)
where f1:t−k := P(Xt−k|E1:t−k = e1:t−k) and bt−(k−1):t := P(Et−(k−1):t = et−(k−1):t|Xt−k).
Smoothing (continued)
Definition 24.2.7 (Backward message).
bt−k:t = P(Et−k:t = et−k:t|Xt−(k+1))
= Σ_{x∈dom(X)} P(Et−k:t = et−k:t|Xt−k = x, Xt−(k+1)) · P(Xt−k = x|Xt−(k+1))
= Σ_{x∈dom(X)} P(Et−k:t = et−k:t|Xt−k = x) · P(Xt−k = x|Xt−(k+1))
= Σ_{x∈dom(X)} P(Et−k = et−k, Et−(k−1):t = et−(k−1):t|Xt−k = x) · P(Xt−k = x|Xt−(k+1))
= Σ_{x∈dom(X)} P(Et−k = et−k|Xt−k = x) · P(Et−(k−1):t = et−(k−1):t|Xt−k = x) · P(Xt−k = x|Xt−(k+1))
where the factors are the sensor model, the recursive backward message bt−(k−1):t, and the transition model, respectively.
Note: in a stationary hidden Markov model, we get the matrix formulation bt−k:t = T · Ot−k · bt−(k−1):t.
Definition 24.2.8. We call the associated algorithm the backward algorithm, i.e.
P(Xt−k|E1:t = e1:t) = α(FORWARD(et−k, f1:t−(k+1)) × BACKWARD(et−(k−1), bt−(k−2):t)),
where FORWARD(et−k, f1:t−(k+1)) = f1:t−k and BACKWARD(et−(k−1), bt−(k−2):t) = bt−(k−1):t.
As a starting point for the recursion, we let bt+1:t be the uniform vector with 1 in every component.
Smoothing example
Example 24.2.9 (Smoothing Umbrellas). Reminder: We assumed P(R0) = ⟨0.5, 0.5⟩, P(Rt+1|Rt) = 0.6, P(¬Rt+1|¬Rt) = 0.8, P(Ut|Rt) = 0.9, P(¬Ut|¬Rt) = 0.85
⇒ T = ((0.6, 0.4), (0.2, 0.8)), O1 = O2 = diag(0.9, 0.15), and O3 = diag(0.1, 0.85).
(The director carries an umbrella on days 1 and 2, and not on day 3)
Filtering gives f1:1 = ⟨0.8, 0.2⟩, f1:2 = ⟨0.87, 0.13⟩ and f1:3 = ⟨0.12, 0.88⟩.
Let's compute the smoothed estimates from these forward messages and the backward messages; a small computation is sketched below.
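Here is a self-contained Python sketch (ours) of the forward-backward computation for exactly this example; it reproduces the forward messages quoted above and then computes the smoothed estimates P(Rk|e1:3):

T = [[0.6, 0.4], [0.2, 0.8]]                      # transition model (rain, no rain)
def O(umbrella):                                  # diagonal of the sensor matrix
    return [0.9, 0.15] if umbrella else [0.1, 0.85]

def forward(f_prev, e):                           # f_{1:t} = α(O_t · Tᵀ · f_{1:t-1})
    v = [O(e)[j] * sum(T[i][j] * f_prev[i] for i in range(2)) for j in range(2)]
    z = sum(v)
    return [x / z for x in v]

def backward(b_next, e):                          # b = T · O_t · b'
    ob = [O(e)[j] * b_next[j] for j in range(2)]
    return [sum(T[i][j] * ob[j] for j in range(2)) for i in range(2)]

evidence = [True, True, False]
fs, f = [], [0.5, 0.5]
for e in evidence:                                # forward pass
    f = forward(f, e)
    fs.append(f)
b, smoothed = [1.0, 1.0], [None] * 3
for k in range(2, -1, -1):                        # backward pass, combining f and b
    v = [fs[k][i] * b[i] for i in range(2)]
    z = sum(v)
    smoothed[k] = [x / z for x in v]
    b = backward(b, evidence[k])
print(fs)         # [0.8, 0.2], [0.87, 0.13], [0.12, 0.88]
print(smoothed)   # the smoothed estimates P(R_k | e_{1:3})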
A space-optimized variant computes both f1:i and bt−i:t in a single (second) for-loop, storing only one copy of each at a time ⇒ constant space.
But: This requires that both matrices are invertible, i.e. every observation must be possible in every state. (Possible hack: increase the probabilities of 0 to "negligibly small")
m1:t := max_{x1,...,xt−1} P(X1:t−1 = x1:t−1, Xt|E1:t = e1:t)
m1:t(i) gives the maximal probability that the most likely path up to t leads to state Xt = i.
Note that we can leave out the α, since we're only interested in the maximum.
Example 24.2.12. For the sequence [T, T, F, T, T]:
[Figure 15.5 (AIMA): (a) Possible state sequences for Raint can be viewed as paths through a graph of the possible states at each time step. (b) Operation of the Viterbi algorithm for the umbrella observation sequence [true, true, false, true, true]. For each t, the values of the message m1:t give the probability of the best sequence reaching each state at time t; bold arrows mark each state's best predecessor as measured by "best preceding-sequence probability × transition probability". Following the bold arrows back from the most likely state in m1:5 gives the most likely sequence.]
The Viterbi Algorithm
Definition 24.2.13. The Viterbi algorithm now proceeds as follows:
function Viterbi(⟨e1, ..., et⟩, P(X0))
  m := P(X0)                          /* m1:i */
  prev := ⟨⟩                          /* the most likely predecessor of each possible xi */
  for i = 1, ..., t do
    m′ := max_{xi−1} (P(Ei = ei|Xi) · P(Xi|Xi−1 = xi−1) · m_{xi−1})
    prev_i := argmax_{xi−1} (P(Ei = ei|Xi) · P(Xi|Xi−1 = xi−1) · m_{xi−1})
    m ←− m′
  P := ⟨0, 0, ..., argmax_{x∈dom(X)} m_x⟩
  for i = t − 1, ..., 0 do
    P_i := prev_{i,P_{i+1}}
  return P
Observation 24.2.14. Viterbi has linear time complexity and linear space complexity (needs to keep the most likely sequence leading to each state).
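A direct Python transcription of the Viterbi pseudo code (ours), run on the umbrella model of Example 24.2.9 with the observation sequence [T, T, F, T, T] of Example 24.2.12:

states = ("rain", "no_rain")
prior  = {"rain": 0.5, "no_rain": 0.5}                  # P(X_0)
trans  = {"rain":    {"rain": 0.6, "no_rain": 0.4},     # P(X_i | X_{i-1})
          "no_rain": {"rain": 0.2, "no_rain": 0.8}}
sensor = {True:  {"rain": 0.9, "no_rain": 0.15},        # P(E_i = e | X_i)
          False: {"rain": 0.1, "no_rain": 0.85}}

def viterbi(evidence):
    m, prevs = dict(prior), []
    for e in evidence:
        prev, m_new = {}, {}
        for x in states:
            best = max(states, key=lambda xp: trans[xp][x] * m[xp])  # best predecessor of x
            prev[x] = best
            m_new[x] = sensor[e][x] * trans[best][x] * m[best]
        prevs.append(prev)
        m = m_new
    path = [max(states, key=lambda x: m[x])]             # most likely final state
    for prev in reversed(prevs[1:]):                     # follow best predecessors backwards
        path.insert(0, prev[path[0]])
    return path

print(viterbi([True, True, False, True, True]))
# ['rain', 'rain', 'no_rain', 'rain', 'rain'] with these parameters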
Remark 24.3.2. This only works for perfect sensors. (else no impossible states)
What if our sensors are imperfect?
The transition matrix for the move action (T has 42² = 1764 entries):
P(Xt+1 = j|Xt = i) = Tij = 1/|N(i)| if j ∈ N(i), and 0 otherwise.
We do not know where the robot starts: P(X0 = i) = 1/n (here n = 42).
Evidence variable Et: four bits for presence/absence of obstacles in N, S, W, E. Let dit be the number of wrong bits and ϵ the error rate of the sensor. Then
P(Et = et|Xt = i) = Otii = (1 − ϵ)^{4−dit} · ϵ^{dit}
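A small sketch (ours) of how these two models can be built in Python for a toy maze; the maze layout, coordinate conventions, and helper names are made up for illustration:

import itertools

free = [(x, y) for x, y in itertools.product(range(7), range(3))]   # toy 7×3 maze, all squares free
index = {sq: i for i, sq in enumerate(free)}
n = len(free)

def neighbors(sq):
    x, y = sq
    return [p for p in ((x+1, y), (x-1, y), (x, y+1), (x, y-1)) if p in index]

# transition model: move to a uniformly chosen neighboring free square
T = [[0.0] * n for _ in range(n)]
for sq, i in index.items():
    for nb in neighbors(sq):
        T[i][index[nb]] = 1.0 / len(neighbors(sq))

eps = 0.2                                                # per-bit sensor error rate ϵ
def O_diag(reading):
    # diagonal of O_t for a 4-bit NSWE obstacle reading, e.g. {"N": True, "S": False, ...}
    diag = []
    for sq in free:
        truth = {"N": (sq[0], sq[1]+1) not in index, "S": (sq[0], sq[1]-1) not in index,
                 "W": (sq[0]-1, sq[1]) not in index, "E": (sq[0]+1, sq[1]) not in index}
        d = sum(reading[k] != truth[k] for k in "NSWE")  # number of wrong bits d_it
        diag.append((1 - eps) ** (4 - d) * eps ** d)
    return diag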
[Figure 15.7 (AIMA): Posterior distribution over robot location: (a) after one observation E1 = NSW; (b) after a second observation E2 = NS. The size of each disk corresponds to the probability that the robot is at that location. The sensor error rate is ϵ = 0.2.]
Still the same locations as in the "perfect sensing" case, but now other locations have non-zero probability.
HMM Example: Further Inference Applications
Idea: We can use smoothing (bk+1:t = T · Ok+1 · bk+2:t) to find out where the robot started, and the Viterbi algorithm to find the most likely path it took.
Example 24.3.5. Performance of HMM localization vs. observation length (various error rates ϵ):
[Figure 15.8 (AIMA): localization error and Viterbi path accuracy as a function of the length of the observation sequence, for per-bit sensor error rates ϵ ∈ {0.00, 0.02, 0.05, 0.10, 0.20}. Even when ϵ is 20% – which means that the overall sensor reading is wrong 59% of the time – the robot is usually able to work out its location within two squares after 25 observations; when ϵ is 10%, the performance after a half-dozen observations is hard to distinguish from the performance with perfect sensing.]
24.4 Dynamic Bayesian Networks
A Video Nugget covering this section can be found at https://fau.tv/clip/id/30355.
Dynamic Bayesian networks
Definition 24.4.1. A Bayesian network D is called dynamic (a DBN), iff its random variables are indexed by a time structure. We assume that D is
time sliced, i.e. that the time slices Dt – the subgraphs of t-indexed random
variables and the edges between them – are isomorphic.
a stationary Markov chain, i.e. that variables Xt can only have parents in Dt
and Dt−1 .
[Figure: the first slice of the umbrella DBN – nodes Rain0 → Rain1 → Umbrella1 with CPTs P(R0) = 0.7; P(R1|R0): T ↦ 0.7, F ↦ 0.3; P(U1|R1): T ↦ 0.9, F ↦ 0.2.]
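As an illustration of filtering in this model, here is a minimal sketch of the forward recursion for the umbrella DBN slice above; the CPT numbers are the ones shown in the figure, everything else (names, observation sequence) is illustrative.

import numpy as np

# transition P(R_t | R_{t-1}) and sensor P(U_t | R_t); state order: (rain, no rain)
T = np.array([[0.7, 0.3],            # from rain:    P(rain) = 0.7
              [0.3, 0.7]])           # from no rain: P(rain) = 0.3
O_umbrella    = np.diag([0.9, 0.2])  # P(U = true  | R)
O_no_umbrella = np.diag([0.1, 0.8])  # P(U = false | R)

def forward(belief, observed_umbrella):
    """One filtering step: predict with T, weight by the sensor model, normalize."""
    O = O_umbrella if observed_umbrella else O_no_umbrella
    new_belief = O @ T.T @ belief
    return new_belief / new_belief.sum()

belief = np.array([0.7, 0.3])        # prior P(R_0) = 0.7
for u in [True, True, False]:        # a short (made-up) observation sequence
    belief = forward(belief, u)
    print(belief)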
Summary
Temporal probability models use state and evidence variables replicated over time.
Markov property and stationarity assumption, so we need both
a transition model P(Xt |Xt−1 ) and
a sensor model P(Et |Xt ).
Tasks are filtering, prediction, smoothing, most likely sequence; (all done
recursively with constant cost per time step)
Hidden Markov models have a single discrete state variable; (used for speech
recognition)
DBNs subsume HMMs, exact update intractable.
We will now pick up the thread from ?? but using temporal models instead of simply probabilistic ones. We will first look at sequential decision theory in the special case where the environment is stochastic but fully observable (Markov decision processes), and then lift that to obtain POMDPs and present an agent design based on them.
Outline
We will now combine the idea of a stochastic process with that of acting based on maximizing expected utility:
[Diagram: search – explicit actions and subgoals – combined with uncertainty and utility.]
We will fortify our intuition by an example. It is specifically chosen to be very simple, but
to exhibit all the peculiarities of Markov decision problems, which we will generalize from this
example.
Perhaps more interesting than the components of an MDP is what is not a component: a belief and/or sensor model. Recall that MDPs are for fully observable environments.
Idea: We use the rewards as a utility function: The goal is to choose actions such that the expected cumulative reward for the "foreseeable future" is maximized
⇒ need to take future actions and future states into account
Solving MDPs
In MDPs, the aim is to find an optimal policy π(s), which tells us the best action
for every possible state s. (because we can’t predict where we might end up, we
need to consider all states)
Note: When you run against a wall, you stay in your square.
[Figure: optimal policies for the 4×3 world for four different ranges of the reward R(s) for nonterminal states: R(s) < −1.6284; −0.4278 < R(s) < −0.0850; −0.0221 < R(s) < 0; R(s) > 0.]
Question: Explain what you see in a qualitative manner!
Answer: reserved for the plenary sessions ; be there!
Utility of States
Remember: Given a sequence of states S = s0 , s1 , s2 , . . ., and a discount factor
0 ≤ γ < 1, the utility of the sequence is
u(S) = ∑_{t=0}^{∞} γ^t · R(s_t)
Definition 25.2.3. Given a policy π and a starting state s0 , let S^π_{s0} be the random variable giving the sequence of states resulting from executing π at every state starting at s0 . (Since the environment is stochastic, we don't know the exact sequence.)
Then the expected utility obtained by executing π starting in s0 is given by U^π(s0) := EU(S^π_{s0}).
⇒ given the “true” utilities, we can compute the optimal policy and vice versa.
Question: Why do we go left in (3, 1) and not up? (follow the utility)
expected sum of rewards = current reward + γ · exp. reward sum after best action
Definition 25.3.3. The value iteration algorithm for utility functions is given by
function VALUE−ITERATION (mdp,ϵ) returns a utility fn.
inputs: mdp, an MDP with states S, actions A(s), transition model P (s′ |s, a),
rewards R(s), and discount γ
ϵ, the maximum error allowed in the utility of any state
local variables: U , U ′ , vectors of utilities for states in S, initially zero
δ, the maximum change in the utility of any state in an iteration
repeat
U := U ′ ; δ := 0
for each state s in S do
U ′ [s] := R(s) + γ · max_{a∈A(s)} (∑_{s′} U [s′ ] · P (s′ |s, a))
if |U ′ [s] − U [s]| > δ then δ := |U ′ [s] − U [s]|
until δ < ϵ(1 − γ)/γ
return U
Remark: Retrieve the optimal policy with π[s] := argmax_{a∈A(s)} (∑_{s′} U [s′ ] · P (s′ |s, a))
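As a concrete illustration of the algorithm and the remark above, here is a minimal sketch in Python; the MDP encoding via functions A, P, R is illustrative, not part of the lecture materials.

def value_iteration(states, A, P, R, gamma, eps):
    """Value iteration as in VALUE-ITERATION above.
    states: list of states; A(s): available actions (assumed non-empty);
    P(s1, s, a): transition probability P(s'|s,a); R(s): reward; gamma < 1."""
    U1 = {s: 0.0 for s in states}
    while True:
        U, delta = dict(U1), 0.0
        for s in states:
            U1[s] = R(s) + gamma * max(
                sum(P(s1, s, a) * U[s1] for s1 in states) for a in A(s))
            delta = max(delta, abs(U1[s] - U[s]))
        if delta < eps * (1 - gamma) / gamma:
            return U1

def best_policy(states, A, P, U):
    """Retrieve the optimal policy as in the remark above."""
    return {s: max(A(s), key=lambda a: sum(P(s1, s, a) * U[s1] for s1 in states))
            for s in states}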
[Figure: (a) the utility estimates for selected states ((4,3), (3,3), (1,1), (3,1), (4,1)) as a function of the number of value iteration steps; (b) the number of iterations required to guarantee an error of at most ε = c · Rmax, as a function of the discount factor, for c = 0.0001, 0.001, 0.01, 0.1.]
Convergence
where the update is assumed to be applied simultaneously to all the states at each iteration. If we apply the Bellman update infinitely often, we are guaranteed to reach an equilibrium (see Section 17.2.3), in which case the final utility values must be solutions to the Bellman equations. In fact, they are also the unique solutions, and the corresponding policy (obtained using Equation (17.4)) is optimal. The algorithm, called VALUE-ITERATION, is shown in Figure 17.4. We can apply value iteration to the 4×3 world in Figure 17.1(a). Starting with initial values of zero, the utilities evolve as shown in Figure 17.5(a).

Definition 25.3.5. The maximum norm is defined as ∥U∥ = max_s |U(s)|, so ∥U − V∥ = maximum difference between U and V.

Let U^t and U^{t+1} be successive approximations to the true utility U during value iteration.
Theorem 25.3.6. For any two approximations U^t and V^t:
∥U^{t+1} − V^{t+1}∥ ≤ γ · ∥U^t − V^t∥
I.e., any distinct approximations get closer to each other over time
In particular, any approximation gets closer to the true U over time
⇒ value iteration converges to a unique, stable, optimal solution.
Theorem 25.3.7. If ∥U^{t+1} − U^t∥ < ϵ, then ∥U^{t+1} − U∥ < 2ϵγ/(1 − γ)
(once the change in U t becomes small, we are almost done.)
Remark: The policy resulting from U^t may be optimal long before the utilities converge!
So we see that iteration with Bellman updates will always converge towards the utility of a state,
even without knowing the optimal policy. That gives us a first way of dealing with sequential
decision problems: we compute utility functions based on states and then use the standard MEU
machinery. We have seen above that optimal policies and state utilities are essentially inter-
changeable: we can compute one from the other. This leads to another approach to computing
state utilities: policy iteration, which we will discuss now.
Policy Iteration
Recap: Value iteration computes utilities ; optimal policy by MEU.
This even works if the utility estimate is inaccurate. (⇝ policy loss small)
Idea: Search for optimal policy and utility values simultaneously [How60]: Iterate
policy evaluation: given policy πi , calculate Ui = U πi , the utility of each state
were πi to be executed.
policy improvement: calculate a new MEU policy πi+1 using one-step lookahead
Terminate if policy improvement yields no change in computed utilities.
Observation 25.3.8. Upon termination Ui is a fixpoint of Bellman update
; Solution to Bellman equation ; πi is an optimal policy.
Observation 25.3.9. Policy improvement improves policy and policy space is finite
; termination.
Policy Evaluation
Problem: How to implement the POLICY−EVALUATION algorithm?
Solution: To compute utilities given a fixed π: For all s we have
U (s) = R(s) + γ · (∑_{s′} U (s′ ) · P (s′ |s, π(s)))
(i.e. Bellman equation with the maximum replaced by the current policy π)
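A minimal sketch that computes U for a fixed policy by solving this linear system directly; the state encoding and names are illustrative, and we assume γ < 1 (or proper terminal handling) so that the system is nonsingular.

import numpy as np

def policy_evaluation(states, pi, P, R, gamma):
    """Solve U = R + gamma * P_pi U, i.e. (I - gamma*P_pi) U = R, for a fixed policy pi."""
    n = len(states)
    idx = {s: i for i, s in enumerate(states)}
    P_pi = np.zeros((n, n))
    for s in states:
        for s1 in states:
            P_pi[idx[s], idx[s1]] = P(s1, s, pi[s])   # P(s' | s, pi(s))
    R_vec = np.array([R(s) for s in states])
    U = np.linalg.solve(np.eye(n) - gamma * P_pi, R_vec)
    return {s: U[idx[s]] for s in states}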
Example 25.3.11 (Simplified Bellman Equations for π).
Partial Observability
Definition 25.4.1. A partially observable MDP (a POMDP for short) is an MDP
together with an observation model O that has the sensor Markov property and is
stationary: O(s, e) = P (e|s).
Example 25.4.2 (Noisy 4x3 World).
Problem: Agent does not know which state it is in ; makes no sense to talk
about policy π(s)!
Theorem 25.4.3 (Astrom 1965). The optimal policy in a POMDP is a function
π(b) where b is the belief state (probability distribution over states).
Idea: Convert a POMDP into an MDP in belief state space, where T (b, a, b′ ) is
the probability that the new belief state is b′ given that the current belief state is b
and the agent does a. I.e., essentially a filtering update step.
For POMDPs, we also need to consider actions. (but the effect is the same)
If b is the previous belief state and agent does action A = a and then perceives
E = e, then the new belief state is
b′(s′) = α · (P(E = e|s′ ) · (∑_s P(s′ |S = s, A = a) · b(s)))
Consequence: The optimal policy can be written as a function π ∗ (b) from belief
states to actions.
Definition 25.4.4. The POMDP decision cycle is to iterate over
1. Given the current belief state b, execute the action a = π ∗ (b)
2. Receive percept e.
3. Set the current belief state to FORWARD(b, a, e) and repeat.
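A minimal sketch of such a FORWARD update; the function and argument names are illustrative, with P and Psense standing for the transition and sensor models.

def forward(belief, action, percept, states, P, Psense):
    """b'(s') = alpha * P(e|s') * sum_s P(s'|s,a) * b(s)  -- cf. the update above."""
    new_belief = {}
    for s1 in states:
        new_belief[s1] = Psense(percept, s1) * sum(
            P(s1, s, action) * belief[s] for s in states)
    alpha = 1.0 / sum(new_belief.values())
    return {s1: alpha * p for s1, p in new_belief.items()}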
Intuition: POMDP decision cycle is search in belief state space.
Example 25.4.6. The belief state of the 4x3 world is an 11-dimensional continuous space. (11 states)
Theorem 25.4.7. Solving POMDPs is very hard! (actually, PSPACE hard)
Write the probability of reaching b′ from b, given action a, as P (b′ |b, a); then
P (b′ |b, a) = ∑_e P (b′ |e, a, b) · P (e|a, b) = ∑_e P (b′ |e, a, b) · (∑_{s′} P (e|s′ ) · (∑_s P (s′ |s, a) · b(s)))
Observation: This equation defines a transition model for belief state space!
Idea: We can also define a reward function for belief states:
ρ(b) := ∑_s b(s) · R(s)
i.e., the expected reward for the actual states the agent might be in.
Together, P (b′ |b, a) and ρ(b) define an (observable) MDP on the space of belief
states.
Theorem 25.4.8. An optimal policy π ∗ (b) for this MDP is also an optimal policy for the original POMDP.
Upshot: Solving a POMDP on a physical state space can be reduced to solving
an MDP on the corresponding belief state space.
π ∗ will choose to execute the conditional plan with highest expected utility
Observation 3 (combined): The utility function U (b) on belief states, being the maximum of a collection of hyperplanes, is piecewise linear and convex.
Consider the one-step plans [Stay] and [Go] and their direct utilities:
[Plot: the utilities of the one-step plans [Stay] and [Go] as functions of the probability of state 1.]
The maximum represents the utility function for the finite-horizon problem that allows just one action; in each "piece" the optimal action is the first action of the corresponding plan. Here the optimal one-step policy is to "Stay" when b(1) > 0.5 and "Go" otherwise.
[Figure 17.8: (a) utility of the two one-step plans as a function of the initial belief state b(1) for the two-state world, with the corresponding utility function shown in bold; (b) utilities for 8 distinct two-step plans; (c) utilities for four undominated two-step plans.]
There are four undominated plans, each optimal in their region.
The elimination of dominated plans is essential for reducing this doubly exponential
growth (but they are already constructed)
Hopelessly inefficient in practice – even the 3x4 POMDP is too hard!
Decisions are made in DDN by projecting forward possible action sequences and
choosing the best one.
DDNs – like the DBNs they are based on – are factored representations
; typically exponential complexity advantages!
Figure 17.10 shows the generic structure of a dynamic decision network. Variables with known values are shaded. The current time is t and the agent must decide what to do – that is, choose a value for At . The network has been unrolled into the future for three steps and represents future rewards, as well as the utility of the state at the look-ahead horizon.

The POMDP state St becomes a set of random variables Xt , and there may be multiple evidence variables Et . The action at time t is denoted by At , so the transition model becomes P(Xt+1 |Xt , At ) and the sensor model P(Et |Xt ). Reward functions Rt and utility Ut of the state St . Variables with known values are gray; there are reward nodes for t = 0, . . ., t + 2, but a utility node for t + 3 (=b the discounted sum of the remaining rewards).

Problem: How do we compute with that?
Answer: All POMDP algorithms can be adapted to DDNs! (only need CPTs)

17.4.3 Online agents for POMDPs
In this section, we outline a simple approach to agent design for partially observable, stochastic environments. The basic elements of the design are already familiar:
• The transition and sensor models are represented by a dynamic Bayesian network (DBN), as described in Chapter 15.
• The dynamic Bayesian network is extended with decision and utility nodes, as used in decision networks in Chapter 16. The resulting model is called a dynamic decision network, or DDN.
• A filtering algorithm is used to incorporate each new percept and action and to update the belief state representation.
• Decisions are made by projecting forward possible action sequences and choosing the best one.
DBNs are factored representations in the terminology of Chapter 2; they typically have an exponential complexity advantage over atomic representations and can model quite substantial real-world problems. The agent design is therefore a practical implementation of the utility-based agent sketched in Chapter 2.
In the DBN, the single state St becomes a set of state variables Xt , and there may be multiple evidence variables Et . We will use At to refer to the action at time t, so the transition model becomes P(Xt+1 |Xt , At ) and the sensor model becomes P(Et |Xt ). We will use Rt to refer to the reward received at time t and Ut to refer to the utility of the state at time t. (Both of these are random variables.) With this notation, a dynamic decision network looks like the one shown in Figure 17.10.
Dynamic decision networks can be used as inputs for any POMDP algorithm, including those for value and policy iteration methods. In this section, we focus on look-ahead methods that project action sequences forward from the current belief state in much the same way as do the game-playing algorithms of Chapter 5. The network in Figure 17.10 has been projected three steps into the future; the current and future decisions A and the future observations E and rewards R are all unknown. Notice that the network includes nodes for the rewards for Xt+1 and Xt+2 , but the utility for Xt+3 . This is because the agent must maximize the (discounted) sum of all future rewards, and U (Xt+3 ) represents the reward for Xt+3 and all subsequent rewards. As in Chapter 5, we assume that U is available only in some approximate form: if exact utility values were available, look-ahead beyond depth 1 would be unnecessary.

Lookahead: Searching over the Possible Action Sequences

Idea: Search over the tree of possible action sequences (like in game-play)
Part of the lookahead solution of the DDN above (three steps lookahead):
[Figure 17.11: part of the look-ahead solution of the DDN in Figure 17.10; each decision will be taken in the belief state indicated.]
circle =b chance nodes (the environment decides)
triangle =b belief state (each action decision is taken there)

Figure 17.11 shows part of the search tree corresponding to the three-step look-ahead DDN in Figure 17.10. Each of the triangular nodes is a belief state in which the agent makes a decision At+i for i = 0, 1, 2, . . .. The round (chance) nodes correspond to choices by the environment, namely, what evidence Et+i arrives. Notice that there are no chance nodes corresponding to the action outcomes; this is because the belief-state update for an action is deterministic regardless of the outcome. The belief state at each triangular node can be computed by applying a filtering algorithm to the sequence of percepts and actions leading to it. In this way, the algorithm takes into account the fact that, for decision At+i , the agent will have available percepts Et+1 , . . ., Et+i , even though at time t it does not know what those percepts will be. In this way, a decision-theoretic agent automatically takes into account the value of information and will execute information-gathering actions where appropriate.

Designing Online Agents for POMDPs

The belief state at each triangle is computed at time t by filtering with the actions/percepts leading to it; the decision At+i will use the percepts Et+1:t+i (even if their values are unknown at time t).
; a POMDP agent automatically takes into account the value of information and executes information gathering actions where appropriate.
A decision can be extracted from the search tree by backing up the utility values from the leaves, taking an average at the chance nodes and taking the maximum at the decision nodes.
Summary
Machine Learning
This part introduces the foundations of machine learning methods in AI. We discuss the prob-
lem learning from observations in general, study inference-based techniques, and then go into
elementary statistical methods for learning.
The current hype topics of deep learning, reinforcement learning, and large language models
are only very superficially covered, leaving them to specialized courses.
Chapter 26
Learning from Observations
In this chapter we introduce the concepts, methods, and limitations of inductive learning, i.e.
learning from a set of given examples.
Outline
Learning agents
Inductive learning
Neural Networks
Support Vector Machines
i.e., expose the agent to reality rather than trying to write it down
Learning modifies the agent’s decision mechanisms to improve performance.
Definition 26.1.4. Learning element may use knowledge already acquired in the
performance element.
Definition 26.1.5. Learning may require experimentation – actions an agent might not normally consider, such as dropping rocks from the Tower of Pisa.
Ways of Learning
Supervised learning: There’s an unknown function f : A → B called the target
function. We do know a set of pairs T := {⟨ai , f (ai )⟩} of examples. The goal is to
find a hypothesis h ∈ H ⊆ A → B based on T , that is “approximately” equal to f .
(Most of the techniques we will consider)
Unsupervised learning: Given a set of data A, find a pattern in the data; i.e. a
function f : A → B for some predetermined B. (Primarily
clustering /dimensionality reduction)
Reinforcement learning: The agent receives a reward for each action performed. The goal is to iteratively adapt the action function to maximize the total reward.
(Useful in e.g. game play)
a set of examples T ⊆ A × B called the training set, such that for every a ∈ A,
there is at most one b ∈ B with ⟨a, b⟩ ∈ T , (⇒ T is a function on some subset of
A)
We assume there is an unknown function f : A → B called the target function with
T ⊆ f.
Definition 26.2.2. Inductive learning algorithms solve inductive learning problems by
finding a hypothesis h ∈ H such that h ∼ f (for some notion of similarity).
Definition 26.2.3. We call a supervised learning problem with target function A → B
a classification problem if B is finite, and call the members of B classes.
We call it a regression problem if B = R.
Training Set
Linear Hypothesis
partially, approximatively
consistent
Quadratic Hypothesis
partially consistent
Degree-4 Hypothesis
consistent
High-degree Hypothesis
consistent
Intuition: This only works, if the training set is “representative” for the underlying
process.
Idea: We think of examples (seen and unseen) as a sequence, and express the
“representativeness” as a stationarity assumption for the probability distribution.
Attribute-based Representations
Definition 26.3.1. In attribute-based representations, examples are described by
attributes: (simple) functions on input samples, (think pre classifiers on
examples)
their value, and (classify by attributes)
classifications. (Boolean, discrete, continuous, etc.)
Example 26.3.2 (In a Restaurant). Situations where I will/won’t wait for a table:
Attributes Target
Example Alt Bar F ri Hun P at P rice Rain Res T ype Est WillWait
X1 T F F T Some $$$ F T French 0–10 T
X2 T F F T Full $ F F Thai 30–60 F
X3 F T F F Some $ F F Burger 0–10 T
X4 T F T T Full $ F F Thai 10–30 T
X5 T F T F Full $$$ F T French >60 F
X6 F T F T Some $$ T T Italian 0–10 T
X7 F T F F None $ T F Burger 0–10 F
X8 F F F T Some $$ T T Thai 0–10 T
X9 F T T F Full $ T F Burger >60 F
X 10 T T T T Full $$$ F T Italian 10–30 F
X 11 F F F F None $ F F Thai 0–10 F
X 12 T T T T Full $ F F Burger 30–60 T
Decision Trees
Decision trees are one possible representation for hypotheses.
Example 26.3.4 (Restaurant continued). Here is the “true” tree for deciding
whether to wait:
We evaluate the tree by going down the tree from the top, and always take the branch whose
attribute matches the situation; we will eventually end up with a Boolean value; the result. Using
the attribute values from X3 in ?? to descend through the tree in ?? we indeed end up with the
result “true”. Note that
Expressiveness
Decision trees can express any function of the input attributes ⇒ H = A1 ×. . .×An
Example 26.3.7. For Boolean functions, a path from the root to a leaf corresponds
to a row in a truth table:
Trivially, for any training set there is a consistent hypothesis with one path to a
leaf for each example, but it probably won’t generalize to new examples.
Solution: Prefer to find more compact decision trees.
Choosing an Attribute
Idea: A good attribute splits the examples into subsets that are (ideally) “all
positive” or “all negative”.
Example 26.3.9.
Attribute "Patrons?" is a better choice: it gives information about the classification.
Can we make this more formal? ; Use information theory! (up next)
Information Entropy
Intuition: Information answers questions – the less I know initially, the more information is contained in an answer.
Definition 26.4.1. Let ⟨p1 , . . ., pn ⟩ be the distribution of a random variable P . The information (also called entropy) of P is
I(⟨p1 , . . ., pn ⟩) := ∑_{i=1}^{n} −pi · log2 (pi )
Treating attributes also as random variables, we can compute how much information
is needed after knowing the value for one attribute:
Example 26.4.4. If we know Pat = Full, we only need I(P(WillWait|Pat = Full)) = I(⟨4/6, 2/6⟩) ≊ 0.9 bits of information.
Note: The expected number of bits needed after an attribute test on A is ∑_a P (A = a) · I(P(C|A = a))
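A minimal sketch of these two quantities (the entropy of a distribution and the expected remaining information after an attribute test); the helper names are illustrative, and attribute and classify are assumed accessor functions on examples.

from math import log2

def entropy(dist):
    """I(<p1,...,pn>) = sum_i -pi * log2(pi)."""
    return sum(-p * log2(p) for p in dist if p > 0)

def remaining_information(examples, attribute, classify):
    """Expected bits needed after testing `attribute`: sum_a P(A=a) * I(P(C|A=a))."""
    total = len(examples)
    rest = 0.0
    for a in set(attribute(e) for e in examples):
        subset = [e for e in examples if attribute(e) == a]
        classes = [classify(e) for e in subset]
        dist = [classes.count(c) / len(subset) for c in set(classes)]
        rest += len(subset) / total * entropy(dist)
    return rest

# e.g. entropy([4/6, 2/6]) is about 0.918 bits, as in Example 26.4.4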
Result: Substantially simpler than “true” tree – a more complex hypothesis isn’t
justified by small amount of data.
Performance measurement
Question: How do we know that h≊f ? (Hume’s Problem of Induction)
1. Use theorems of computational/statistical learning theory.
2. Try h on a new test set of examples. (use same distribution over example space
as training set)
Question: How big should the information gain be to split (; keep) a node?
Idea: Use a statistical significance test.
Definition 26.5.6. A result has statistical significance if the probability that it could arise from the null hypothesis (i.e. the assumption that there is no underlying pattern) is very low (usually 5%).
Compute the probability that the example distribution (p positive, n negative) for
a terminal node deviates from the expected distribution under the null hypothesis.
For an attribute A with d values, compare the actual numbers pk and nk in each
subset sk with the expected numbers (expected if A is irrelevant)
p̂k = p · (pk + nk )/(p + n)  and  n̂k = n · (pk + nk )/(p + n).
∆ = ∑_{k=1}^{d} ((pk − p̂k )²/p̂k + (nk − n̂k )²/n̂k )
Caveat: A low error rate on the training set does not mean that a hypothesis
generalizes well.
Idea: Do not use homework questions in the exam.
Definition 26.5.11. The practice of splitting the data available for learning into
1. a training set from which the learning algorithm produces a hypothesis h and
2. a test set, which is used for evaluating h
Model Selection
Definition 26.5.14. The model selection problem is to determine – given data –
a good hypothesis space.
Example 26.5.15. What is the best polynomial degree to fit the data
Concrete Problem: Find the “size” that best balances overfitting and underfitting
to optimize test set accuracy.
[Plot: training set error and validation set error (error rate) as a function of tree size.]
Stop when the training set error rate converges; choose the tree size that is optimal on the validation set.
Generalization Loss
Note: L(y, y) = 0. (no loss if you are exactly correct)
Empirical Loss
Regularization
Idea: Directly use empirical loss to solve model selection. (finding a good H)
Minimize the weighted sum of empirical loss and hypothesis complexity. (to avoid
overfitting).
Definition 26.5.25. Let λ ∈ R, h ∈ H, and E a set of examples, then we call
Remark: In regularization, empirical loss and hypothesis complexity are not mea-
sured in the same scale ; λ mediates between scales.
Idea: Measure both in the same scale ; use information content, i.e. in bits.
The minimum description length or MDL hypothesis minimizes the total number of
bits required.
This works well in the limit, but for smaller problems there is a difficulty in that the
choice of encoding for the program affects the outcome.
e.g., how best to encode a decision tree as a bit string?
In recent years there has been more emphasis on large-scale learning. (millions of
examples)
Generalization error is dominated by limits of computation
there is enough data and a rich enough model that we could find an h that
is very close to the true f ,
but the computation to find it is too complex, so we settle for a sub-optimal
approximation.
Hardware advances (GPU farms, Amazon EC2, Google Data Centers, . . . ) help.
PAC Learning
Basic idea of Computational Learning Theory:
Any hypothesis h that is seriously wrong will almost certainly be “found out”
with high probability after a small number of examples, because it will make an
incorrect prediction.
Thus, any h that is consistent with a sufficiently large set of training examples is unlikely to be seriously wrong.
; h is probably approximately correct.
Definition 26.6.1. Any learning algorithm that returns hypotheses that are prob-
ably approximately correct is called a PAC learning algorithm.
Derive performance bounds for PAC learning algorithms in general, using the
Stationarity Assumption (again): We assume that the set E of possible examples
is IID ; we have a fixed distribution P(E) = P(X, Y ) on examples.
PAC Learning
Start with PAC theorems for Boolean functions, for which L0/1 is appropriate.
Definition 26.6.2. The error rate error(h) of a hypothesis h is the probability that
Sample Complexity
Let's compute the probability that hb ∈ Hb is consistent with the first N examples.
We know error(hb ) > ϵ
; P (hb agrees with N examples) ≤ (1 − ϵ)^N. (independence)
; P (Hb contains a consistent hypothesis) ≤ |Hb | · (1 − ϵ)^N ≤ |H| · (1 − ϵ)^N. (Hb ⊆ H)
; to bound this by a small δ, it suffices to show the algorithm N ≥ (1/ϵ) · (log2 (1/δ) + log2 (|H|)) examples.
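For instance (an illustrative plug-in, not from the lecture): with ϵ = 0.1, δ = 0.05, and |H| = 2^10, the bound gives N ≥ 10 · (log2 (20) + 10) ≈ 144 examples.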
Definition 26.6.4. The number of required examples as a function of ϵ and δ is
called the sample complexity of H.
Example 26.6.5. If H is the set of n-ary Boolean functions, then |H| = 2^(2^n).
; sample complexity grows with O(log2 (2^(2^n))) = O(2^n).
There are 2^n possible examples,
; PAC learning for Boolean functions needs to see (nearly) all examples.
H contains enough hypotheses to classify any given set of examples in all possible
ways.
In particular, for any set of N examples, the set of hypotheses consistent with
those examples contains equal numbers of hypotheses that predict xN +1 to be
positive and hypotheses that predict xN +1 to be negative.
[Decision list: if Patrons(x, Some) then Yes; else if Patrons(x, Full) ∧ Fri/Sat(x) then Yes; else No.]
Lemma 26.6.8. Given arbitrary size conditions, decision lists can represent arbi-
trary Boolean functions.
Plug this into the equation for the sample complexity N ≥ (1/ϵ) · (log2 (1/δ) + log2 (|H|)) to obtain
N ≥ (1/ϵ) · (log2 (1/δ) + O(n^k · log2 (n^k)))
Intuitively: Any algorithm that returns a consistent decision list will PAC learn a
k−DL function in a reasonable number of examples, for small k.
[Plot: learning curves for decision tree and decision list learning as a function of training set size.]
Recall: A mapping f between vector spaces is called linear, iff it preserves plus
and scalar multiplication, i.e. f (α · v1 + v2 ) = α · f (v1 ) + f (v2 ).
Observation 26.7.2. A univariate, linear function f : R → R is of the form f (x) =
w1 x + w0 for some wi ∈ R.
Idea: Minimize squared error loss over {(xi ,yi ) | i ≤ N } (used already by Gauss)
Loss(hw ) = ∑_{j=1}^{N} L2 (yj , hw (xj )) = ∑_{j=1}^{N} (yj − hw (xj ))² = ∑_{j=1}^{N} (yj − (w1 xj + w0 ))²
Remark: Closed-form solutions only exist for linear regression, for other (dif-
ferentiable) hypothesis spaces use gradient descent methods for adjusting/learning
weights.
Definition 26.7.6. The weight space of a parametric model is the space of all
possible combinations of parameters (called the weights). Loss minimization in a
weight space is called weight fitting.
[Plot: the squared error loss as a function of the weights (w0 , w1 ).] Note: it is convex.
Observation 26.7.7. The squared error loss function is convex for any linear
regression problem ; there are no local minima.
The parameter α is called the learning rate. It can be a fixed constant or it can
decay as learning proceeds.
These updates constitute the batch gradient descent learning rule for univariate
linear regression.
Convergence to the unique global loss minimum is guaranteed (as long as we pick
α small enough) but may be very slow.
Doing batch gradient descent on random subsets of the examples of fixed batch
size n is called stochastic gradient descent (SGD). (More computationally efficient
than updating for every example)
Gradient descent will reach the (unique) minimum of the loss function; the update equation for each weight wi is
wi ←− wi + α · (∑_j xj,i · (yj − hw (⃗xj )))
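A minimal sketch of this batch update for univariate linear regression; the learning rate, iteration count, and data are illustrative.

def batch_gradient_descent(xs, ys, alpha=0.01, steps=1000):
    """Fit h_w(x) = w1*x + w0 by repeatedly applying the batch update rule above."""
    w0, w1 = 0.0, 0.0
    for _ in range(steps):
        errors = [y - (w1 * x + w0) for x, y in zip(xs, ys)]
        w0 += alpha * sum(errors)                        # x_{j,0} = 1 (bias input)
        w1 += alpha * sum(e * x for e, x in zip(errors, xs))
        # i.e. w_i <- w_i + alpha * sum_j x_{j,i} * (y_j - h_w(x_j))
    return w0, w1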
[Plot: the data as points (x1 , x2 ): earthquakes (white), underground explosions (black). Also: hw∗ as a decision boundary x2 = 1.7·x1 − 4.9.]
wi ←− wi + α · (y − hw (x)) · xi
Logistic Regression
For an example (x,y) we compute the partial derivatives: (via chain rule)
∂/∂wi L2 (w) = ∂/∂wi ((y − hw (x))²)
= 2 · (y − hw (x)) · ∂/∂wi (y − hw (x))
= −2 · (y − hw (x)) · l′ (w·x) · ∂/∂wi (w·x)
= −2 · (y − hw (x)) · l′ (w·x) · xi
The derivative of the logistic function satisfies l′ (z) = l(z)(1 − l(z)), thus
Definition 26.7.21. The rule for logistic update (weight update for minimizing the
loss) is
wi ←− wi + α · (y − hw (x)) · hw (x) · (1 − hw (x)) · xi
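A minimal sketch of this update rule for a single example; the names are illustrative, and x[0] = 1 serves as the bias input.

from math import exp

def logistic(z):
    return 1.0 / (1.0 + exp(-z))

def logistic_update(w, x, y, alpha):
    """w_i <- w_i + alpha * (y - h_w(x)) * h_w(x) * (1 - h_w(x)) * x_i."""
    h = logistic(sum(wi * xi for wi, xi in zip(w, x)))
    return [wi + alpha * (y - h) * h * (1 - h) * xi for wi, xi in zip(w, x)]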
The goal is to find a hyperplane in Rp that maximally separates the two classes
(i.e. y i = −1 from y i = 1)
Remember A hyperplane can be represented as the set {x | (w·x) + b = 0} for some
vector w and scalar b. (w is orthogonal to the plane, b determines the offset from
the origin)
Theorem 26.8.4 (SVM equation). Let α = argmax_α (∑_j αj − ½ · ∑_{j,k} αj αk y^j y^k (x^j ·x^k ))
under the constraints αj ≥ 0 and ∑_j αj y^j = 0.
The maximum margin separator is given by w = ∑_j αj x^j and b = w·x^i − y^i for any x^i where αi ̸= 0.
Proof sketch: By the duality principle for optimization problems
Important Properties:
The weights αj associated with each data point are zero except at the support vectors.
[Figure: (a) a two-dimensional data set in (x1 , x2 ) coordinates; (b) the same data in the three-dimensional feature space (x1², x2², √2·x1·x2), where it becomes linearly separable.]
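A minimal sketch of the feature map suggested by the axis labels in the figure (an assumption on our part): mapping (x1, x2) to (x1², x2², √2·x1·x2) makes circularly separated data linearly separable, and dot products in that space equal the quadratic kernel (x·x′)² in input space.

from math import sqrt

def feature_map(x1, x2):
    """Map (x1, x2) to (x1^2, x2^2, sqrt(2)*x1*x2); then phi(x).phi(x') = (x.x')^2."""
    return (x1 * x1, x2 * x2, sqrt(2) * x1 * x2)

# a point inside the unit circle and one outside become separable by the plane z1 + z2 = 1
print(feature_map(0.3, 0.4), feature_map(1.0, 1.2))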
Neural networks
Perceptrons
Multilayer perceptrons
Applications of neural networks
Brains
Axiom 26.9.1 (Neuroscience Hypothesis). Mental activity consists primarily of electrochemical activity in networks of brain cells called neurons.
One approach to Artificial Intelligence is to model and simulate brains. (and hope
that AI comes along naturally)
Definition 26.9.3. The AI subfield of neural networks (also called connectionism,
parallel distributed processing, and neural computation) studies computing systems
inspired by the biological neural networks that constitute brains.
Neural networks are attractive computational devices, since they perform important
AI tasks – most importantly learning and distributed, noise-tolerant computation –
naturally and efficiently.
[Figure: a unit computes inj = ∑_i wi,j · ai (including a bias weight w0,j with fixed input a0 = 1) and outputs aj = g(inj ); units with suitable weights and thresholds implement the Boolean functions AND, OR, and NOT.]
Recurrent neural networks follow largely the same principles as feed-forward networks,
so we will not go into details here.
Single-layer Perceptrons
Definition 26.9.10. A perceptron network is a feed-forward network of perceptron
units. A single layer perceptron network is called a perceptron.
Example 26.9.11. [Figure: a single layer perceptron network – an input layer connected by weights wi,j directly to an output layer – and the output of a two-input unit with logistic activation, plotted as a function of its two inputs.]
a5 = g(w3,5 · a3 + w4,5 · a4 )
= g(w3,5 · g(w1,3 · a1 + w2,3 a2 ) + w4,5 · g(w1,4 · a1 + w2,4 a2 ))
Expressiveness of Perceptrons
Consider a perceptron with g = step function (Rosenblatt, 1957, 1960)
Can represent AND, OR, NOT, majority, etc., but not XOR (and thus no adders)
Represents a linear separator in input space:
∑_j wj · xj > 0 or W·x > 0
[Figure: the input space over two Boolean inputs – (a) x1 and x2 and (b) x1 or x2 are linearly separable, (c) x1 xor x2 is not.]
Minsky & Papert (1969) pricked the first neural network balloon!
Perceptron Learning
For learning, we update the weights using gradient descent based on the generaliza-
tion loss function.
Let e.g. L(w) = (y − hw (x))2 (the squared error loss).
We compute the gradient:
∂L(w)/∂wj,k = 2 · (yk − hw (x)k ) · ∂(yk − hw (x)k )/∂wj,k = 2 · (yk − hw (x)k ) · ∂/∂wj,k (yk − g(∑_{j=0}^{n} wj,k · xj ))
Multilayer perceptrons
Definition 26.9.13. In multilayer perceptrons (MLPs), layers are usually fully connected; the numbers of hidden units are typically chosen by hand.
Output Layer ai
wi,j
Hidden Layer aj
wi,j
Input Layer ak
Definition 26.9.14. Some MLPs have residual connections, i.e. connections that
skip layers.
Expressiveness of MLPs
All continuous functions w/ 2 layers, all functions w/ 3 layers.
∂L(w)k /∂wi,j = −2 · (yk − hw (x)k ) · g′ (ink ) · ∂ink /∂wi,j   (as before; abbreviate ∆k := (yk − hw (x)k ) · g′ (ink ))
= −2 · ∆k · ∂(∑_ℓ wℓ,k aℓ )/∂wi,j = −2 · ∆k · wj,k · ∂aj /∂wi,j = −2 · ∆k · wj,k · ∂g(inj )/∂wi,j
= −2 · ∆k · wj,k · g′ (inj ) · ai   (abbreviate ∆j,k := ∆k · wj,k · g′ (inj ))
Idea: The total “error” of the hidden node j is the sum of all the connected nodes k
in the next layer
Definition 26.9.15. The back-propagation rule for hidden nodes of a multilayer perceptron is
∆j ← g′ (inj ) · (∑_i wj,i ∆i )
and the update rule for weights in a hidden layer is
wk,j ← wk,j + α · ak · ∆j
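To illustrate the ∆-rules just derived, here is a minimal sketch of one training step for a network with a single hidden layer and logistic activations; the array shapes and names are illustrative.

import numpy as np

def g(z):                                  # logistic activation
    return 1.0 / (1.0 + np.exp(-z))

def backprop_step(x, y, W_in, W_out, alpha):
    """One forward/backward pass: W_in maps inputs to hidden, W_out hidden to outputs."""
    in_hidden = W_in @ x;         a_hidden = g(in_hidden)
    in_out    = W_out @ a_hidden; a_out    = g(in_out)
    delta_out    = (y - a_out) * a_out * (1 - a_out)                   # Delta_k = (y_k - a_k) g'(in_k)
    delta_hidden = a_hidden * (1 - a_hidden) * (W_out.T @ delta_out)   # Delta_j = g'(in_j) sum_k w_{j,k} Delta_k
    W_out += alpha * np.outer(delta_out, a_hidden)                     # w_{j,k} += alpha * a_j * Delta_k
    W_in  += alpha * np.outer(delta_hidden, x)                         # w_{i,j} += alpha * a_i * Delta_j
    return W_in, W_out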
Back-Propagation – Properties
Sum gradient updates for all examples in some “batch” and apply gradient descent.
[Plot: prediction accuracy as a function of training set size.]
Experience shows: MLPs are quite good for complex pattern recognition tasks,
but resulting hypotheses cannot be understood easily.
This makes MLPs ineligible for some tasks, such as credit card and loan approvals,
where law requires clear unbiased criteria.
Summary
neural networks can be extremely powerful (hypothesis space intractably complex)
Perceptrons (one-layer networks) insufficiently expressive for most applications
Engineering, cognitive modelling, and neural system modelling subfields have largely
diverged
Drawbacks: take long to converge, require large amounts of data, and are difficult
to interpret (Why is the output what it is?)
For supervised learning, the aim is to find a simple hypothesis that is approximately
consistent with training examples
Decision tree learning using information gain.
Learning performance = prediction accuracy measured on test set
Statistical Learning
What kind of bag is it? What flavour will the next candy be?
Note: Every hypothesis is itself a probability distribution over the random variable
“flavour”.
[Plot: the posterior probabilities P(h1 | d), . . ., P(h5 | d) as a function of the number of observations in d.]
if the observations are IID, i.e. P (d|hi ) = ∏_j P (dj |hi ), and the hypothesis prior is as advertised. (e.g. P (d|h3 ) = 0.5^10 ≈ 0.1%)
The posterior probabilities start with the hypothesis priors, change with data.
[Plot: the Bayesian prediction probability that the next candy is lime, as a function of the number of observations in d.]
; we compute the expected value of the probability of the next candy being lime
over all hypotheses (i.e. distributions).
; “meta-distribution”
where P (d|hi ) is called the likelihood (of the data under each hypothesis) and
P (hi ) the hypothesis prior.
Bayesian predictions use a likelihood-weighted average over the hypotheses:
P(X|d) = ∑_i P(X|d, hi ) · P (hi |d) = ∑_i P(X|hi ) · P (hi |d)
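As an illustration, a minimal sketch of this computation for the candy example; the hypothesis priors and lime proportions below are the commonly used AIMA values and are assumed here, since they are not restated at this point.

priors = [0.1, 0.2, 0.4, 0.2, 0.1]       # hypothesis prior P(h_i) (assumed)
lime   = [0.0, 0.25, 0.5, 0.75, 1.0]     # P(next candy = lime | h_i) (assumed)

def posterior(num_limes):
    """P(h_i | d) after observing num_limes lime candies in a row (IID likelihood)."""
    unnorm = [p * (l ** num_limes) for p, l in zip(priors, lime)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

def predict_lime(num_limes):
    """Bayesian prediction P(next = lime | d) = sum_i P(lime | h_i) * P(h_i | d)."""
    return sum(l * p for l, p in zip(lime, posterior(num_limes)))

print(posterior(3), predict_lime(3))     # after three limes the prediction is about 0.8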
Definition 27.2.1. For maximum a posteriori learning (MAP learning) choose the
MAP hypothesis hMAP that maximizes P (hi |d).
I.e., maximize P (d|hi ) · P (hi ) or (even better) log2 (P (d|hi )) + log2 (P (hi )).
Predictions made according to a MAP hypothesis hMAP are approximately Bayesian
to the extent that P(X|d) ≈ P(X|hMAP ).
Example 27.2.2. In our candy example, hMAP = h5 after three limes in a row
a MAP learner then predicts that candy 4 is lime with probability 1.
compare with Bayesian prediction of 0.8. (see prediction curves above)
As more data arrive, the MAP and Bayesian predictions become closer, because the
competitors to the MAP hypothesis become less and less probable.
For deterministic hypotheses, P (d|hi ) is 1 if consistent, 0 otherwise
; MAP = simplest consistent hypothesis. (cf. science)
Remark: Finding MAP hypotheses is often much easier than Bayesian learning,
because it requires solving an optimization problem instead of a large summation
(or integration) problem.
Indeed if hypothesis predicts the data exactly – e.g. h5 in candy example – then
log2 (1) = 0 ; preferred hypothesis.
This is more directly modeled by the following approximation to Bayesian learning:
Observation: For large data sets, the prior becomes irrelevant. (we might not
trust it anyways)
Idea: Use this to simplify learning.
Definition 27.2.4. Maximum likelihood learning (ML learning): choose the ML
hypothesis hML maximizing P (d|hi ). (simply get the best fit to the data)
Remark: ML learning = b MAP learning for a uniform prior. (reasonable if all
hypotheses are of the same complexity)
ML learning is the “standard” (non Bayesian) statistical learning method.
[Bayes net: a single node Flavor with parameter P (F = cherry) = θ.]
These are IID observations, so the likelihood is
P (d|hθ ) = ∏_{j=1}^{N} P (dj |hθ ) = θ^c · (1 − θ)^ℓ
The log likelihood is
L(d|hθ ) = log2 (P (d|hθ )) = ∑_{j=1}^{N} log2 (P (dj |hθ )) = c·log2 (θ) + ℓ·log2 (1 − θ)
1. Write down an expression for the likelihood of the data as a function of the
parameter(s).
2. Write down the derivative of the log likelihood with respect to each parameter.
3. Find the parameter values such that the derivatives are zero
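Carrying out these three steps for the candy likelihood above gives (a short worked computation):

∂L(d|hθ )/∂θ = c/(θ·ln 2) − ℓ/((1 − θ)·ln 2) = 0   ⟹   c·(1 − θ) = ℓ·θ   ⟹   θ = c/(c + ℓ)

i.e. the maximum likelihood hypothesis sets θ to the observed fraction of cherry candies.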
[Bayes net: Flavor with parameter P (F = cherry) = θ, and a child Wrapper with P (W = red | F = cherry) = θ1 and P (W = red | F = lime) = θ2 .]
[Figure: a linear Gaussian model – data points (x, y) together with the conditional density P(y|x).]
Maximum likelihood learning assumes uniform prior, OK for large data sets:
1. Choose a parameterized family of models to describe the data.
; requires substantial insight and sometimes new models.
2. Write down the likelihood of the data as a function of the parameters.
; may require summing over hidden variables, i.e., inference.
3. Write down the derivative of the log likelihood w.r.t. each parameter.
4. Find the parameter values such that the derivatives are zero.
; may be hard/impossible; modern optimization techniques help.
Naive Bayes models as a fall-back solution for machine learning:
Reinforcement Learning
Unsupervised Learning
So far: We have studied “learning from examples”. (functions, logical theories,
probability models)
Now: How can agents learn “what to do” in the absence of labeled examples of
“what to do”. We call this problem unsupervised learning.
Example 28.1.1 (Playing Chess). Learn transition models for own moves and
maybe predict opponent’s moves.
Problem: The agent needs to have some feedback about what is good/bad
; cannot decide “what to do” otherwise. (recall: external performance standard
for learning agents)
Example 28.1.2. The ultimate feedback in chess is whether you win, lose, or draw.
Definition 28.1.3. We call a learning situation where there are no labeled examples
unsupervised learning and the feedback involved a reward or reinforcement.
Example 28.1.4. In soccer, there are intermediate reinforcements in the shape of
goals, penalties, . . .
In MDPs, the agent has total knowledge about the environment and the reward
function, in reinforcement learning we do not assume this. (;
POMDPs+reward-learning)
Example 28.1.6. You play a game without knowing the rules, and at some time
the opponent shouts you lose!
Passive Learning
Definition 28.2.1 (To keep things simple). Agent uses a state-based represen-
tation in a fully observable environment:
In passive learning, the agent’s policy π is fixed: in state s, it always executes
the action π(s).
Its goal is simply to learn how good the policy is – that is, to learn the utility
function U π (s).
The passive learning task is similar to the policy evaluation task (part of the policy
iteration algorithm) but the agent does not know
the transition model P (s′ |s, a), which specifies the probability of reaching state
s′ from state s after doing action a,
the reward function R(s), which specifies the reward for each state.
Remember that π∗_s is a policy, so it recommends an action for every state; its connection with s in particular is that it's an optimal policy when s is the starting state. A remarkable consequence of using discounted utilities with infinite horizons is that the optimal policy is independent of the starting state. (Of course, the action sequence won't be independent; remember that a policy is a function specifying an action for each state.) This fact seems intuitively obvious: if policy π∗_a is optimal starting in a and policy π∗_b is optimal starting in b, then, when they reach a third state c, there's no good reason for them to disagree with each other, or with π∗_c, about what to do next. So we can simply write π∗ for an optimal policy.
Given this definition, the true utility of a state is just U^{π∗}(s) – that is, the expected sum of discounted rewards if the agent executes an optimal policy. We write this as U (s), matching the notation used in Chapter 16 for the utility of an outcome. Notice that U (s) and R(s) are quite different quantities; R(s) is the "short term" reward for being in s, whereas U (s) is the "long term" total reward from s onward. Figure 17.3 shows the utilities for the 4 × 3 world. Notice that the utilities are higher for states closer to the +1 exit, because fewer steps are required to reach the exit.

Passive Learning by Example

Example 28.2.2 (Passive Learning). We use the 4 × 3 world introduced above.
[Figure: (a) an optimal policy π for the 4 × 3 world; (b) the utilities of the states given π (e.g. 0.812, 0.868, 0.918 in the top row), calculated with γ = 1 and R(s) = −0.04 for nonterminal states.]
The agent executes a set of trials in the environment using its policy π. In each trial, the agent starts in state (1,1) and experiences a sequence of state transitions until it reaches one of the terminal states, (4,2) or (4,3). Its percepts supply both the current state and the reward received in that state.

Example 28.2.3. Typical trials might look like this:
1. (1,1)−0.04 ⇝ (1,2)−0.04 ⇝ (1,3)−0.04 ⇝ (1,2)−0.04 ⇝ (1,3)−0.04 ⇝ (2,3)−0.04 ⇝ (3,3)−0.04 ⇝ (4,3)+1
2. (1,1)−0.04 ⇝ (1,2)−0.04 ⇝ (1,3)−0.04 ⇝ (2,3)−0.04 ⇝ (3,3)−0.04 ⇝ (3,2)−0.04 ⇝ (3,3)−0.04 ⇝ (4,3)+1
3. (1,1)−0.04 ⇝ (2,1)−0.04 ⇝ (3,1)−0.04 ⇝ (3,2)−0.04 ⇝ (4,2)−1 .

Definition 28.2.4. The utility is defined to be the expected sum of (discounted) rewards obtained if policy π is followed:

U^π(s) := E[∑_{t=0}^{∞} γ^t R(S_t)]

where R(s) is the reward for a state, S_t (a random variable) is the state reached at time t when executing policy π, and S_0 = s. (for 4 × 3 we take the discount factor γ = 1)
Idea: Each trial provides a sample of the reward to go for each state visited.
Example 28.2.6. The first trial in ?? provides a sample total reward of 0.72 for
state (1,1), two samples of 0.76 and 0.84 for (1,2), two samples of 0.80 and 0.88
for (1,3), . . .
Definition 28.2.7. The direct utility estimation algorithm cycles over trials, cal-
culates the reward to go for each state, and updates the estimated utility for that
state by keeping the running average for that for each state in a table.
Observation 28.2.8. In the limit, the sample average will converge to the true
expectation (utility) from ??.
Remark 28.2.9. Direct utility estimation is just supervised learning, where each
example has the state as input and the observed reward to go as output.
Upshot: We have reduced reinforcement learning to an inductive learning problem.
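A minimal sketch of direct utility estimation (Definition 28.2.7) over a set of recorded trials; each trial is assumed to be a list of (state, reward) pairs as in Example 28.2.3, and all names are illustrative.

def direct_utility_estimation(trials, gamma=1.0):
    """Average the observed reward to go per state over all trials."""
    totals, counts = {}, {}
    for trial in trials:
        rewards = [r for (_, r) in trial]
        for i, (s, _) in enumerate(trial):
            # reward to go from position i: discounted sum of the remaining rewards
            togo = sum(gamma ** k * r for k, r in enumerate(rewards[i:]))
            totals[s] = totals.get(s, 0.0) + togo
            counts[s] = counts.get(s, 0) + 1
    return {s: totals[s] / counts[s] for s in totals}

# e.g. the first trial in Example 28.2.3 yields the sample 0.72 for state (1,1)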
The utility of each state equals its own reward plus the expected utility of its
successor states.
So: The utility values obey a Bellman equation for a fixed policy π.
U^π(s) = R(s) + γ · (∑_{s′} P (s′ |s, π(s)) · U^π(s′ ))
But direct utility estimation learns nothing until the end of the trial.
Intuition: Direct utility estimation searches for U in a hypothesis space that is too large ⇝ it contains many functions that violate the Bellman equations.
As above: These equations are linear (no maximization involved), so they can be solved with any linear algebra package.
Observation 28.2.12. Learning the model itself is easy, because the environment
is fully observable.
U := POLICY−EVALUATION(π,mdp)
if s′ .TERMINAL? then s, a := null else s, a := s′ , π[s′ ]
return a
POLICY−EVALUATION computes U^π(s) := E[∑_{t=0}^{∞} γ^t R(s_t)] in an MDP.
Note the large changes occurring around the 78th trial – this is the first time that
the agent falls into the -1 terminal state at (4,2).
Observation 28.2.17. The ADP agent is limited only by its ability to learn the
transition model. (intractable for large state spaces)
The agent follows the optimal policy for the learned model at each step.
It does not learn the true utilities or the true optimal policy!
instead, in the 39th trial, it finds a policy that reaches the +1 reward along the
lower route via (2,1), (3,1), (3,2), and (3,3).
After experimenting with minor variations, from the 276th trial onward it sticks
to that policy, never learning the utilities of the other states and never finding
the optimal route via (1,2), (1,3), and (2,3).
Idea: Actions do more than provide rewards according to the learned model
they also contribute to learning the true model by affecting the percepts received.
By improving the model, the agent may reap greater rewards in the future.
Pure exploitation risks getting stuck in a rut. Pure exploration to improve one’s
knowledge is of no use if one never puts that knowledge into practice.
Compare with the information gathering agent from ??.
Knowledge in Learning
The classification is given by the goal predicate WillWait, in this case WillWait(X 1 )
or ¬WillWait(X 1 ).
can be represented as
Method: Construct a disjunction of all the paths from the root to the positive
leaves interpreted as conjunctions of the attributes on the path.
Note: The equivalence takes care of positive and negative examples.
Cumulative Development
Example 29.1.6. Learning from very few examples using background knowledge:
1. Caveman Zog and the fish on a stick:
[Diagram: Prior Knowledge and Observations feed into logic-based inductive learning, which produces Hypotheses and Predictions.]
Example 29.2.1. Inferring disease D from the symptoms is not enough to explain
the prescription of medicine M .
Need a new general rule: M is effective against D (induction from example)
Definition 29.2.2. Knowledge based inductive learning (KBIL) replaces the expla-
nation constraint by the KBIL constraint:
Offers complete algorithms for inducing general, first-order theories from examples.
29.2.1 An Example
A Video Nugget covering this subsection can be found at https://fau.tv/clip/id/30396.
ILP: An example
General knowledge-based induction problem
[Figure: family tree (George, Mum, and relatives) used in the example.]
Example
Descriptions include facts like
Background knowledge
Observation: A little bit of background knowledge helps a lot.
Example 29.2.6. If the background knowledge contains
FOIL
function Foil(examples,target) returns a set of Horn clauses
inputs: examples, set of examples
target, a literal for the goal predicate
local variables: clauses, set of clauses, initially empty
while examples contains positive examples do
clause := New−Clause(examples,target)
remove examples covered by clause from examples
add clause to clauses
return clauses
FOIL
function New−Clause(examples,target) returns a Horn clause
local variables: clause, a clause with target as head and an empty body
l, a literal to be added to the clause
extendedExamples, a set of examples with values for new variables
extendedExamples := examples
while extendedExamples contains negative examples do
l := Choose−Literal(New−Literals(clause),extendedExamples)
append l to the body of clause
extendedExamples := map Extend−Example over extendedExamples
return clause
function Extend−Example(example,literal) returns a new example
if example satisfies literal
then return the set of examples created by extending example with each
possible constant value for each new variable in literal
else return the empty set
function New−Literals(clause) returns a set of possibly ‘‘useful’’ literals
function Choose−Literal(literals) returns the ‘‘best’’ literal from literals
father(x, z) ⇒ grandfather(x, y)
Add literals using predicates
Negated or unnegated
Use any existing predicate (including the goal)
Arguments must be variables
Each literal must include at least one variable from an earlier literal or from the
head of the clause
Valid: Mother(z, u), Married(z, z), grandfather(v, x)
Invalid: Married(u, v)
Inverse Resolution
Definition 29.2.9. Inverse resolution in a nutshell
Classifications follow from Background ∧ Hypothesis ∧ Descriptions.
This can be proven by resolution.
Run the proof backwards to find hypothesis.
Problem: How to run the resolution proof backwards?
Recap: In ordinary resolution we take two clauses C1 = L ∨ R1 and C2 = ¬L ∨ R2
and resolve them to produce the resolvent C = R1 ∨ R2 .
[Figure: an inverse resolution proof tree with the substitutions [George/x], [Elisabeth/z], and [Anne/y], ending in the empty clause {}.]
Can inverse resolution infer the law of gravity from examples of falling bodies?
Yes, given suitable background mathematics!
Monkey and typewriter problem: How to overcome the large branching factor and
the lack of structure in the search space?
Applications of ILP
ILP systems have outperformed knowledge-free methods in a number of domains.
Molecular biology: the GOLEM system has been able to generate high-quality
predictions of protein structures and the therapeutic efficacy of various drugs.
Natural Language
In other words: the language you use all day long, e.g. English, German, . . .
Why should we care about natural language?
Even more so than thinking, language is a skill that only humans have.
It is a miracle that we can express complex thoughts in a sentence in a matter
of seconds.
It is no less miraculous that a child can learn tens of thousands of words and
complex syntax in a matter of a few years.
Language Technology
Language Assistance:
written language: Spell/grammar/style-checking,
spoken language: dictation systems and screen readers,
multilingual text: machine-supported text and dialog translation, eLearning.
Information management:
Psychology/Cognition: Semantics ≙ “what is in our brains” (⇝ mental models)
Mathematics has driven much of modern logic in the quest for foundations.
Logic as “foundation of mathematics” solved as far as possible
In daily practice syntax and semantics are not differentiated (much).
A good probe into the issues involved in natural language understanding is to look at translations
between natural language utterances – a task that arguably involves understanding the utterances
first.
Example 30.2.2. Wirf der Kuh das Heu über den Zaun. ̸⇝ Throw the cow the
hay over the fence. (differing grammar; Google Translate)
Example 30.2.3. Grammar is not the only problem:
Der Geist ist willig, aber das Fleisch ist schwach! (The spirit is willing, but the flesh is weak!)
Der Schnaps ist gut, aber der Braten ist verkocht! (The schnapps is good, but the roast is overcooked!)
Observation 30.2.4. We have to understand the meaning for high-quality trans-
lation!
If it is indeed the meaning of natural language that we are after, we should look further into how the form of the
utterances and their meaning interact.
For questions/answers, it would be very useful to find out what words (sentences/texts)
mean.
Definition 30.2.6. Interpretation of natural language utterances: three problems
[Diagram: the three problems in interpreting natural language utterances.]
Let us support the last claim with a couple of initial examples. We will come back to these phenomena
again and again over the course of the lecture and study them in detail.
But there are other phenomena that we need to take into account when computing the meaning of
NL utterances.
[Diagram: Utterance → (Grammar, Lexicon) → meaning of utterance → (Inference, World knowledge) → relevant information of utterance.]
We will look at another example that shows that the situation with semantic/pragmatic analysis
is even more complex than we thought. Understanding this is one of the prime objectives of the
AI-2 lecture.
[Diagram: Utterance → (Grammar, Lexicon) → utterance-specific semantic potential → (Inference, World knowledge) → relevant information/meaning of utterance.]
?? is also a very good example for the claim ?? that even for high-quality (machine) translation
we need semantics.
Logical analysis vs. conceptual analysis: These examples – mostly borrowed from Davidson
[Dav67] – help us to see the difference between “logical analysis” and “conceptual analysis”.
We observed that from This is a big diamond. we cannot conclude This is big. Now consider the
sentence Jane is a beautiful dancer. Similarly, it does not follow from this that Jane is beautiful,
but only that she dances beautifully. Now, what it is to be beautiful or to be a beautiful dancer
is a complicated matter. To say what these things are is a problem of conceptual analysis. The
job of semantics is to uncover the logical form of these sentences. Semantics should tell us that
the two sentences have the same logical forms; and ensure that these logical forms make the right
predictions about the entailments and truth conditions of the sentences, specifically, that they
don’t entail that the object is big or that Jane is beautiful. But our semantics should provide a
distinct logical form for sentences of the type: This is a fake diamond. From which it follows that
the thing is fake, but not that it is a diamond.
One way to think about the examples of ambiguity on the previous slide is that they illustrate a
certain kind of indeterminacy in sentence meaning. But really what is indeterminate here is what
sentence is represented by the physical realization (the written sentence or the phonetic string).
The symbol duck just happens to be associated with two different things, the noun and the verb.
Figuring out how to interpret the sentence is a matter of deciding which item to select. Similarly
for the syntactic ambiguity represented by PP attachment. Once you, as interpreter, have selected
one of the options, the interpretation is actually fixed. (This doesn’t mean, by the way, that as
an interpreter you necessarily do select a particular one of the options, just that you can.) A
brief digression: Notice that this discussion is in part a discussion about compositionality, and
gives us an idea of what a non-compositional account of meaning could look like. The Radical
Pragmatic View is a non-compositional view: it allows the information content of a sentence to
be fixed by something that has no linguistic reflex.
To help clarify what is meant by compositionality, let me just mention a couple of other ways
in which a semantic account could fail to be compositional.
• Suppose your syntactic theory tells you that S has the structure [a[bc]] but your semantics
computes the meaning of S by first combining the meanings of a and b and then combining the
result with the meaning of c. This is non-compositional.
• Recall the difference between:
1. Jane knows that George was late.
2. Jane believes that George was late.
Sentence 1. entails that George was late; sentence 2. doesn’t. We might try to account for
this by saying that in the environment of the verb believe, a clause doesn’t mean what it
usually means, but something else instead. Then the clause that George was late is assumed
to contribute different things to the informational content of different sentences. This is a
non-compositional account.
Example 30.3.4. Every man loves a woman. (Keira Knightley or his mother!)
Example 30.3.5. Every car has a radio. (only one reading!)
Example 30.3.6. Some student in every course sleeps in every class at least
some of the time. (how many readings?)
Example 30.3.7. The president of the US is having an affair with an intern.
(2002 or 2000?)
Example 30.3.8. Everyone is here. (who is everyone?)
Observation: If we look at the first sentence, then we see that it has two readings:
1. there is one woman who is loved by every man.
2. for each man there is one woman whom that man loves.
These correspond to distinct situations (or possible worlds) that make the sentence true.
Observation: For the second example we only get one reading: the analogue of 2. The reason
for this lies not in the logical structure of the sentence, but in concepts involved. We interpret
the meaning of the word has as the relation “has as physical part”, which in our world carries a
certain uniqueness condition: If a is a physical part of b, then it cannot be a physical part of c,
unless b is a physical part of c or vice versa. This makes the structurally possible analogue to 1.
impossible in our world and we discard it.
Observation: In the examples above, we have seen that (in the worst case) we can have one
reading for every ordering of the quantificational phrases in the sentence. So in the third example,
where we have four of them, we would get 4! = 24 readings. It should be clear from introspection that
we (humans) do not entertain 24 readings when we understand and process this sentence. Our
models should account for such effects as well.
Context and Interpretation: It appears that the last two sentences have different informational
content on different occasions of use. Suppose I say Everyone is here. at the beginning of class.
Then I mean that everyone who is meant to be in the class is here. Suppose I say it later in the
day at a meeting; then I mean that everyone who is meant to be at the meeting is here. What
shall we say about this? Here are three different kinds of solution:
Radical Semantic View On every occasion of use, the sentence literally means that everyone
in the world is here, and so is strictly speaking false. An interpreter recognizes that the speaker
has said something false, and uses general principles to figure out what the speaker actually
meant.
Radical Pragmatic View What the semantics provides is in some sense incomplete. What the
sentence means is determined in part by the context of utterance and the speaker’s intentions.
The differences in meaning are entirely due to extra-linguistic facts which have no linguistic
reflex.
The Intermediate View The logical form of sentences with the quantifier every contains a slot
for information which is contributed by the context. So extra-linguistic information is required
to fix the meaning; but the contribution of this information is mediated by linguistic form.
We now come to a phenomenon of natural language that is a paradigmatic challenge for pragmatic
analysis: anaphora – the practice of replacing a (complex) reference with a mere pronoun.
Anaphora challenge pragmatic analysis, since they can only be resolved from the
context using world knowledge.
Anaphora are also interesting for pragmatic analysis, since they introduce (often initially massive
amounts of) ambiguity that needs to be taken care of in the language understanding process.
We now come to another challenge to pragmatic analysis: presuppositions. Instead of just being
subject to the context of the readers/hearers like anaphora, they even have the potential to change
the context itself or even affect their world knowledge.
Remark 30.4.2. Natural languages like English, German, or Spanish are not.
Example 30.4.3. Let us look at concrete examples
Not to be invited is sad! (definitely English)
To not be invited is sad! (controversial)
Definition 30.4.5. A text corpus (or simply corpus; plural corpora) is a large and
structured collection of natural language texts called documents.
Definition 30.4.6. In corpus linguistics, corpora are used to do statistical analysis
and hypothesis testing, checking occurrences or validating linguistic rules within a
specific natural language.
Thus, a trigram model for a language with 100 characters, P(c_i | c_{i−2:i−1}), has
100^3 = 1,000,000 entries. It can be estimated from a corpus with 10^7 characters.
ℓ* = argmax_ℓ P(ℓ | c_{1:N})
   = argmax_ℓ P(ℓ) · P(c_{1:N} | ℓ)
   = argmax_ℓ P(ℓ) · ∏_{i=1}^{N} P(c_i | c_{i−2:i−1}, ℓ)
The prior probability P(ℓ) can be estimated; it is not a critical factor, since the
trigram language models are extremely sensitive.
Remark 30.4.13. While many features help make this classification, counts of
punctuation and other character n-gram features go a long way [KNS97].
Definition 30.4.14. Named entity recognition (NER) is the task of finding names
of things in a document and deciding what class they belong to.
Example 30.4.15. In Mr. Sopersteen was prescribed aciphex. NER should
recognize that Mr. Sopersteen is the name of a person and aciphex is the name of
a drug.
Remark 30.4.16. Character-level language models are good for this task because
they can associate the character sequence ex with a drug name and steen with a
person name, and thereby identify words that they have never seen before.
Remark 30.4.18. OOV words are usually content words such as names and locations
which contain information crucial to the success of NLP tasks.
Idea: Model OOV words by
1. adding a new word token, e.g. <UNK> to the vocabulary,
2. in the training corpus, replacing the respective first occurrence of a previously
unknown word by <UNK>,
3. counting n-grams as usual, treating <UNK> as a regular word.
This trick can be refined if we have a word classifier, then use a new token per class,
e.g. <EMAIL> or <NUM>.
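As an illustration of step 2, the following small Python sketch (the helper name replace_first_occurrences is ours) replaces the first occurrence of every previously unseen word by <UNK>:

# Sketch of the <UNK> trick: replace the first occurrence of each previously
# unseen word in the training corpus by <UNK>, then count n-grams as usual.
def replace_first_occurrences(tokens, unk="<UNK>"):
    seen, out = set(), []
    for w in tokens:
        if w in seen:
            out.append(w)
        else:
            seen.add(w)
            out.append(unk)            # first occurrence becomes the unknown-word token
    return out

corpus = "the cat sat on the mat the cat".split()
print(replace_first_occurrences(corpus))
# ['<UNK>', '<UNK>', '<UNK>', '<UNK>', 'the', '<UNK>', 'the', 'cat']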
Example 30.4.19 (Test n-grams). Build unigram, bigram, and trigram language
models over the words of [RN03], then randomly sample sequences from the models.
1. Unigram: logical are as are confusion a may right tries agent goal the was . . .
2. Bigram: systems are very similar computational approach would be represented . . .
3. Trigram: planning and scheduling are integrated the success of naive bayes model . . .
Clearly there are differences; how can we measure them to evaluate the models?
Definition 30.4.20. The perplexity of a sequence c1:N is defined as
Perplexity(c_{1:N}) := P(c_{1:N})^{−(1/N)}
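To make the definition concrete, here is a small Python sketch that computes the perplexity of a character sequence; for simplicity it uses a unigram character model with add-one smoothing rather than the trigram models discussed above, and the corpus is a stand-in:

# Sketch: perplexity of a character sequence, Perplexity(c_{1:N}) = P(c_{1:N})^(-1/N),
# under a simple unigram character model estimated from a toy corpus.
from collections import Counter
import math

corpus = "the quick brown fox jumps over the lazy dog"
counts = Counter(corpus)
total = sum(counts.values())

def p_char(c):                          # unigram probability with add-one smoothing
    return (counts[c] + 1) / (total + len(counts) + 1)

def perplexity(seq):
    log_p = sum(math.log(p_char(c)) for c in seq)
    return math.exp(-log_p / len(seq))

print(perplexity("the lazy fox"))       # lower perplexity = better fit to the model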
However: Native speakers will tell you that a black cat matches a familiar
pattern: article-adjective-noun, while cat black a does not!
Example 30.5.1. Consider the fulvous kitten: a native speaker reasons that it
follows the determiner-adjective-noun pattern; fulvous (≙ brownish yellow) ends in ous ⇝ adjective.
So by generalization this is (probably) correct English.
Observation: The order of syntactical categories of words plays a role in English!
Problem: How can we compute them? (up next)
Part-of-Speech Tagging
Definition 30.5.2. Part-of-speech tagging (also POS tagging, POST, or grammatical tagging) is the process of marking up a word in a corpus with tags (called POS tags) corresponding to its part of speech.
Example 30.5.4. In text-to-speech synthesis, a POS tag of “noun” for record helps
determine the correct pronunciation (as opposed to the tag “verb”)
the HMM does not consider context other than the current state (Markov
property)
it does not have any idea what the sentence is trying to convey
Idea: Use the Viterbi algorithm to find the most probable sequence of hidden
states (POS tags)
POS taggers based on the Viterbi algorithm can reach an F1 score of up to 97%.
For the sensor model P (Wt = would|Ct = M D) = 0.1 means that if we choose a
modal verb, we will choose would 10% of the time.
These numbers also come from the corpus with appropriate smoothing.
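The following Python sketch shows the Viterbi computation on a toy HMM; the tag set, transition, and sensor probabilities are made-up illustrative values in the spirit of the example above:

# Viterbi sketch for HMM POS tagging.  Tags, words, and all probabilities below
# are toy values (cf. P(would|MD) = 0.1 in the example).
tags = ["MD", "VB", "PRP"]                       # modal, verb, pronoun
trans = {("PRP","MD"): 0.4, ("MD","VB"): 0.8, ("PRP","VB"): 0.3,
         ("MD","MD"): 0.05, ("VB","VB"): 0.1, ("VB","MD"): 0.1,
         ("PRP","PRP"): 0.05, ("MD","PRP"): 0.1, ("VB","PRP"): 0.4}
sensor = {("I","PRP"): 0.3, ("would","MD"): 0.1, ("go","VB"): 0.05}
prior = {"PRP": 0.4, "MD": 0.3, "VB": 0.3}

def viterbi(words):
    V = [{t: prior[t] * sensor.get((words[0], t), 1e-6) for t in tags}]
    back = []
    for w in words[1:]:
        col, ptr = {}, {}
        for t in tags:
            best = max(tags, key=lambda s: V[-1][s] * trans.get((s, t), 1e-6))
            col[t] = V[-1][best] * trans.get((best, t), 1e-6) * sensor.get((w, t), 1e-6)
            ptr[t] = best
        V.append(col); back.append(ptr)
    seq = [max(tags, key=lambda t: V[-1][t])]    # best final tag, then follow back-pointers
    for ptr in reversed(back):
        seq.append(ptr[seq[-1]])
    return list(reversed(seq))

print(viterbi(["I", "would", "go"]))             # expected: ['PRP', 'MD', 'VB']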
Limitations: HMM models only know about the transition and sensor models
In particular, we cannot take into account that e.g. words ending in ous are likely
adjectives.
We will see methods based on neural networks later.
Spam Detection
Definition 30.6.5. Spam detection – classifying an email message as spam or ham
(i.e. non-spam)
General Idea: Use NLP/machine learning techniques to learn the categories.
where P (c) is estimated just by counting the total number of spam and ham mes-
sages.
This approach works well for spam detection, just as it did for language identifi-
cation.
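A minimal Python sketch of such a Naive Bayes spam classifier – with a made-up four-message training set, estimating P(c) from class counts and P(w|c) from smoothed word counts – might look like this:

# Minimal Naive Bayes spam/ham sketch: P(c) from class counts, P(w|c) from word
# counts with add-one smoothing.  The tiny training set is made up.
from collections import Counter
import math

train = [("buy cheap pills now", "spam"), ("cheap money now", "spam"),
         ("meeting agenda for tomorrow", "ham"), ("lunch tomorrow?", "ham")]

class_counts = Counter(c for _, c in train)
word_counts = {c: Counter() for c in class_counts}
for text, c in train:
    word_counts[c].update(text.split())
vocab = {w for wc in word_counts.values() for w in wc}

def classify(text):
    scores = {}
    for c in class_counts:
        log_p = math.log(class_counts[c] / len(train))          # prior P(c)
        total = sum(word_counts[c].values())
        for w in text.split():
            log_p += math.log((word_counts[c][w] + 1) / (total + len(vocab)))
        scores[c] = log_p
    return max(scores, key=scores.get)

print(classify("cheap pills tomorrow"))   # 'spam' on this toy data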
Information Retrieval
Definition 30.7.3. Information retrieval (IR) deals with the representation, orga-
nization, storage, and maintenance of information objects that provide users with
easy access to the relevant information and satisfy their various information needs.
Observation (Hjørland 1997): Information need is closely related to relevance:
If something is relevant for a person in relation to a given task, we might say that
the person needs the information for that task.
Definition 30.7.4. Relevance denotes how well an information object meets the
information need of the user. Relevance may include concerns such as timeliness,
authority or novelty of the object.
Idea: Query and document are similar, iff the angle between their word frequency
vectors is small.
[Figure: document vectors D1 = (t_{1,1}, t_{1,2}, t_{1,3}) and D2 = (t_{2,1}, t_{2,2}, t_{2,3}) in the space spanned by term 1, term 2, and term 3.]
Lemma 30.7.10 (Euclidean Dot Product Formula). A·B = ∥A∥2 ∥B∥2 cos θ,
where θ is the angle between A and B.
Definition 30.7.11. The cosine similarity of A and B is cos θ = (A·B)/(∥A∥2 ∥B∥2).
Idea: Use the tfidf-vector with cosine similarity for information retrieval instead.
Definition 30.7.16. Let D be a document collection with vocabulary V =
{t1 , . . ., t|V | }, then the tfidf-vector tfidf(d, D) ∈ N|V | is defined by tfidf(d, D)i :=
tfidf(ti , d, D).
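Putting the last two definitions together, the following Python sketch builds tfidf vectors for a toy document collection and ranks the documents by cosine similarity to a query (the documents are made up):

# Sketch of tf-idf vectors and cosine similarity over a toy document collection,
# following tfidf(t,d,D) = tf(t,d) * log10(|D| / |{d in D | t in d}|).
import math
import numpy as np

docs = ["the cat sat on the mat", "the dog chased the cat", "dogs and cats"]
D = [d.split() for d in docs]
vocab = sorted({t for d in D for t in d})

def tfidf_vector(d):
    vec = []
    for t in vocab:
        tf = d.count(t)
        df = sum(1 for doc in D if t in doc)      # document frequency of term t
        vec.append(tf * math.log10(len(D) / df) if df else 0.0)
    return np.array(vec)

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

query = "the cat".split()
sims = [cosine(tfidf_vector(query), tfidf_vector(d)) for d in D]
print(sims)   # the first two documents should score higher than the third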
TF-IDF Example
Once an answer set has been determined, the results have to be sorted, so that they can be
presented to the user. As the user has a limited attention span – users will look at most at three
to eight results before refining a query – it is important to rank the results, so that the hits that
contain information relevant to the user's information need come early. This is a very difficult problem,
as it involves guessing the intentions and information context of users, to which the search engine
has no access.
Problem: There are many hits, and we need to sort them (e.g. by importance).
Idea: A web site is important, . . . if many other web sites hyperlink to it.
Definition 30.7.17. Let A be a web page that is hyperlinked from web pages
S1, . . ., Sn; then the page rank PR of A is defined as
PR(A) = (1 − d) + d · (PR(S1)/C(S1) + · · · + PR(Sn)/C(Sn))
where C(Si) is the number of links going out of Si and d is a damping factor.
Getting the ranking right is a determining factor for the success of a search engine. In fact, the early
success of Google was based on the pagerank algorithm discussed above (and the fact that they figured
out a revenue stream using text ads to monetize searches).
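The page rank equation can be solved by simple iteration; the following Python sketch does this for a small made-up link graph, using the customary damping factor d = 0.85 (an assumption, the text does not fix d):

# Sketch: iterating PR(A) = (1-d) + d * sum(PR(S)/C(S)) over a small made-up
# link graph until it converges.
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}   # page -> outgoing links
d = 0.85
pr = {p: 1.0 for p in links}

for _ in range(50):
    new = {}
    for page in links:
        incoming = [q for q in links if page in links[q]]
        new[page] = (1 - d) + d * sum(pr[q] / len(links[q]) for q in incoming)
    pr = new

print({p: round(v, 3) for p, v in sorted(pr.items())})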
Example 30.8.2. Extracting instances of addresses from web pages, with attributes
for street, city, state, and zip code;
Example 30.8.3. Extracting instances of storms from weather reports, with at-
tributes for temperature, wind speed, and precipitation.
Example 30.8.8. For List price $99.00, special sale price $78.00, shipping $3.00,
take the lowest price that is within 50% of the highest price ⇝ $78.00.
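This heuristic is easy to express with a regular expression; the following Python sketch implements it for the example sentence:

# Sketch of the price-extraction heuristic: find all dollar amounts and take the
# lowest price that is within 50% of the highest one.
import re

text = "List price $99.00, special sale price $78.00, shipping $3.00."
prices = [float(m) for m in re.findall(r"\$(\d+(?:\.\d{2})?)", text)]
candidates = [p for p in prices if p >= 0.5 * max(prices)]
print(min(candidates))   # 78.0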
Course Intent: Groom students for bachelor/master theses and as KWARC re-
search assistants.
In this chapter, we explore this idea, using – and extending – the methods from ??.
Overview:
1. Word embeddings
2. Recurrent neural networks for NLP
3. Sequence-to-sequence models
4. Transformer Architecture
5. Pretraining and transfer learning.
Word Embeddings
Problem: For ML methods in NLP, we need numerical data. (not words)
Idea: Embed words or word sequences into real vector spaces.
One hot word embeddings are rarely used for actual tasks, but often used as a
starting point for better word embeddings.
Example 31.1.3 (Vector Space Methods in Information Retrieval).
Word frequency vectors are induced by adding up one hot word embeddings.
Example 31.1.4. Given a corpus D – the context – the tf-idf word embedding
is given by tfidf(t, d, D) := tf(t, d) · log10(|D| / |{d ∈ D | t ∈ d}|), where tf(t, d) is the term
frequency of word t in document d.
Intuition behind these two: Words that occur in similar documents are similar.
Word2Vec
Idea: Use feature extraction to map words to vectors in RN :
Train a neural network on a “dummy task”, throw away the output layer, use the
previous layer’s output (of size N ) as the word embedding
First Attempt: Dimensionality Reduction: Train to predict the original one hot
vector:
For a vocabulary size V , train a network with a single hidden layer; i.e. three layers
of sizes (V, N, V ). The first two layers will compute our embeddings.
Feed the one hot encoded input word into the network, and train it on the one hot
vector itself, using a softmax activation function at the output layer. (softmax
normalizes a vector into a probability distribution)
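To illustrate the shapes involved in this “dummy task”, here is a Python/numpy sketch of the (V, N, V) network's forward pass with random weights; no training loop is shown, and the sizes are made up:

# Sketch of the (V, N, V) "dummy task" network: one-hot input, hidden layer of
# size N (the future embedding), softmax output.  Weights are random; this only
# illustrates the shapes involved.
import numpy as np

V, N = 10, 4                          # vocabulary size, embedding size
rng = np.random.default_rng(0)
W1 = rng.normal(size=(V, N))          # rows of W1 become the word embeddings
W2 = rng.normal(size=(N, V))

def forward(word_index):
    x = np.zeros(V); x[word_index] = 1.0      # one-hot input
    h = x @ W1                                # hidden layer = embedding of the word
    y = np.exp(h @ W2); y /= y.sum()          # softmax over the vocabulary
    return h, y

embedding, probs = forward(3)
print(embedding.shape, probs.shape, probs.sum())   # (4,) (10,) 1.0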
Properties
Vector embeddings like CBOW have interesting properties:
Similarity: Using e.g. cosine similarity (cos θ = (A·B)/(∥A∥∥B∥)) to compare vectors, we can
find words with similar meanings.
Semantic and syntactic relationships emerge as arithmetic relations (e.g. king − man + woman ≈ queen):
Word2vec: the original system that established the concept (see above)
GloVe (Global Vectors)
fastText (embeddings for 157 languages)
But we can also train our own word embedding (together with main task) (up
next)
a past participle
an adjective
a noun.
If a nearby temporal adverb refers to the past ⇝ this occurrence may be a past tense
verb.
Note: CBOW treats all context words identically regardless of order, but in POS
tagging the exact positions of the words matter.
POS/Embedding Network
Idea: Start with a random (or pretrained) embedding of the words in the corpus and
just concatenate them over some context window size
[Figure 24.3: Feedforward part-of-speech tagging model. The model takes a 5-word window as input and predicts the tag of the word in the middle – here, cut. It is able to account for word position because each of the 5 input embeddings is multiplied by a different part of the first hidden layer; the parameter values for the word embeddings and for the three layers are all learned simultaneously during training.]
Layer 1 has (in this case) 5 · N inputs; the output layer is one hot over the POS classes.
We look up the embedding for each word and concatenate the embedding vectors. The result is a
real-valued input vector of length 5 · N. Even though a given word will have the
same embedding vector whether it occurs in the first position, the last, or
somewhere in between, each embedding will be multiplied by a different part of the first hidden layer.
The embedding layers treat all words the same, but the first hidden layer will treat
them differently depending on the position.
The embeddings will be finetuned for the POS task during training.
Note: Better positional encoding techniques exist (e.g. sinusoidal), but for fixed small
context window sizes, this works well.
Example 31.2.1. In the sentence Eduardo told me that Miguel was very sick so
I took him to the hospital, the pronoun him refers to Miguel and not Eduardo.
(14 words of context)
Observation: Language models with n-grams or n-word feed-forward networks
have problems:
Either the context is too small or the model has too many parameters! (or both)
Observation: Feed-forward networks N also have the problem of asymmetry:
whatever N learns about a word w at position n, it has to relearn about w at
position m ̸= n.
Idea: What about recurrent neural networks – nets with cycles? (up next)
Intuition: RNNs are a bit like HMMs and dynamic Bayesian networks: they make a Markov assumption – the hidden state z suffices to capture the input from all previous inputs.

[Figure: (a) schematic diagram of an RNN whose hidden layer has recurrent connections; each input is the word embedding vector of the next word in the sentence, and each output is the output for that time step. (b) The same network unrolled over three timesteps to create a feedforward network; the weights are shared across all timesteps.]

The hidden layer has access to both the current input word and the previous hidden state, which means that information about any word in the input can be copied over (or modified as appropriate) from one time step to the next indefinitely. Of course, there is a limited amount of storage in z, so it cannot remember everything about all the previous words.

Training RNNs for NLP
Problem: The weight matrices Wx,z, Wz,z, and Wz,y are shared over all time steps.
Definition 31.2.4. The back-propagation through time algorithm carefully maintains the identity of the shared weights while back-propagating through the unrolled network.
For a language model we are interested in doing multiclass classification: the classes are the words of the vocabulary, so the output will be a softmax probability distribution over the possible values of the next word in the sentence.

The RNN architecture solves the problem of too many parameters: the number of parameters in the weight matrices Wx,z, Wz,z, and Wz,y stays constant, regardless of the number of words; this is in contrast to feedforward networks and n-gram models, whose parameter count grows with the context size and the size of the vocabulary. The RNN architecture also solves the problem of asymmetry, because the weights are the same for every word position.

For tagging or coreference tasks the only difference is that the training data will require labels – part of speech tags or reference indications. That makes it much harder to collect the data than for the case of a language model, where unlabelled text is all we need. Moreover, in a language model we want to predict the n-th word given the previous words, but for classification there is no reason we should limit ourselves to looking at only the previous words: it can be very helpful to look ahead in the sentence. In our coreference example, the referent him would be different if the sentence concluded “to see Miguel” rather than “to the hospital”, so looking ahead is crucial. We know from eye-tracking experiments that human readers do not go strictly left-to-right.

Bidirectional RNN for more Context

Observation: RNNs only take left context – i.e. words before – into account, but we may also need right context – the words after.

Example 31.2.5. For Eduardo told me that Miguel was very sick so I took him to the hospital the pronoun him resolves to Miguel with high probability. If the sentence ended with to see Miguel, then it should be Eduardo.
Example 31.2.7. Bidirectional RNNs can be used for POS tagging, extending the network from ??.
[Figure 24.5: a bidirectional RNN for POS tagging.]
LSTM: Idea
Introduce a memory vector c in addition to the recurrent (short-term memory) vector
z
c is essentially copied from the previous time step, but can be modified by the forget
gate f , the input gate i, and the output gate o.
the forget gate f decides which components of c to retain or discard
the input gate i decides which components of the current input to add to c
(additive, not multiplicative ; no vanishing gradients)
the output gate o decides which components of c to output as z
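The gating structure can be illustrated by a single LSTM time step in Python/numpy; the weights are random and the exact parametrization is an assumption (real LSTMs add bias terms and learned weights):

# Sketch of one LSTM time step: the forget gate f, input gate i, and output
# gate o modify the memory vector c and produce the short-term state z.
import numpy as np

def sigmoid(x): return 1 / (1 + np.exp(-x))

d = 4                                          # hidden size
rng = np.random.default_rng(1)
Wf, Wi, Wo, Wc = (rng.normal(size=(d, 2 * d)) for _ in range(4))

def lstm_step(x, z_prev, c_prev):
    h = np.concatenate([x, z_prev])            # current input and previous state
    f = sigmoid(Wf @ h)                        # which parts of c to keep
    i = sigmoid(Wi @ h)                        # which parts of the input to add
    o = sigmoid(Wo @ h)                        # which parts of c to output
    c = f * c_prev + i * np.tanh(Wc @ h)       # additive memory update
    z = o * np.tanh(c)
    return z, c

z, c = lstm_step(rng.normal(size=d), np.zeros(d), np.zeros(d))
print(z.shape, c.shape)                        # (4,) (4,)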
the three Spanish words caballo de mar translate to the English seahorse and
the two Spanish words perro grande translate to English as big dog.
in English, the subject is usually first and in Fijian last.
Idea: For MT, generate one word at a time, but keep track of the context, so that
Sequence-To-Sequence Models
Idea: Use two coupled RNNs, one for the source, and one for the target. The
input for the target is the output of the last hidden layer of the source RNN.
Definition 31.3.1. A sequence-to-sequence (seq2seq) model is a neural model for
translating an input sequence x into an output sequence y by an encoder followed
by a decoder that generates y.
[Figure: a seq2seq model – the Encoder reads the input, its final hidden state h is passed to the Decoder, which produces the output.]
Example 31.3.2. A simple seq2seq model (without embedding and output layers): each block represents one LSTM time step; the inputs are fed in successively, followed by the token <start> to start the decoder.

This neural network architecture is called a basic sequence-to-sequence model. Sequence-to-sequence models are most commonly used for machine translation, but can also be used for a number of other tasks, like automatically generating a text caption from an image, or summarization: rewriting a long text into a shorter one.

[Figure: Basic sequence-to-sequence model. Each block represents one LSTM timestep; for simplicity, the embedding and output layers are not shown. On successive steps we feed the network the words of the source sentence “The man is tall”, followed by the <start> tag to indicate that the network should start producing the target sentence. The final hidden state at the end of the source sentence is used as the hidden state for the start of the target sentence. After that, each target sentence word at time t is used as input at time t + 1, until the network produces the <end> tag to indicate that sentence generation is finished.]
Seq2Seq Evaluation
Remark: Seq2seq models were a major breakthrough in NLP and MT. But they
have three major shortcomings:
nearby context bias: RNNs remember with their hidden state, which has more
information about a word in – say – step 56 than in step 5. BUT long-distance
context can also be important.
fixed context size: the entire information about the source sentence must be
compressed into the fixed-dimensional – typically 1024 – vector. Larger vectors
; slow training and overfitting.
Idea: Concatenate all source RNN hidden vectors to use all of them to mitigate
the nearby context bias.
Attention
Bad Idea: Concatenate all source RNN hidden vectors to use all of them to
mitigate the nearby context bias.
Better Idea: The decoder generates the target sequence one word at a time. ;
Only a small part of the source is actually relevant.
the decoder must focus on different parts of the source for every word.
Idea: We need a neural component that does context-free summarization.
Definition 31.3.3. An attentional seq2seq model is a seq2seq that passes along a
context vector ci in the decoder. If hi = RN N (hi−1 , xi ) is the standard decoder,
then the decoder with attention is given by hi = RN N (hi−1 , xi + ci ), where xi + ci
is the concatenation of the input xi and context vectors ci with
c_i a weighted summary of the source: the dot product of the current target RNN state (the vector that is going to be used for predicting the word at timestep i) with each source RNN output vector gives a raw “attention score” for that source word. These scores are then normalized into a probability using a softmax over all source words. Finally, these probabilities are used to generate c_i as a weighted average of the source RNN vectors (another d-dimensional vector, where d is the hidden size).

[Figure: an attentional seq2seq model – Encoder and Decoder, where the decoder input at each step is x_i + c_i.]

An example of an attentional sequence-to-sequence model is given in Figure 24.7(a). There are a few important details to understand. First, the attention component itself has no learned weights and supports variable-length sequences on both the source and target side. Second, like most of the other neural network modeling techniques we have learned about, attention is entirely latent: the programmer does not dictate what information gets used when; the model learns what to use. Attention can also be combined with multilayer RNNs; typically attention is applied at each layer in that case.

Attention: English to Spanish Translation
Definition 31.3.6. Always selecting the highest probability word is called greedy
decoding.
Problem: This may not always maximize the probability of the whole sequence
Example 31.3.7. Let’s use a greedy decoder on The front door is red.
[Figure: beam search with a given beam size. The score of each word is the log-probability generated by the target RNN softmax, and the score of each hypothesis is the sum of the word scores. At timestep 3, the highest-scoring hypothesis La entrada can only generate low-probability continuations, so it “falls off the beam”.]

Word scores are log-probabilities generated by the decoder softmax; the hypothesis score is the sum of the word scores.

At time step 3, the highest scoring hypothesis La entrada can only generate low-probability continuations, so it “falls off the beam”. (as intended)
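The following Python sketch shows generic beam search over a toy next-word distribution (the probabilities are made up); word scores are log-probabilities and hypothesis scores are their sums, as described above:

# Generic beam-search sketch: only the `beam_size` best hypotheses survive each step.
import math

def next_word_probs(prefix):
    # stand-in for the decoder softmax; returns {word: probability}
    table = {(): {"la": 0.6, "el": 0.4},
             ("la",): {"puerta": 0.7, "entrada": 0.3},
             ("el",): {"frente": 1.0},
             ("la", "puerta"): {"<end>": 1.0},
             ("la", "entrada"): {"<end>": 0.1, "es": 0.9},
             ("el", "frente"): {"<end>": 1.0}}
    return table.get(tuple(prefix), {"<end>": 1.0})

def beam_search(beam_size=2, max_len=4):
    beam = [([], 0.0)]                                     # (hypothesis, score)
    for _ in range(max_len):
        candidates = []
        for hyp, score in beam:
            if hyp and hyp[-1] == "<end>":
                candidates.append((hyp, score))            # finished hypotheses are kept
                continue
            for w, p in next_word_probs(hyp).items():
                candidates.append((hyp + [w], score + math.log(p)))
        beam = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_size]
    return beam

print(beam_search())

On this toy data the hypothesis starting with la entrada is generated but pruned in favor of higher-scoring alternatives, mirroring the “falls off the beam” behavior described above.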
Self-Attention
Idea: “Attention is all you need!” (see [Vas+17])
So far, attention was used from the encoder to the decoder.
Self-attention extends this so that each hidden state sequence also attends to itself.
(*coder to *coder)
Idea: Just use the dot product of the input vectors.
Problem: The dot product of a vector with itself is always high, so each hidden state will be biased towards
attending to itself.
Self-attention solves this by first projecting the input into three different represen-
tations using three different weight matrices:
the query vector q_i = W_q x_i ≙ standard attention
key vector k_i = W_k x_i ≙ the source in seq2seq
value vector v_i = W_v x_i is the context being generated
r_ij = (q_i · k_j) / √d
a_ij = e^{r_ij} / (∑_k e^{r_ik})
c_i = ∑_j a_ij · v_j
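These equations translate directly into a few lines of Python/numpy; the projection matrices here are random and the sizes are made up:

# Sketch of the self-attention equations: project inputs to queries, keys, and
# values, compute scaled dot-product scores, softmax them, and average the values.
import numpy as np

rng = np.random.default_rng(2)
n, d = 5, 8                                   # sequence length, hidden size
X = rng.normal(size=(n, d))                   # input vectors x_1..x_n
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))

Q, K, V = X @ Wq, X @ Wk, X @ Wv              # q_i, k_i, v_i for all positions
R = Q @ K.T / np.sqrt(d)                      # r_ij = (q_i . k_j) / sqrt(d)
A = np.exp(R) / np.exp(R).sum(axis=1, keepdims=True)   # a_ij via row-wise softmax
C = A @ V                                     # c_i = sum_j a_ij v_j
print(C.shape)                                # (5, 8): one context vector per input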
Positional embedding
The transformer architecture does not explicitly capture the order of words in the sequence, since context is modeled only through self-attention, which is agnostic to word order. To capture the ordering of the words, the transformer uses a technique called positional embedding.
Figure 24.10 illustrates the transformer architecture for POS tagging, applied to the same sentence used in Figure 24.3. At the bottom, the word embedding and the positional embeddings are summed to form the input for a three-layer transformer.
Idea: Take a pretrained neural network, replace the last layer(s), and then train
those on your own corpus.
Observation: Simple but surprisingly efficient!
Repeat until ℓ = N .
⇝ we obtain a one-hot encoding of tokens of size N, where the most common sequences
of bytes are represented by a single token. By retaining BPE(⟨b⟩) = b, we avoid OOV
problems.
⇝ We can then train a word embedding on the resulting tokens.
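For illustration, here is a Python sketch of the core byte pair encoding idea – repeatedly merging the most frequent adjacent token pair until the vocabulary reaches the target size N; real BPE tokenizers work on bytes and large corpora, so this is only a toy version:

# Toy BPE sketch: start from single characters and repeatedly merge the most
# frequent adjacent token pair until the vocabulary reaches size N.
from collections import Counter

def bpe(text, N):
    tokens = list(text)
    vocab = set(tokens)
    while len(vocab) < N:
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]
        merged, i = [], 0
        while i < len(tokens):                 # merge every occurrence of (a, b)
            if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
                merged.append(a + b); i += 2
            else:
                merged.append(tokens[i]); i += 1
        tokens = merged
        vocab.add(a + b)
    return tokens, vocab

tokens, vocab = bpe("low lower lowest", N=12)
print(tokens)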
Tokenization - Example
https://huggingface.co/spaces/Xenova/the-tokenizer-playground
Positional encodings
Definition 31.5.5. Let ⟨w1 , . . . , wn ⟩ be a sequence of tokens. A positional encoding
PEi (wi ) is a vector that retains the position of wi in the sequence alongside the word
embedding of wi .
We want positional encodings to satisfy the following properties:
Masked Token Prediction: Given a sentence (e.g. “The river rose five feet”), ran-
domly replace tokens by a special mask token (e.g. “The river [MASK] five feet”).
The LLM should predict the masked tokens (e.g. “rose”). (BERT et al; well suited
for generic tasks)
Discrimination: Train a small masked token prediction model M. Given a masked
sentence, let M generate possible completions. Train the actual model to distinguish
between tokens generated by M and the original tokens. (Google Electra et
al; well suited for generic tasks)
Next Token Prediction: Given the (beginning of) a sentence, predict the next token
in the sequence. (GPT et al; well suited for generative tasks)
DL4NLP methods do very well, but only after processing orders of magnitude more
data than humans do for learning language.
This suggests that there is scope for new insights from all areas.
Planning
Planning Frameworks
Planning Algorithms
Planning and Acting in the real world
[Figure 2.1: Agents interact with environments through sensors and actuators.]
Simple Reflex Agents

[AIMA Section 2.4 (The Structure of Agents), excerpt: Mathematically speaking, an agent's behavior is described by the agent function that maps any given percept sequence to an action; the agent function is an abstract mathematical description, while the agent program is a concrete implementation running within some physical system. The vacuum-cleaner world with the two locations A and B illustrates this: the vacuum agent perceives which square it is in and whether there is dirt in the square, and can move left, move right, suck up the dirt, or do nothing.]

[Figure 2.9: Schematic diagram of a simple reflex agent – Sensors yield “what the world is like now”, condition-action rules determine “what action I should do now”, which the Actuators execute.]

function SIMPLE−REFLEX−AGENT(percept) returns an action
persistent: rules, a set of condition–action rules
state ← INTERPRET−INPUT(percept)
rule ← RULE−MATCH(state, rules)
action ← rule.ACTION
return action

[Figure 2.10: A simple reflex agent. It acts according to a rule whose condition matches the current state, as defined by the percept; INTERPRET−INPUT generates an abstracted description of the current state from the percept.]

Reflex Agents with State
[Figure: a model-based, goal-based agent – an internal State together with knowledge of “how the world evolves” and “what my actions do” determines “what the world is like now” and “what it will be like if I do action A”; combined with the agent's Goals this determines “what action I should do now”, which the Actuators execute.]

state ← UPDATE−STATE(state, action, percept, model)
rule ← RULE−MATCH(state, rules)
action ← rule.ACTION
return action

[Figure 2.12: A model-based reflex agent. It keeps track of the current state of the world, using an internal model. It then chooses an action in the same way as the reflex agent.]
[Figure 2.15: A learning agent – a Critic compares percepts against a performance standard and gives feedback to the Learning element; the Learning element makes changes to the Performance element (which selects the external actions) and sets learning goals for a Problem generator; the agent interacts with the Environment through Sensors and Actuators.]
[AIMA excerpt on learning agents: Turing estimates how much work programming intelligent behavior by hand might take and concludes “Some more expeditious method seems desirable”; the method he proposes is to build learning machines and then to teach them. In many areas of AI, this is now the preferred method for creating state-of-the-art systems. Learning also allows the agent to operate in initially unknown environments and to become more competent than its initial knowledge alone might allow. A learning agent (Figure 2.15) can be divided into four conceptual components: the learning element, which is responsible for making improvements; the performance element, which takes in percepts and decides on actions (what we previously considered to be the entire agent); the critic, which tells the learning element how well the agent is doing with respect to a fixed performance standard – necessary because the percepts themselves provide no indication of the agent's success; and the problem generator. The design of the learning element depends very much on the design of the performance element: the first question is not “How am I going to get it to learn this?” but “What kind of performance element will my agent need to do this once it has learned how?”]

Rational Agents

Idea: Try to design agents that are successful. (do the right thing)

Definition 32.0.1. An agent is called rational, if it chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date. This is called the MEU principle.

Note: A rational agent need not be perfect:
it only needs to maximize expected value (rational ≠ omniscient)
need not predict e.g. very unlikely but catastrophic events in the future
percepts may not supply all relevant information (rational ≠ clairvoyant)
if we cannot perceive things we do not need to react to them,
but we may need to try to find out about hidden dangers. (exploration)
action outcomes may not be as expected (rational ≠ successful)
but we may need to take action to ensure that they do. (more often) (learning)

Rational ⇝ exploration, learning, autonomy
[Bac00] Fahiem Bacchus. Subset of PDDL for the AIPS2000 Planning Competition. The AIPS-
00 Planning Competition Comitee. 2000.
[BF95] Avrim L. Blum and Merrick L. Furst. “Fast planning through planning graph analysis”.
In: Proceedings of the 14th International Joint Conference on Artificial Intelligence
(IJCAI). Ed. by Chris S. Mellish. Montreal, Canada: Morgan Kaufmann, San Mateo,
CA, 1995, pp. 1636–1642.
[BF97] Avrim L. Blum and Merrick L. Furst. “Fast planning through planning graph analysis”.
In: Artificial Intelligence 90.1-2 (1997), pp. 279–298.
[BG01] Blai Bonet and Héctor Geffner. “Planning as Heuristic Search”. In: Artificial Intelli-
gence 129.1–2 (2001), pp. 5–33.
[BG99] Blai Bonet and Héctor Geffner. “Planning as Heuristic Search: New Results”. In:
Proceedings of the 5th European Conference on Planning (ECP’99). Ed. by S. Biundo
and M. Fox. Springer-Verlag, 1999, pp. 60–72.
[BKS04] Paul Beame, Henry A. Kautz, and Ashish Sabharwal. “Towards Understanding and
Harnessing the Potential of Clause Learning”. In: Journal of Artificial Intelligence
Research 22 (2004), pp. 319–351.
[Bon+12] Blai Bonet et al., eds. Proceedings of the 22nd International Conference on Automated
Planning and Scheduling (ICAPS’12). AAAI Press, 2012.
[Bro90] Rodney Brooks. “Elephants Don't Play Chess”. In: Robotics and Autonomous Systems 6.1–2 (1990), pp. 3–15. doi:
10.1016/S0921-8890(05)80025-9.
[Cho65] Noam Chomsky. Syntactic structures. Den Haag: Mouton, 1965.
[CKT91] Peter Cheeseman, Bob Kanefsky, and William M. Taylor. “Where the Really Hard
Problems Are”. In: Proceedings of the 12th International Joint Conference on Artificial
Intelligence (IJCAI). Ed. by John Mylopoulos and Ray Reiter. Sydney, Australia:
Morgan Kaufmann, San Mateo, CA, 1991, pp. 331–337.
[CM85] Eugene Charniak and Drew McDermott. Introduction to Artificial Intelligence. Ad-
dison Wesley, 1985.
[CQ69] Allan M. Collins and M. Ross Quillian. “Retrieval time from semantic memory”. In:
Journal of verbal learning and verbal behavior 8.2 (1969), pp. 240–247. doi: 10.1016/
S0022-5371(69)80069-1.
[Dav67] Donald Davidson. “Truth and Meaning”. In: Synthese 17 (1967).
[DCM12] DCMI Usage Board. DCMI Metadata Terms. DCMI Recommendation. Dublin Core
Metadata Initiative, June 14, 2012. url: http : / / dublincore . org / documents /
2012/06/14/dcmi-terms/.
[DF31] B. De Finetti. “Sul significato soggettivo della probabilita”. In: Fundamenta Mathe-
maticae 17 (1931), pp. 298–329.
[DHK15] Carmel Domshlak, Jörg Hoffmann, and Michael Katz. “Red-Black Planning: A New
Systematic Approach to Partial Delete Relaxation”. In: Artificial Intelligence 221
(2015), pp. 73–114.
[Ede01] Stefan Edelkamp. “Planning with Pattern Databases”. In: Proceedings of the 6th Eu-
ropean Conference on Planning (ECP’01). Ed. by A. Cesta and D. Borrajo. Springer-
Verlag, 2001, pp. 13–24.
[FD14] Zohar Feldman and Carmel Domshlak. “Simple Regret Optimization in Online Plan-
ning for Markov Decision Processes”. In: Journal of Artificial Intelligence Research
51 (2014), pp. 165–205.
[Fis] John R. Fisher. prolog :- tutorial. url: https : / / saksagan . ceng . metu . edu .
tr/courses/ceng242/documents/prolog/jrfisher/contents.html (visited on
10/29/2024).
[FL03] Maria Fox and Derek Long. “PDDL2.1: An Extension to PDDL for Expressing Tem-
poral Planning Domains”. In: Journal of Artificial Intelligence Research 20 (2003),
pp. 61–124.
[Fla94] Peter Flach. Simply Logical: Intelligent Reasoning by Example. Wiley, 1994. isbn: 0471 94152 2. url: https://github.com/simply-
logical/simply-logical/releases/download/v1.0/SL.pdf.
[FN71] Richard E. Fikes and Nils Nilsson. “STRIPS: A New Approach to the Application of
Theorem Proving to Problem Solving”. In: Artificial Intelligence 2 (1971), pp. 189–
208.
[Gen34] Gerhard Gentzen. “Untersuchungen über das logische Schließen I”. In: Mathematische
Zeitschrift 39.2 (1934), pp. 176–210.
[Ger+09] Alfonso Gerevini et al. “Deterministic planning in the fifth international planning
competition: PDDL3 and experimental evaluation of the planners”. In: Artificial In-
telligence 173.5-6 (2009), pp. 619–668.
[GJ79] Michael R. Garey and David S. Johnson. Computers and Intractability—A Guide to
the Theory of NP-Completeness. BN book: Freeman, 1979.
[Glo] Grundlagen der Logik in der Informatik. Course notes at https://www8.cs.fau.de/
_media/ws16:gloin:skript.pdf. url: https://www8.cs.fau.de/_media/ws16:
gloin:skript.pdf (visited on 10/13/2017).
[GNT04] Malik Ghallab, Dana Nau, and Paolo Traverso. Automated Planning: Theory and
Practice. Morgan Kaufmann, 2004.
[GS05] Carla Gomes and Bart Selman. “Can get satisfaction”. In: Nature 435 (2005), pp. 751–
752.
[GSS03] Alfonso Gerevini, Alessandro Saetti, and Ivan Serina. “Planning through Stochas-
tic Local Search and Temporal Action Graphs”. In: Journal of Artificial Intelligence
Research 20 (2003), pp. 239–290.
[Hau85] John Haugeland. Artificial intelligence: the very idea. Massachusetts Institute of Tech-
nology, 1985.
[HD09] Malte Helmert and Carmel Domshlak. “Landmarks, Critical Paths and Abstractions:
What’s the Difference Anyway?” In: Proceedings of the 19th International Conference
on Automated Planning and Scheduling (ICAPS’09). Ed. by Alfonso Gerevini et al.
AAAI Press, 2009, pp. 162–169.
[HE05] Jörg Hoffmann and Stefan Edelkamp. “The Deterministic Part of IPC-4: An Overview”.
In: Journal of Artificial Intelligence Research 24 (2005), pp. 519–579.
[Hel06] Malte Helmert. “The Fast Downward Planning System”. In: Journal of Artificial In-
telligence Research 26 (2006), pp. 191–246.
[Her+13a] Ivan Herman et al. RDF 1.1 Primer (Second Edition). Rich Structured Data Markup
for Web Documents. W3C Working Group Note. World Wide Web Consortium (W3C),
2013. url: http://www.w3.org/TR/rdfa-primer.
[Her+13b] Ivan Herman et al. RDFa 1.1 Primer – Second Edition. Rich Structured Data Markup
for Web Documents. W3C Working Goup Note. World Wide Web Consortium (W3C),
Apr. 19, 2013. url: http://www.w3.org/TR/xhtml-rdfa-primer/.
[HG00] Patrik Haslum and Hector Geffner. “Admissible Heuristics for Optimal Planning”. In:
Proceedings of the 5th International Conference on Artificial Intelligence Planning
Systems (AIPS’00). Ed. by S. Chien, R. Kambhampati, and C. Knoblock. Brecken-
ridge, CO: AAAI Press, Menlo Park, 2000, pp. 140–149.
[HG08] Malte Helmert and Hector Geffner. “Unifying the Causal Graph and Additive Heuris-
tics”. In: Proceedings of the 18th International Conference on Automated Planning
and Scheduling (ICAPS’08). Ed. by Jussi Rintanen et al. AAAI Press, 2008, pp. 140–
147.
[HHH07] Malte Helmert, Patrik Haslum, and Jörg Hoffmann. “Flexible Abstraction Heuristics
for Optimal Sequential Planning”. In: Proceedings of the 17th International Conference
on Automated Planning and Scheduling (ICAPS’07). Ed. by Mark Boddy, Maria
Fox, and Sylvie Thiebaux. Providence, Rhode Island, USA: Morgan Kaufmann, 2007,
pp. 176–183.
[Hit+12] Pascal Hitzler et al. OWL 2 Web Ontology Language Primer (Second Edition). W3C
Recommendation. World Wide Web Consortium (W3C), 2012. url: http://www.
w3.org/TR/owl-primer.
[HN01] Jörg Hoffmann and Bernhard Nebel. “The FF Planning System: Fast Plan Generation
Through Heuristic Search”. In: Journal of Artificial Intelligence Research 14 (2001),
pp. 253–302.
[Hof11] Jörg Hoffmann. “Everything You Always Wanted to Know about Planning (But
Were Afraid to Ask)”. In: Proceedings of the 34th Annual German Conference on
Artificial Intelligence (KI’11). Ed. by Joscha Bach and Stefan Edelkamp. Vol. 7006.
Lecture Notes in Computer Science. Springer, 2011, pp. 1–13. url: http://fai.cs.
uni-saarland.de/hoffmann/papers/ki11.pdf.
[How60] R. A. Howard. Dynamic Programming and Markov Processes. MIT Press, 1960.
[ILD] 7. Constraints: Interpreting Line Drawings. url: https://www.youtube.com/watch?
v=l-tzjenXrvI&t=2037s (visited on 11/19/2019).
[JN33] J. Neyman and E. S. Pearson. “IX. On the problem of the most efficient tests of statis-
tical hypotheses”. In: Philosophical Transactions of the Royal Society of London A:
Mathematical, Physical and Engineering Sciences 231.694-706 (1933), pp. 289–337.
doi: 10.1098/rsta.1933.0009.
[KC04] Graham Klyne and Jeremy J. Carroll. Resource Description Framework (RDF): Con-
cepts and Abstract Syntax. W3C Recommendation. World Wide Web Consortium
(W3C), Feb. 10, 2004. url: http://www.w3.org/TR/2004/REC- rdf- concepts-
20040210/.
[KD09] Erez Karpas and Carmel Domshlak. “Cost-Optimal Planning with Landmarks”. In:
Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJ-
CAI’09). Ed. by C. Boutilier. Pasadena, California, USA: Morgan Kaufmann, July
2009, pp. 1728–1733.
[Kee74] R. L. Keeney. “Multiplicative utility functions”. In: Operations Research 22 (1974),
pp. 22–34.
[KHD13] Michael Katz, Jörg Hoffmann, and Carmel Domshlak. “Who Said We Need to Relax
all Variables?” In: Proceedings of the 23rd International Conference on Automated
Planning and Scheduling (ICAPS’13). Ed. by Daniel Borrajo et al. Rome, Italy: AAAI
Press, 2013, pp. 126–134.
[KHH12a] Michael Katz, Jörg Hoffmann, and Malte Helmert. “How to Relax a Bisimulation?”
In: Proceedings of the 22nd International Conference on Automated Planning and
Scheduling (ICAPS’12). Ed. by Blai Bonet et al. AAAI Press, 2012, pp. 101–109.
[KHH12b] Emil Keyder, Jörg Hoffmann, and Patrik Haslum. “Semi-Relaxed Plan Heuristics”.
In: Proceedings of the 22nd International Conference on Automated Planning and
Scheduling (ICAPS’12). Ed. by Blai Bonet et al. AAAI Press, 2012, pp. 128–136.
[KNS97] B. Kessler, G. Nunberg, and H. Schütze. “Automatic detection of text genre”. In:
CoRR cmp-lg/9707002 (1997).
[Koe+97] Jana Koehler et al. “Extending Planning Graphs to an ADL Subset”. In: Proceedings
of the 4th European Conference on Planning (ECP’97). Ed. by S. Steel and R. Alami.
Springer-Verlag, 1997, pp. 273–285. url: ftp://ftp.informatik.uni- freiburg.
de/papers/ki/koehler-etal-ecp-97.ps.gz.
[Koh08] Michael Kohlhase. “Using LATEX as a Semantic Markup Format”. In: Mathematics in
Computer Science 2.2 (2008), pp. 279–304. url: https://kwarc.info/kohlhase/
papers/mcs08-stex.pdf.
[Kow97] Robert Kowalski. “Algorithm = Logic + Control”. In: Communications of the Asso-
ciation for Computing Machinery 22 (1997), pp. 424–436.
[KS00] Jana Köhler and Kilian Schuster. “Elevator Control as a Planning Problem”. In: AIPS
2000 Proceedings. AAAI, 2000, pp. 331–338. url: https://www.aaai.org/Papers/
AIPS/2000/AIPS00-036.pdf.
[KS06] Levente Kocsis and Csaba Szepesvári. “Bandit Based Monte-Carlo Planning”. In:
Proceedings of the 17th European Conference on Machine Learning (ECML 2006). Ed.
by Johannes Fürnkranz, Tobias Scheffer, and Myra Spiliopoulou. Vol. 4212. LNCS.
Springer-Verlag, 2006, pp. 282–293.
[KS92] Henry A. Kautz and Bart Selman. “Planning as Satisfiability”. In: Proceedings of the
10th European Conference on Artificial Intelligence (ECAI’92). Ed. by B. Neumann.
Vienna, Austria: Wiley, Aug. 1992, pp. 359–363.
[KS98] Henry A. Kautz and Bart Selman. “Pushing the Envelope: Planning, Propositional
Logic, and Stochastic Search”. In: Proceedings of the Thirteenth National Conference
on Artificial Intelligence AAAI-96. MIT Press, 1998, pp. 1194–1201.
[Kur90] Ray Kurzweil. The Age of Intelligent Machines. MIT Press, 1990. isbn: 0-262-11121-7.
[LPN] Learn Prolog Now! url: http://lpn.swi-prolog.org/ (visited on 10/10/2019).
[LS93] George F. Luger and William A. Stubblefield. Artificial Intelligence: Structures and
Strategies for Complex Problem Solving. World Student Series. The Benjamin/Cum-
mings, 1993. isbn: 9780805347852.
[Luc96] Peter Lucas. “Knowledge Acquisition for Decision-theoretic Expert Systems”. In:
AISB Quarterly 94 (1996), pp. 23–33. url: https : / / www . researchgate . net /
publication/2460438_Knowledge_Acquisition_for_Decision-theoretic_Expert_
Systems.
[McD+98] Drew McDermott et al. The PDDL Planning Domain Definition Language. The AIPS-
98 Planning Competition Comitee. 1998.
[Met+53] N. Metropolis et al. “Equations of state calculations by fast computing machines”. In:
Journal of Chemical Physics 21 (1953), pp. 1087–1091.
[Min] Minion - Constraint Modelling. System Web page at http://constraintmodelling.
org/minion/. url: http://constraintmodelling.org/minion/.
[MSL92] David Mitchell, Bart Selman, and Hector J. Levesque. “Hard and Easy Distributions
of SAT Problems”. In: Proceedings of the 10th National Conference of the American
Association for Artificial Intelligence (AAAI’92). San Jose, CA: MIT Press, 1992,
pp. 459–465.
[NHH11] Raz Nissim, Jörg Hoffmann, and Malte Helmert. “Computing Perfect Heuristics in
Polynomial Time: On Bisimulation and Merge-and-Shrink Abstraction in Optimal
Planning”. In: Proceedings of the 22nd International Joint Conference on Artificial
Intelligence (IJCAI’11). Ed. by Toby Walsh. AAAI Press/IJCAI, 2011, pp. 1983–
1990.
[Nor+18a] Emily Nordmann et al. Lecture capture: Practical recommendations for students and
lecturers. 2018. url: https://osf.io/huydx/download.
[Nor+18b] Emily Nordmann et al. Vorlesungsaufzeichnungen nutzen: Eine Anleitung für Studierende.
2018. url: https://osf.io/e6r7a/download.
[NS63] Allen Newell and Herbert Simon. “GPS, a program that simulates human thought”.
In: Computers and Thought. Ed. by E. Feigenbaum and J. Feldman. McGraw-Hill,
1963, pp. 279–293.
[NS76] Allen Newell and Herbert A. Simon. “Computer Science as Empirical Inquiry: Symbols
and Search”. In: Communications of the ACM 19.3 (1976), pp. 113–126. doi: 10.
1145/360018.360022.
[OWL09] OWL Working Group. OWL 2 Web Ontology Language: Document Overview. W3C
Recommendation. World Wide Web Consortium (W3C), Oct. 27, 2009. url: http:
//www.w3.org/TR/2009/REC-owl2-overview-20091027/.
[PD09] Knot Pipatsrisawat and Adnan Darwiche. “On the Power of Clause-Learning SAT
Solvers with Restarts”. In: Proceedings of the 15th International Conference on Princi-
ples and Practice of Constraint Programming (CP’09). Ed. by Ian P. Gent. Vol. 5732.
Lecture Notes in Computer Science. Springer, 2009, pp. 654–668.
[Pól73] George Pólya. How to Solve it. A New Aspect of Mathematical Method. Princeton
University Press, 1973.
[Pra+94] Malcolm Pradhan et al. “Knowledge Engineering for Large Belief Networks”. In:
Proceedings of the Tenth International Conference on Uncertainty in Artificial In-
telligence. UAI’94. Seattle, WA: Morgan Kaufmann Publishers Inc., 1994, pp. 484–
490. isbn: 1-55860-332-8. url: http://dl.acm.org/citation.cfm?id=2074394.
2074456.
[Pro] Protégé. Project Home page at http : / / protege . stanford . edu. url: http : / /
protege.stanford.edu.
[PRR97] G. Probst, St. Raub, and Kai Romhardt. Wissen managen. 4 (2003). Gabler Verlag,
1997.
[PS08] Eric Prud’hommeaux and Andy Seaborne. SPARQL Query Language for RDF. W3C
Recommendation. World Wide Web Consortium (W3C), Jan. 15, 2008. url: http:
//www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/.
[PW92] J. Scott Penberthy and Daniel S. Weld. “UCPOP: A Sound, Complete, Partial Order
Planner for ADL”. In: Principles of Knowledge Representation and Reasoning: Pro-
ceedings of the 3rd International Conference (KR-92). Ed. by B. Nebel, W. Swartout,
and C. Rich. Cambridge, MA: Morgan Kaufmann, Oct. 1992, pp. 103–114. url: ftp:
//ftp.cs.washington.edu/pub/ai/ucpop-kr92.ps.Z.
[Ran17] Aarne Ranta. Automatic Translation for Consumers and Producers. Presentation
given at the Chalmers Initiative Seminar. 2017. url: https://www.grammaticalframework.
org/~aarne/mt-digitalization-2017.pdf.
[RHN06] Jussi Rintanen, Keijo Heljanko, and Ilkka Niemelä. “Planning as satisfiability: parallel
plans and algorithms for plan search”. In: Artificial Intelligence 170.12-13 (2006),
pp. 1031–1080.
[Rin10] Jussi Rintanen. “Heuristics for Planning with SAT”. In: Proceeedings of the 16th In-
ternational Conference on Principles and Practice of Constraint Programming. 2010,
pp. 414–428.
[RN03] Stuart J. Russell and Peter Norvig. Artificial Intelligence: A Modern Approach. 2nd ed.
Pearson Education, 2003. isbn: 0137903952.
[RN09] Stuart Russell and Peter Norvig. Artificial Intelligence: A Modern Approach. 3rd.
Prentice Hall Press, 2009. isbn: 0136042597, 9780136042594.
[RN95] Stuart J. Russell and Peter Norvig. Artificial Intelligence — A Modern Approach.
Upper Saddle River, NJ: Prentice Hall, 1995.
[RW10] Silvia Richter and Matthias Westphal. “The LAMA Planner: Guiding Cost-Based
Anytime Planning with Landmarks”. In: Journal of Artificial Intelligence Research
39 (2010), pp. 127–177.
[RW91] S. J. Russell and E. Wefald. Do the Right Thing — Studies in limited Rationality.
MIT Press, 1991.
[She24] Esther Shein. The Impact of AI on Computer Science Education. 2024. url: https://cacm.acm.org/news/the-impact-of-ai-on-computer-science-education/.
[Sil+16] David Silver et al. “Mastering the Game of Go with Deep Neural Networks and Tree
Search”. In: Nature 529 (2016), pp. 484–503. url: http://www.nature.com/nature/
journal/v529/n7587/full/nature16961.html.
[Smu63] Raymond M. Smullyan. “A Unifying Principle for Quantification Theory”. In: Proc.
Nat. Acad Sciences 49 (1963), pp. 828–832.
[SR14] Guus Schreiber and Yves Raimond. RDF 1.1 Primer. W3C Working Group Note.
World Wide Web Consortium (W3C), 2014. url: http://www.w3.org/TR/rdf-
primer.
[sTeX] sTeX: A semantic Extension of TeX/LaTeX. url: https://github.com/sLaTeX/
sTeX (visited on 05/11/2020).
[SWI] SWI Prolog Reference Manual. url: https://www.swi-prolog.org/pldoc/refman/
(visited on 10/10/2019).
[Tur50] Alan Turing. “Computing Machinery and Intelligence”. In: Mind 59 (1950), pp. 433–
460.
[Vas+17] Ashish Vaswani et al. “Attention is All you Need”. In: Advances in Neural Infor-
mation Processing Systems. Ed. by I. Guyon et al. Vol. 30. Curran Associates, Inc.,
2017. url: https://proceedings.neurips.cc/paper_files/paper/2017/file/
3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
[Wal75] David Waltz. “Understanding Line Drawings of Scenes with Shadows”. In: The Psy-
chology of Computer Vision. Ed. by P. H. Winston. McGraw-Hill, 1975, pp. 1–19.
[WHI] Human intelligence — Wikipedia The Free Encyclopedia. url: https://en.wikipedia.
org/w/index.php?title=Human_intelligence (visited on 04/09/2018).
Part VIII
Excursions
As this course is predominantly an overview of the topics of Artificial Intelligence, and not
about their theoretical underpinnings, we provide the discussion of these underpinnings as a
“suggested readings” part here.
Appendix A
Completeness of Calculi for Propositional Logic
The next step is to analyze the two calculi for completeness. For that we will first give ourselves
a very powerful tool: the “model existence theorem” (??), which encapsulates the model-theoretic
part of completeness theorems. With that, completeness proofs – which are quite tedious otherwise
– become a breeze.
Corollary: C is complete.
The proof of the model existence theorem goes via the notion of a Hintikka set, a set of
formulae with very strong syntactic closure properties, which allow us to read off models. Jaakko
Hintikka’s original idea for completeness proofs was that for every complete calculus C and every
C-consistent set one can induce a Hintikka set, from which a model can be constructed. This can
be considered as a first model existence theorem. However, the process of obtaining a Hintikka set
for a C-consistent set Φ of sentences usually involves complicated, calculus-dependent constructions.
In this situation, Raymond Smullyan was able to formulate the sufficient conditions for the
existence of Hintikka sets in the form of “abstract consistency properties” by isolating the calculus-independent
parts of the Hintikka set construction. His technique allows us to reformulate Hintikka
sets as maximal elements of abstract consistency classes and to interpret the Hintikka set construction
as a maximizing limit process.
To carry out the “model-existence”/“abstract consistency” method, we will first have to look at
the notion of consistency.
Consistency and refutability are very important notions when studying the completeness of calculi;
they form syntactic counterparts of satisfiability.
Consistency
Let C be a calculus,. . .
Definition A.1.1. Let C be a calculus, then a formula set Φ is called C-refutable, if
there is a refutation, i.e. a derivation of a contradiction from Φ. The act of finding
a refutation for Φ is called refuting Φ.
Definition A.1.2. We call a pair of formulae A and ¬A a contradiction.
So a set Φ is C-refutable, if C can derive a contradiction from it.
It is very important to distinguish the syntactic C-refutability and C-consistency from satisfiability,
which is a property of formulae that is at the heart of semantics. Note that the former have the
calculus (a syntactic device) as a parameter, while the latter does not. In fact we should actually
say S-satisfiability, where ⟨L, K, ⊨⟩ is the current logical system.
Even the word “contradiction” has a syntactical flavor to it; it translates to “saying against
each other” from its Latin root.
A.1. ABSTRACT CONSISTENCY AND MODEL EXISTENCE 713
Abstract Consistency
Definition A.1.6. Let ∇ be a collection of sets. We call ∇ closed under subsets,
iff for each Φ ∈ ∇, all subsets Ψ ⊆ Φ are elements of ∇.
Definition A.1.7 (Notation). We will use Φ∗A for Φ ∪ {A}.
Definition A.1.8. A collection ∇ of sets of propositional formulae is called an
abstract consistency class, iff it is closed under subsets, and for each Φ ∈ ∇
∇c ) P ̸∈ Φ or ¬P ̸∈ Φ for P ∈ V0
∇¬ ) ¬¬A ∈ Φ implies Φ∗A ∈ ∇
∇∨ ) A ∨ B ∈ Φ implies Φ∗A ∈ ∇ or Φ∗B ∈ ∇
∇∧ ) ¬(A ∨ B) ∈ Φ implies Φ ∪ {¬A, ¬B} ∈ ∇
Example A.1.9. The empty set is an abstract consistency class.
Example A.1.10. The set {∅, {Q}, {P ∨Q}, {P ∨Q, Q}} is an abstract consistency
class.
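To see the closure conditions in action, here is a small Python sketch (our own ad-hoc formula encoding and helper names, not part of the course material) that checks closure under subsets and the four conditions for a finite family of finite formula sets, and confirms Example A.1.10.

from itertools import combinations

# Formula encoding (ours): ('var', name), ('not', A), ('or', A, B)
def Not(a): return ('not', a)
def Or(a, b): return ('or', a, b)
P, Q = ('var', 'P'), ('var', 'Q')

def subsets(phi):
    xs = list(phi)
    return [frozenset(c) for r in range(len(xs) + 1) for c in combinations(xs, r)]

def is_abstract_consistency_class(family):
    nabla = {frozenset(phi) for phi in family}
    for phi in nabla:
        if any(psi not in nabla for psi in subsets(phi)):     # closed under subsets
            return False
        for a in phi:
            if a[0] == 'var' and Not(a) in phi:               # nabla_c: no P together with ¬P
                return False
            if a[0] == 'not' and a[1][0] == 'not':            # nabla_not: ¬¬A in Φ implies Φ*A in nabla
                if phi | {a[1][1]} not in nabla:
                    return False
            if a[0] == 'or':                                  # nabla_or: A∨B implies Φ*A or Φ*B in nabla
                if phi | {a[1]} not in nabla and phi | {a[2]} not in nabla:
                    return False
            if a[0] == 'not' and a[1][0] == 'or':             # nabla_and: ¬(A∨B) implies Φ ∪ {¬A, ¬B} in nabla
                if phi | {Not(a[1][1]), Not(a[1][2])} not in nabla:
                    return False
    return True

# Example A.1.10: {∅, {Q}, {P ∨ Q}, {P ∨ Q, Q}}
print(is_abstract_consistency_class([set(), {Q}, {Or(P, Q)}, {Or(P, Q), Q}]))  # True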
So a family of sets (we call it a family, so that we do not have to say “set of sets” and we can
distinguish the levels) is an abstract consistency class, iff it fulfills five simple conditions, of which
the last three are closure conditions.
Think of an abstract consistency class as a family of “consistent” sets (e.g. C-consistent for some
calculus C); then the properties make perfect sense: they are naturally closed under subsets — if
we cannot derive a contradiction from a large set, we certainly cannot derive one from a subset. Furthermore,
∇c ) If both P ∈ Φ and ¬P ∈ Φ, then Φ cannot be “consistent”.
∇¬ ) If we cannot derive a contradiction from Φ with ¬¬A ∈ Φ then we cannot from Φ∗A, since
they are logically equivalent.
The other two conditions are motivated similarly. We will carry out the proof here, since it
gives us practice in dealing with the abstract consistency properties.
The main result here is that abstract consistency classes can be extended to compact ones. The
proof is quite tedious, but relatively straightforward. It allows us to assume that all abstract
consistency classes are compact in the first place (otherwise we pass to the compact extension).
Actually we are after abstract consistency classes that have an even stronger property than just
being closed under subsets. This will allow us to carry out a limit construction in the Hintikka
set extension argument later.
Compact Collections
Definition A.1.12. We call a collection ∇ of sets compact, iff for any set Φ we
have
Φ ∈ ∇, iff Ψ ∈ ∇ for every finite subset Ψ of Φ.
Lemma A.1.13. If ∇ is compact, then ∇ is closed under subsets.
Proof:
1. Suppose S ⊆ T and T ∈ ∇.
2. Every finite subset A of S is a finite subset of T .
3. As ∇ is compact, we know that A ∈ ∇.
4. Thus S ∈ ∇.
The property of being closed under subsets is a “downwards-oriented” property: we go from large
sets to small sets. Compactness (in the interesting direction, anyway) is an “upwards-oriented”
property: we can go from small (finite) sets to large (infinite) sets. The main application for the
compactness condition will be to show that infinite sets of formulae are in a collection ∇ by testing
all their finite subsets (which is much simpler).
Hintikka sets are sets of sentences with very strong analytic closure conditions. These are motivated
as maximally consistent sets i.e. sets that already contain everything that can be consistently
added to them.
∇-Hintikka Set
Definition A.1.14. Let ∇ be an abstract consistency class, then we call a set H ∈ ∇ a
∇-Hintikka set, iff H is maximal in ∇, i.e. for all A with H∗A ∈ ∇ we already have A ∈ H.
Theorem A.1.15 (Hintikka Properties). Let ∇ be an abstract consistency class and H be a
∇-Hintikka set, then
Hc ) A ̸∈ H or ¬A ̸∈ H
H¬ ) ¬¬A ∈ H implies A ∈ H
and the analogous closure properties hold for the other connectives.
Proof:
We prove the properties in turn
1. Hc by induction on the structure of A
1.1. A ∈ V0 Then A ̸∈ H or ¬A ̸∈ H by ∇c .
1.2. A = ¬B
1.2.1. Let us assume that ¬B ∈ H and ¬¬B ∈ H,
1.2.2. then H∗B ∈ ∇ by ∇¬ , and therefore B ∈ H by maximality.
1.2.3. So both B and ¬B are in H, which contradicts the induction hy-
pothesis.
1.3. A = B ∨ C similar to the previous case
2. We prove H¬ by maximality of H in ∇.
2.1. If ¬¬A ∈ H, then H∗A ∈ ∇ by ∇¬ .
2.2. The maximality of H now gives us that A ∈ H.
Proof sketch: other H∗ are similar
The following theorem is one of the main results in the “abstract consistency”/“model existence”
method. For any abstract consistent set Φ it allows us to construct a Hintikka set H with Φ ⊆ H.
Extension Theorem
Theorem A.1.17. If ∇ is an abstract consistency class and Φ ∈ ∇, then there is
a ∇-Hintikka set H with Φ ⊆ H.
Proof:
1. Wlog. we assume that ∇ is compact (otherwise pass to compact extension)
2. We choose an enumeration A1 , . . . of the set wff0 (V0 )
3. and construct a sequence of sets Hi with H0 := Φ and
   Hn+1 := Hn        if Hn ∗An ̸∈ ∇
   Hn+1 := Hn ∗An    if Hn ∗An ∈ ∇
4. Note that all Hi ∈ ∇, choose H := ⋃i∈N Hi
Note that the construction in the proof above is non-trivial in two respects. First, the limit
construction for H is not executed in our original abstract consistency class ∇, but in a suitably
extended one to make it compact — the original would not have contained H in general. Second,
the set H is not unique for Φ, but depends on the choice of the enumeration of wff0 (V0 ). If we pick a
different enumeration, we will end up with a different H. Say A and ¬A are both ∇-consistent
with Φ; then, depending on which one comes first in the enumeration, H will contain that one, with all
the consequences for the subsequent choices in the construction process.
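The order-dependence just discussed can be illustrated with a small Python sketch of the greedy extension over a finite enumeration; the consistency oracle in_nabla is a placeholder we assume (here instantiated by a toy test on literal strings), not something defined in the notes.

def extend_to_hintikka(phi, enumeration, in_nabla):
    """Greedy analogue of the H_n construction over a finite enumeration.

    phi         -- initial formula set (assumed to be in the class)
    enumeration -- the formulae A_1, A_2, ... (finite here)
    in_nabla    -- oracle deciding membership in the abstract consistency class
    """
    h = frozenset(phi)
    for a in enumeration:          # H_{n+1} := H_n * A_n  whenever that set is still in nabla
        if in_nabla(h | {a}):
            h = h | {a}
    return h                       # H := the union of all H_n

def toy_oracle(s):                 # toy consistency: no literal together with its negation
    return not any(('~' + l) in s for l in s if not l.startswith('~'))

print(sorted(extend_to_hintikka(set(), ['p', '~p', 'q'], toy_oracle)))   # ['p', 'q']
print(sorted(extend_to_hintikka(set(), ['~p', 'p', 'q'], toy_oracle)))   # ['q', '~p']

As the two calls show, enumerating p before ~p (or vice versa) yields different maximal sets, exactly as described above.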
Valuation
Definition A.1.18. A function ν : wff0 (V0 ) → D0 is called a (propositional) valuation, iff
ν(¬A) = T, iff ν(A) = F
ν(A ∧ B) = T, iff ν(A) = T and ν(B) = T
Lemma A.1.19. If ν : wff0 (V0 ) → D0 is a valuation and Φ ⊆ wff0 (V0 ) with ν(Φ) = {T}, then Φ is satisfiable.
Proof sketch: ν|V0 : V0 → D0 is a satisfying variable assignment.
Lemma A.1.20. If φ : V0 → D0 is a variable assignment, then Iφ : wff0 (V0 ) → D0 is a valuation.
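As an illustration of Lemma A.1.20, the following Python sketch (ours, with an ad-hoc formula encoding) computes the evaluation function Iφ induced by a variable assignment φ and spot-checks the first valuation condition of Definition A.1.18.

# Formula encoding (ours): ('var', name), ('not', A), ('and', A, B), ('or', A, B)
def evaluate(formula, phi):
    """I_phi: evaluate a propositional formula under the variable assignment phi (dict name -> bool)."""
    op = formula[0]
    if op == 'var':
        return phi[formula[1]]
    if op == 'not':
        return not evaluate(formula[1], phi)
    if op == 'and':
        return evaluate(formula[1], phi) and evaluate(formula[2], phi)
    if op == 'or':
        return evaluate(formula[1], phi) or evaluate(formula[2], phi)
    raise ValueError(f"unknown connective {op!r}")

phi = {'P': True, 'Q': False}
A = ('and', ('var', 'P'), ('not', ('var', 'Q')))
print(evaluate(A, phi))                                        # True
# the negation condition of Definition A.1.18:
print(evaluate(('not', A), phi) == (not evaluate(A, phi)))     # True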
Now, we only have to put the pieces together to obtain the model existence theorem we are after.
Model Existence
Lemma A.1.21 (Hintikka-Lemma). If ∇ is an abstract consistency class and H
a ∇-Hintikka set, then H is satisfiable.
Proof:
1. We define ν(A) := T, iff A ∈ H
2. then ν is a valuation by the Hintikka properties
3. and thus ν|V0 is a satisfying assignment.
Theorem (Model Existence). If ∇ is an abstract consistency class and Φ ∈ ∇, then Φ is satisfiable.
Proof:
1. There is a ∇-Hintikka set H with Φ ⊆ H. (extension theorem)
2. We know that H is satisfiable. (Hintikka-Lemma)
3. In particular, Φ ⊆ H is satisfiable.
Observation: If we look at the completeness proof below, we see that the lemma above is the
only place where we had to deal with specific properties of T0 .
So if we want to prove completeness of any other calculus with respect to propositional logic,
then we only need to prove an analogue of this lemma and can use the rest of the machinery we
have already established “off the shelf”.
This is one great advantage of the “abstract consistency method”; the other is that the method
can be extended transparently to other logics.
Completeness of T0
Corollary A.2.2. T0 is complete.
Proof: by contradiction
1. We assume that A ∈ wff0 (V0 ) is valid, but there is no closed tableau for AF .
2. We have {¬A} ∈ ∇ as ¬AT = AF .
3. so ¬A is satisfiable by the model existence theorem (which is applicable as ∇
is an abstract consistency class by our Lemma above)
4. this contradicts our assumption that A is valid.
Appendix B
Conflict Driven Clause Learning
This Section. We will capture the “what went wrong” in terms of graphs
over literals set during the search, and their dependencies.
What can we learn from that information?
Intuition: The initial vertices are the choice literals and unit clauses of ∆.
Example (constructing the implication graph along a DPLL run; the graph drawings are omitted here):
1. UP Rule: R 7→ T. Implied literal RT .
2. Splitting Rule: 2a. P 7→ F. Choice literal P F .
3a. UP Rule: Q 7→ T. Implied literal QT , with edges (RT ,QT ) and (P F ,QT ).
Finally a conflict vertex 2P T ∨QF is reached, with edges (P F ,2P T ∨QF ) and (QT ,2P T ∨QF ).
[The accompanying figure also lists the clauses P T ∨ QT ; P F ∨ QF ; P T ∨ QF .]
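The following Python sketch (our own data layout, not the course's reference implementation) runs unit propagation while recording, for every implied literal, the clause that forced it; the implication graph edges described above can then be read off these "reasons". The concrete clause set in the usage part is our own choice, picked so that the run reproduces the edges from the example.

def unit_propagate(clauses, assignment, reasons):
    """Run unit propagation to a fixpoint.

    clauses    -- list of clauses; a clause is a list of literals (var, value), value in {True, False}
    assignment -- dict var -> value (choice literals and already implied literals)
    reasons    -- dict var -> the clause that implied it (choice literals have no entry)
    Returns a conflicting clause if one is found, otherwise None.
    """
    changed = True
    while changed:
        changed = False
        for clause in clauses:
            if any(assignment.get(v) == b for (v, b) in clause):   # clause already satisfied
                continue
            unassigned = [(v, b) for (v, b) in clause if v not in assignment]
            if not unassigned:                                     # all literals false: conflict vertex
                return clause
            if len(unassigned) == 1:                               # unit: imply the remaining literal
                v, b = unassigned[0]
                assignment[v] = b
                reasons[v] = clause                                # graph edges run from the other
                changed = True                                     # literals of this clause to (v, b)
    return None

def implication_edges(reasons, assignment):
    """The edges (l_i, l') of the implication graph, read off the recorded reasons."""
    return [((u, assignment[u]), (v, assignment[v]))
            for v, clause in reasons.items() for (u, _) in clause if u != v]

# A clause set chosen (by us) to reproduce the edges described above:
clauses = [[('R', True)],                                # unit clause R^T
           [('R', False), ('P', True), ('Q', True)],     # R^F ∨ P^T ∨ Q^T
           [('P', True), ('Q', False)]]                  # P^T ∨ Q^F
assignment, reasons = {}, {}
unit_propagate(clauses, assignment, reasons)             # implies R^T
assignment['P'] = False                                  # choice literal P^F
conflict = unit_propagate(clauses, assignment, reasons)  # implies Q^T, then P^T ∨ Q^F conflicts
print(conflict)                                          # [('P', True), ('Q', False)]
print(implication_edges(reasons, assignment))            # edges into Q^T from R^T and P^F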
[Figure: DPLL search tree splitting on P , X1 , . . ., Xn , Q, with every branch ending in R ; 2.]
∆ := P F ∨ QF ∨ RT ; P F ∨ QF ∨ RF ; P F ∨ QT ∨ RT ; P F ∨ QT ∨ RF
Θ := X 1 T ∨ . . . ∨ X n T ; X 1 F ∨ . . . ∨ X n F
DPLL on ∆ ; Θ ; Φ with Φ := QF ∨ S T ; QF ∨ S F
Choice literals: P T , (X 1 T ), . . ., (X n T ), QT . Implied literals: RT , S T .
[Figure: implication graph with choice literals P T , QT and X 1 T , . . ., X n T , implied literals RT and S T , and a conflict vertex.]
[Figure: the two possible conflict graphs for this conflict (“Option 1” and “Option 2”), both with conflict vertex 2P F ∨QF .]
Conflict Graphs
A conflict graph captures “what went wrong” in a failed node.
Definition B.1.9 (Conflict Graph). Let ∆ be a clause set, and let Gimpl β be the implication
graph for some search branch β of DPLL on ∆. A subgraph C of Gimpl β is a conflict graph if:
(i) C contains exactly one conflict vertex 2C .
(ii) If l′ is a vertex in C, then all parents of l′ , i.e. vertices li with an edge (li ,l′ ),
are vertices in C as well.
(iii) All vertices in C have a path to 2C .
Conflict graph ≙ starting at a conflict vertex, backchain through the implication
graph until reaching choice literals.
∆ := P F ∨ QF ∨ RT ; P F ∨ QF ∨ RF ; P F ∨ QT ∨ RT ; P F ∨ QT ∨ RF
Θ := X 1 T ∨ . . . ∨ X n T ; X 1 F ∨ . . . ∨ X n F
DPLL on ∆ ; Θ ; Φ with Φ := QF ∨ S T ; QF ∨ S F
Choice literals: P T , (X 1 T ), . . ., (X n T ), QT . Implied literals: RT .
[Figure: the implication graph for this branch and the conflict graph obtained by backchaining from the conflict vertex to the choice literals.]
Clause Learning
Observation: Conflict graphs encode the entailment relation.
Definition B.2.1. Let ∆ be a clause set, C be a conflict graph at some time
point during a run of DPLL on ∆, and L be the choice literals in C, then we call
c := ⋁l∈L l̄ , i.e. the disjunction of the complements of the choice literals in C, the learned clause for C.
Theorem B.2.2. Let ∆, C, and c be as in ??; then ∆ ⊨ c.
Idea: We can add learned clauses to DPLL derivations at any time without losing
soundness. (maybe this helps, if we have a good notion of learned clauses)
Definition B.2.3. Clause learning is the process of adding learned clauses to DPLL
clause sets at specific points. (details coming up)
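A possible Python sketch of Definition B.2.1 (again with our own naming, matching the "reasons" recorded during unit propagation above): backchain from the conflict vertex through the implication graph, collect the choice literals reached, and return the disjunction of their complements as the learned clause.

def learned_clause(conflict_clause, reasons, assignment):
    """Learned clause for the conflict graph rooted at conflict_clause (Definition B.2.1).

    reasons    -- dict var -> clause that implied it (implied literals and unit clauses of ∆)
    assignment -- dict var -> currently assigned value
    Backchains from the conflict vertex; returns the disjunction (as a list) of the
    complements of the choice literals that are reached.
    """
    stack = [v for (v, _) in conflict_clause]      # parents of the conflict vertex
    seen, choices = set(), set()
    while stack:
        v = stack.pop()
        if v in seen:
            continue
        seen.add(v)
        if v in reasons:                           # implied literal: keep backchaining
            stack.extend(u for (u, _) in reasons[v] if u != v)
        else:                                      # choice literal: an initial vertex
            choices.add(v)
    return [(v, not assignment[v]) for v in sorted(choices)]

# Continuing the small example from the unit propagation sketch above:
assignment = {'R': True, 'P': False, 'Q': True}
reasons = {'R': [('R', True)],                          # R^T comes from a unit clause of ∆
           'Q': [('R', False), ('P', True), ('Q', True)]}
conflict = [('P', True), ('Q', False)]                  # the clause P^T ∨ Q^F is violated
print(learned_clause(conflict, reasons, assignment))    # [('P', True)], i.e. the clause P^T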
∆ := P F ∨ QF ∨ RT ; P F ∨ QF ∨ RF ; P F ∨ QT ∨ RT ; P F ∨ QT ∨ RF
DPLL on ∆ ; Θ with Θ := X 1 T ∨ . . . ∨ X n T ; X 1 F ∨ . . . ∨ X n F
Choice literals: P T , (X 1 T ), . . ., (X n T ), QT . Implied literals: RT .
[Figure: conflict graph with choice literals P T and QT (and the X i T ), implied literal RT , and the conflict vertex.]
Learned clause: P F ∨ QF
Example B.2.5. l1 = P , C = P F ∨ QF , l′ = Q.
Observation: Given the earlier choices l1 , . . . , lk , after we learned the new clause
C = l1 ∨ . . . ∨ lk ∨ l′ , the value of l′ is now set by UP!
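A tiny self-contained Python sketch of this observation, on the reading that the learned clause consists of the complements of the earlier choices and of l′ (the encoding is the ad-hoc one from the sketches above): once the assignment is erased back to the earlier choices, the learned clause is unit and UP sets the complement of l′.

# Learned clause P^F ∨ Q^F; the earlier choice P^T is kept, the choice of Q is undone.
learned = [('P', False), ('Q', False)]
assignment = {'P': True}                          # only the earlier choice remains

unassigned = [(v, b) for (v, b) in learned if v not in assignment]
satisfied = any(assignment.get(v) == b for (v, b) in learned)
if not satisfied and len(unassigned) == 1:        # the learned clause has become unit ...
    v, b = unassigned[0]
    assignment[v] = b                             # ... so UP sets Q to F
print(assignment)                                 # {'P': True, 'Q': False}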
So we can continue:
∆ := P F ∨ QF ∨ RT ; P F ∨ QF ∨ RF ; P F ∨ QT ∨ RT ; P F ∨ QT ∨ RF
Θ := X 1 T ∨ . . . ∨ X 100 T ; X 1 F ∨ . . . ∨ X 100 F
DPLL on ∆ ; Θ ; Φ with Φ := P F ∨ QF
Choice literals: P T , (X 1 T ), . . ., (X 100 T ). Implied literals: QF , RT .
[Figure: implication graph with choice literal P T (and the X i T ), implied literals QF and RT , and a conflict vertex.]
Learned clause: P F
∆ := P F ∨ QF ∨ RT ; P F ∨ QF ∨ RF ; P F ∨ QT ∨ RT ; P F ∨ QT ∨ RF
DPLL on ∆ ; Θ with Θ := X 1 T ∨ . . . ∨ X n T ; X 1 F ∨ . . . ∨ X n F
[Figure: DPLL search tree with clause learning: after the decisions P T , X 1 T , . . ., X n T , QT the branch ends in R ; 2 and the clause P F ∨ QF is learned; QF is then set by UP, the branch again ends in R ; 2, and the clause P F is learned.]
Note: Here, the problem could be avoided by splitting over different variables.
Problem: This is not so in general! (see next slide)
B.2. CLAUSE LEARNING 727
Definition B.2.8 (Just for the record). (not relevant for the exam or the exercises)
One could run “DPLL + Clause Learning” by always backtracking to the maximal-level
choice variable contained in the learned clause.
The actual algorithm is called Conflict Driven Clause Learning (CDCL), and
differs from DPLL more radically:
let L := 0; I := ∅
repeat
    execute UP
    if a conflict was reached then    /∗ learned clause C = l1 ∨ . . . ∨ lk ∨ l′ ∗/
        if L = 0 then return UNSAT
        L := max i∈{1,...,k} level(li ); erase I below L
        add C into ∆; add l′ to I at level L
    else
        if I is a total interpretation then return I
        choose a new decision literal l; add l to I at level L
        L := L + 1
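A small Python sketch (ours; it assumes the simple decision-literal-based learned clauses from above, so that exactly one literal of the clause sits at the deepest level) of the backjumping step in this loop: compute the level to jump back to, erase the assignment above it, and assert the remaining literal of the now-unit clause.

def backjump(learned, level, assignment):
    """Backjumping step after a clause has been learned.

    learned    -- the learned clause as a list of (var, value) pairs
    level      -- dict var -> decision level at which the variable was set
    assignment -- dict var -> value (the current interpretation I)
    Returns (L, asserted): the backjump level and the literal asserted by the now-unit clause.
    """
    deepest = max(level[v] for (v, _) in learned)
    older = [level[v] for (v, _) in learned if level[v] < deepest]
    L = max(older) if older else 0
    for v in [v for v in assignment if level[v] > L]:
        del assignment[v]                          # erase the assignment above level L
    v, b = next((v, b) for (v, b) in learned if v not in assignment)
    assignment[v] = b                              # the learned clause has become unit: assert its literal
    return L, (v, b)

# Continuing the running example: decisions P^T (level 1), X1^T (level 2), Q^T (level 3);
# the implied literal R^T was also set at level 3.  Learned clause: P^F ∨ Q^F.
assignment = {'P': True, 'X1': True, 'Q': True, 'R': True}
level = {'P': 1, 'X1': 2, 'Q': 3, 'R': 3}
print(backjump([('P', False), ('Q', False)], level, assignment))   # (1, ('Q', False))
print(assignment)                                                  # {'P': True, 'Q': False}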
Remarks
Which clause(s) to learn?:
While we only selected choice literals above, much more can be done.
For any cut through the conflict graph, with the choice literals on the left-hand
side of the cut and the conflict literals on the right-hand side, the literals on the
left border of the cut yield a learnable clause.
One must take care not to learn too many clauses . . .
Modern SAT solvers successfully tackle practical instances where n > 1,000,000.
The most successful works are empirical. (Interesting theory is mainly concerned
with hand-crafted formulas, like the Pigeon Hole Problem.)
[CKT91] confirmed this for Graph Coloring and Hamiltonian Circuits. Later work
confirmed it for SAT (see previous slides), and for numerous other NP-complete
problems.
Appendix C
Completeness of Calculi for First-Order Logic
We will now analyze the first-order calculi for completeness. Just as in the case of the propositional
calculi, we prove a model existence theorem for the first-order model theory and then use that
for the completeness proofs. The proof of the first-order model existence theorem is completely
analogous to the propositional one; indeed, apart from the model construction itself, it is just an
extension by a treatment for the first-order quantifiers.
The proof of the model existence theorem goes via the notion of a Hintikka set, a set of
formulae with very strong syntactic closure properties, which allow us to read off models. Jaakko
Hintikka’s original idea for completeness proofs was that for every complete calculus C and every
C-consistent set one can induce a Hintikka set, from which a model can be constructed. This can
be considered as a first model existence theorem. However, the process of obtaining a Hintikka set
for a C-consistent set Φ of sentences usually involves complicated, calculus-dependent constructions.
In this situation, Raymond Smullyan was able to formulate the sufficient conditions for the
existence of Hintikka sets in the form of “abstract consistency properties” by isolating the calculus-independent
parts of the Hintikka set construction. His technique allows us to reformulate Hintikka
sets as maximal elements of abstract consistency classes and to interpret the Hintikka set construction
as a maximizing limit process.
To carry out the “model-existence”/“abstract consistency” method, we will first have to look at
the notion of consistency.
Consistency and refutability are very important notions when studying the completeness of calculi;
they form syntactic counterparts of satisfiability.
Consistency
Let C be a calculus,. . .
Definition C.1.1. Let C be a calculus, then a formula set Φ is called C-refutable, if
there is a refutation, i.e. a derivation of a contradiction from Φ. The act of finding
a refutation for Φ is called refuting Φ.
It is very important to distinguish the syntactic C-refutability and C-consistency from satisfiability,
which is a property of formulae that is at the heart of semantics. Note that the former have the
calculus (a syntactic device) as a parameter, while the latter does not. In fact we should actually
say S-satisfiability, where ⟨L, K, ⊨⟩ is the current logical system.
Even the word “contradiction” has a syntactical flavor to it, it translates to “saying against
each other” from its Latin root.
The notion of an “abstract consistency class” provides a calculus-independent notion of consistency:
a set Φ of sentences is considered “consistent in an abstract sense”, iff it is a member of
an abstract consistency class ∇.
Abstract Consistency
Definition C.1.6. Let ∇ be a collection of sets. We call ∇ closed under subsets,
iff for each Φ ∈ ∇, all subsets Ψ ⊆ Φ are elements of ∇.
The conditions are very natural: take for instance ∇c ; it would be foolish to call a set Φ of
sentences “consistent under a complete calculus” if it contains an elementary contradiction. The
next condition ∇¬ says that if a set Φ that contains a sentence ¬¬A is “consistent”, then we should
be able to extend it by A without losing this property; in other words, a complete calculus should
be able to recognize A and ¬¬A to be equivalent. We will carry out the proof here, since it
gives us practice in dealing with the abstract consistency properties.
The main result here is that abstract consistency classes can be extended to compact ones. The
proof is quite tedious, but relatively straightforward. It allows us to assume that all abstract
consistency classes are compact in the first place (otherwise we pass to the compact extension).
Actually we are after abstract consistency classes that have an even stronger property than just
being closed under subsets. This will allow us to carry out a limit construction in the Hintikka
set extension argument later.
Compact Collections
Definition C.1.8. We call a collection ∇ of sets compact, iff for any set Φ we have
Φ ∈ ∇, iff Ψ ∈ ∇ for every finite subset Ψ of Φ.
Lemma C.1.9. If ∇ is compact, then ∇ is closed under subsets.
Proof:
1. Suppose S ⊆ T and T ∈ ∇.
2. Every finite subset A of S is a finite subset of T .
3. As ∇ is compact, we know that A ∈ ∇.
4. Thus S ∈ ∇.
The property of being closed under subsets is a “downwards-oriented” property: we go from large
sets to small sets. Compactness (in the interesting direction, anyway) is an “upwards-oriented”
property: we can go from small (finite) sets to large (infinite) sets. The main application for the
compactness condition will be to show that infinite sets of formulae are in a collection ∇ by testing
all their finite subsets (which is much simpler).
Hintikka sets are sets of sentences with very strong analytic closure conditions. These are motivated
as maximally consistent sets i.e. sets that already contain everything that can be consistently
added to them.
∇-Hintikka Set
Definition C.1.11. Let ∇ be an abstract consistency class, then we call a set
H ∈ ∇ a ∇-Hintikka set, iff H is maximal in ∇, i.e. for all A with H∗A ∈ ∇ we
already have A ∈ H.
Theorem C.1.12 (Hintikka Properties). Let ∇ be an abstract consistency class
and H be a ∇-Hintikka set, then
The following theorem is one of the main results in the “abstract consistency”/“model existence”
method. For any abstract consistent set Φ it allows us to construct a Hintikka set H with Φ ⊆ H.
Extension Theorem
Theorem C.1.13. If ∇ is an abstract consistency class and Φ ∈ ∇ finite, then
there is a ∇-Hintikka set H with Φ ⊆ H.
Proof:
1. Wlog. assume that ∇ compact (else use compact extension)
2. Choose an enumeration A1 , . . . of cwff o (Σι ) and c1 , . . . of Σsk0 .
3. and construct a sequence of sets Hi with H0 := Φ and
   Hn+1 := Hn                            if Hn ∗An ̸∈ ∇
   Hn+1 := Hn ∪ {An , ¬([cn /X](B))}     if Hn ∗An ∈ ∇ and An = ¬(∀X.B)
   Hn+1 := Hn ∗An                        otherwise
4. Note that all Hi ∈ ∇, choose H := ⋃i∈N Hi
5. Ψ ⊆ H finite implies there is a j ∈ N such that Ψ ⊆ Hj ,
6. so Ψ ∈ ∇ as ∇ closed under subsets and H ∈ ∇ as ∇ is compact.
7. Let H∗B ∈ ∇, then there is a j ∈ N with B = Aj , so that B ∈ Hj+1 and
Hj+1 ⊆ H
8. Thus H is ∇-maximal
Note that the construction in the proof above is non-trivial in two respects. First, the limit
construction for H is not executed in our original abstract consistency class ∇, but in a suitably
extended one to make it compact — the original would not have contained H in general. Second,
the set H is not unique for Φ, but depends on the choice of the enumeration of cwff o (Σι ). If
we pick a different enumeration, we will end up with a different H. Say A and ¬A are both
∇-consistent with Φ; then, depending on which one comes first in the enumeration, H will contain
that one, with all the consequences for the subsequent choices in the construction process.
Valuations
Definition C.1.14. A function ν : cwff o (Σι )→D0 is called a (first-order) valuation,
iff ν is a propositional valuation and
ν(∀X.A) = T, iff ν([B/X](A)) = T for all closed terms B.
Thus a valuation is a weaker notion of evaluation in first-order logic; the other direction is also
true, even though the proof of this result is much more involved: the existence of a first-order
valuation that makes a set of sentences true entails the existence of a model that satisfies it.
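To make the closed-term condition concrete, here is a small Python sketch (ours; tiny ad-hoc term and formula encoding, and a finite, truncated set of closed terms standing in for cwff o (Σι )) that checks ν(∀X.A) by substituting every closed term, exactly as in Definition C.1.14.

# Encoding (ours): terms are constants (strings) or variables ('var', name);
# formulas are ('atom', pred, [terms]), ('not', A), ('and', A, B), ('forall', name, A)

def substitute(formula, var, by):
    """[by/var](formula) for a closed term `by`."""
    op = formula[0]
    if op == 'atom':
        return ('atom', formula[1], [by if t == ('var', var) else t for t in formula[2]])
    if op == 'not':
        return ('not', substitute(formula[1], var, by))
    if op == 'and':
        return ('and', substitute(formula[1], var, by), substitute(formula[2], var, by))
    if op == 'forall':
        return formula if formula[1] == var else ('forall', formula[1], substitute(formula[2], var, by))

def nu(formula, true_ground_atoms, closed_terms):
    """A valuation: propositional on ¬ and ∧, plus the closed-term condition for ∀."""
    op = formula[0]
    if op == 'atom':
        return (formula[1], tuple(formula[2])) in true_ground_atoms
    if op == 'not':
        return not nu(formula[1], true_ground_atoms, closed_terms)
    if op == 'and':
        return nu(formula[1], true_ground_atoms, closed_terms) and nu(formula[2], true_ground_atoms, closed_terms)
    if op == 'forall':
        return all(nu(substitute(formula[2], formula[1], b), true_ground_atoms, closed_terms)
                   for b in closed_terms)

closed_terms = ['a', 'b']
true_ground_atoms = {('p', ('a',)), ('p', ('b',))}           # p(a) and p(b) hold
A = ('forall', 'X', ('atom', 'p', [('var', 'X')]))
print(nu(A, true_ground_atoms, closed_terms))                 # True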
Now, we only have to put the pieces together to obtain the model existence theorem we are after.
Model Existence
Theorem C.1.17 (Hintikka-Lemma). If ∇ is an abstract consistency class and
H a ∇-Hintikka set, then H is satisfiable.
Proof:
1. we define ν(A):=T, iff A ∈ H,
2. then ν is a valuation by the Hintikka set properties.
3. We have ν(H) = {T}, so H is satisfiable.
Theorem C.1.18 (Model Existence). If ∇ is an abstract consistency class and
Φ ∈ ∇, then Φ is satisfiable.
Proof:
1. There is a ∇-Hintikka set H with Φ ⊆ H (Extension Theorem)
2. We know that H is satisfiable. (Hintikka-Lemma)
3. In particular, Φ ⊆ H is satisfiable.
This directly yields two important results that we will use for the completeness analysis.
Henkin’s Theorem
Corollary C.2.2 (Henkin’s Theorem). Every ND1 -consistent set of sentences has
a model.
Proof:
1. Let Φ be a ND1 -consistent set of sentences.
2. The class of sets of ND1 -consistent propositions constitutes an abstract consistency
class.
3. Thus the model existence theorem guarantees a model for Φ.
Corollary C.2.3 (Löwenheim & Skolem Theorem). Every satisfiable set Φ of first-order
sentences has a countable model.
Proof sketch: The model we constructed is countable, since the set of ground terms
is.
Now, the completeness result for first-order natural deduction is just a simple argument away.
We also get a compactness theorem (almost) for free: logical systems with a complete calculus are
always compact.
Soundness of T1f
Lemma C.3.1. Tableau rules transform satisfiable tableaux into satisfiable ones.
Proof:
we examine the tableau rules in turn
1. propositional rules as in propositional tableaux
2. T1f ∃ by ??
3. T1f⊥ by ?? (substitution value lemma)
4. T1f ∀
4.1. I φ (∀X.A) = T, iff I φ,[a/X] (A) = T for all a ∈ Dι
4.2. so in particular for some a ∈ Dι ̸= ∅.
Corollary C.3.2. T1f is correct.
The only interesting steps are the cut rule, which can be directly handled by the substitution
value lemma, and the rule for the existential quantifier, which we do in a separate lemma.
Soundness of T1f ∃
5. So ([f (X 1 , . . ., X k )/X](A))F is satisfiable in M′
This proof is paradigmatic for soundness proofs for calculi with Skolemization. We use the axiom
of choice at the meta-level to choose a meaning for the Skolem constant. Armed with the Model
Existence Theorem for first-order logic (??), the completeness of first-order tableaux is similarly
straightforward. We just have to show that the collection of tableau-irrefutable sentences is an
abstract consistency class, which is a simple proof-transformation exercise in all but the universal
quantifier case, which we postpone to its own Lemma (??).
Completeness of (T1f )
[Two tableaux side by side: both consist of ΨT above (∀X.A)F ; the left one continues with ([c/X](A))F and Rest, the right one with ([f (X 1 , . . ., X k )/X](A))F and [f (X 1 , . . ., X k )/c](Rest).]
So we only have to treat the case for the universal quantifier. This is what we usually call a
“lifting argument”, since we have to transform (“lift”) a proof for a formula θ(A) to one for A. In
the case of tableaux we do that by an induction on the tableau refutation for θ(A) which creates
a tableau-isomorphism to a tableau refutation for A.
Tableau-Lifting
Theorem C.3.5. If Tθ is a closed tableau for a set θ(Φ) of formulae, then there is
a closed tableau T for Φ.
Proof: by induction over the structure of Tθ we build an isomorphic tableau T , and
a tableau-isomorphism ω : T → Tθ , such that ω(A) = θ(A).
Only the tableau-substitution rule is interesting.
1. Let (θ(Ai ))T and (θ(Bi ))F be cut formulae in the branch Θiθ of Tθ
2. there is a joint unifier σ of (θ(A1 ))=?(θ(B1 )) ∧ . . . ∧ (θ(An ))=?(θ(Bn ))
3. thus σ ◦ θ is a unifier of A and B
4. hence there is a most general unifier ρ of A1=?B1 ∧ . . . ∧ An=?Bn
5. so Θ is closed.
Again, the “lifting lemma for tableaux” is paradigmatic for lifting lemmata for other refutation
calculi.
Correctness (CNF)
Lemma C.4.1. A set Φ of sentences is satisfiable, iff CNF1 (Φ) is.
Proof:
1. Let (∀X.A)F be satisfiable in M := ⟨D, I⟩ and free(A) = {X 1 , . . ., X k }
2. I φ (∀X.A) = F, so there is an a ∈ D with I φ,[a/X] (A) = F (this only depends on φ|free(A) )
3. let g : Dk → D be defined by g(a1 , . . ., ak ):=a, iff φ(X i ) = ai .
4. choose M′ := ⟨D, I ′ ⟩ with I ′ (f ) := g, then I ′φ ([f (X 1 , . . ., X k )/X](A)) = F
5. Thus ([f (X 1 , . . ., X k )/X](A))F is satisfiable in M′
Resolution (Correctness)
Definition C.4.2. A clause is called satisfiable, iff I φ (A) = α for one of its literals
Aα .
Completeness (R1 )
Theorem C.4.6. R1 is refutation complete.
Proof: ∇ := {Φ | ΦT has no closed tableau} is an abstract consistency class
1. as for propositional case.
2. by the lifting lemma below
3. Let T be a closed tableau for ¬(∀X.A) ∈ Φ and ΦT ∗([c/X](A))F ∈ ∇.
4. CNF1 (ΦT ) = CNF1 (ΨT ) ∪ CNF1 (([f (X 1 , . . ., X k )/X](A))F )
5. ([f (X 1 , . . ., X k )/c](CNF1 (ΦT )))∗([c/X](A))F = CNF1 (ΦT )
6. so R1 : CNF1 (ΦT )⊢D′ 2, where D′ = [f (X 1 , . . ., X k )/c](D).
Lifting for R1
Theorem C.4.10. If R1 : (θ(Φ))⊢Dθ 2 for a set θ(Φ) of formulae, then there is a
R1 -refutation for Φ.
Proof: by induction over Dθ we construct an R1 -derivation R1 : Φ⊢D C and a
θ-compatible clause set isomorphism Ω : D → Dθ
1. If Dθ ends in a resolution step res with premises (θ(A))T ∨ (θ(C)) (from Dθ′ ) and
(θ(B))F ∨ (θ(D)) (from Dθ′′ ) and conclusion (σ(θ(C))) ∨ (σ(θ(D))), then we have (IH)
clause isomorphisms ω′ : AT ∨ C → (θ(A))T ∨ (θ(C)) and ω′′ : BF ∨ D → (θ(B))F ∨ (θ(D)).
2. Thus we can end D in a resolution step Res with premises AT ∨ C and BF ∨ D and
conclusion (ρ(C)) ∨ (ρ(D)), where ρ = mgu(A, B) (it exists, as σ ◦ θ is a unifier).