0% found this document useful (0 votes)

18 views78 pages

Adversarial Search in Game Theory

The document discusses adversarial search and game playing. It describes the minimax algorithm and how it can be used to find optimal moves in two-player perfect information games by searching the game tree. It also discusses enhancements like alpha-beta pruning which prune portions of the tree that cannot improve the optimal choice.

Uploaded by

paksmiler

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views78 pages

Adversarial Search in Game Theory

Uploaded by

paksmiler

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Adversarial Search

Adversarial Search
Game playing
Perfect play
The minimax algorithm
alpha-beta pruning

Resource limitations
Elements of chance
Imperfect information

Game Playing State-of-the-Art

Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley
in 1994. Used an endgame database defining perfect play for all positions involving
8 or fewer pieces on the board, a total of 443,748,401,247 positions. Checkers is
now solved!
Chess: Deep Blue defeated human world champion Gary Kasparov in a six-game
match in 1997. Deep Blue examined 200 million positions per second, used very
sophisticated evaluation and undisclosed methods for extending some lines of
search up to 40 ply. Current programs are even better, if less historic.
Othello: Human champions refuse to compete against computers, which are too
good.
Go: Human champions are beginning to be challenged by machines, though the
best humans still beat the best machines. In go, b > 300, so most programs use
pattern knowledge bases to suggest plausible moves, along with aggressive
pruning.
Pacman: unknown

What kind of games?

Abstraction: To describe a game we must capture every
relevant aspect of the game. Such as:
Chess
Tic-tac-toe

Accessible environments: Such games are characterized by

perfect information
Search: game-playing then consists of a search through
possible game positions
Unpredictable opponent: introduces uncertainty thus gameplaying must deal with contingency problems

Type of games

Game Playing
Many different kinds of games!
Axes:

Deterministic or stochastic?
One, two, or more players?
Perfect information (can you see the state)?
Turn taking or simultaneous action?

Want algorithms for calculating a strategy (policy) which

recommends a move in each state

Deterministic Games
Deterministic, single player,
perfect information:

Know the rules

Know what actions do
Know when you win
E.g. Freecell, 8-Puzzle, Rubiks cube

its just search!

Slight reinterpretation:
Each node stores a value: the best
outcome it can reach
This is the maximal outcome of its
children (the max value)
Note that we dont have path sums
as before (utilities at end)

After search, can pick move that

leads to best node

Deterministic Two-Player
E.g. tic-tac-toe, chess, checkers
Zero-sum games
One player maximizes result
The other minimizes result

Minimax search

A state-space search tree

Players alternate
Each layer, or ply, consists of around of moves*
Choose move to position with highest minimax
value = best achievable utility against best play

Games vs. search problems

Unpredictable" opponent solution is a strategy specifying a move for every

possible opponent reply

Time limits unlikely to find goal, must approximate

Plan of attack:
Computer considers possible lines of play (Babbage, 1846)
Algorithm for perfect play (Zermelo, 1912; Von Neumann, 1944)
Finite horizon, approximate evaluation (Zuse, 1945; Wiener, 1948; Shannon,
1950)
First chess program (Turing, 1951)
Machine learning to improve evaluation accuracy (Samuel, 1952- 57)
Pruning to allow deeper search (McCarthy, 1956)

Searching for the next move

Complexity: many games have a huge search space

Chess:

b = 35, m=100 nodes = 35 100

if each node takes about 1 ns to explore then each move will
take about 10 50 millennia to calculate.

Resource (e.g., time, memory) limit: optimal solution not

feasible/possible, thus must approximate

Pruning: makes the search more efficient by discarding

portions of the search tree that cannot improve quality
result.

Evaluation functions: heuristics to evaluate utility of a state

without exhaustive search.

Two-player games
A game formulated as a search problem

Initial state:
Operators:
Terminal state:
Utility function:
of the

board position and turn

definition of legal moves
conditions for when game is over
a numeric value that describes the outcome
game. E.g., -1, 0, 1 for loss, draw,
win. (AKA payoff function)

Example: Tic-Tac-Toe

CS561 - Lecture 7-8 - Macskassy - Spring 2011

The minimax algorithm

Perfect play for deterministic environments with perfect information

Basic idea: choose move with highest minimax value

= best achievable payoff against best play

Algorithm:

1. Generate game tree completely

x o x
ox
x o
x ox
ox
x oo

x o x
o ox
x o

x o x
ox
o xo

x o x
o ox
x x o

x o x
x ox
o x o
20

What is a good move?

x o x
ox
o
x o x
x ox
o
x o x
x ox
o
o
x o x
x ox
o x o

x o x
x ox
oo

x ox
ox
x
o
x ox
o ox
x
o
x ox
o ox
x xo

CS561 - Lecture 7-8 - Macskassy - Spring 2011

win
lose
draw

x o x
ox
x o
x ox
ox
x oo

x o x
o ox
x o

x o x
ox
o xo

x o x
o ox
x x o

x o x
x ox
o x o
20

MiniMax Example
MAX

MIN

CS561 - Lecture 7-8 - Macskassy - Spring 2011

MiniMax: Recursive Implementation

CS561 - Lecture 7-8 - Macskassy - Spring 2011

Minimax Properties
Optimal against a perfect player. Otherwise?
Time

complexity?

max

O(b )
m

Space

complexity?

O(bm)
For

min

chess, b=35, m=100

Exact solution is completely infeasible

But, do we need to explore the whole tree?

100

Resource Limits
max

Cannot search to leaves

Depth-limited search
Instead, search a limited depth of tree

-2

min
-1

-2

min
9

Replace terminal utilities with an eval

function for non-terminal positions

Guarantee of optimal play is gone

More plies makes a BIG difference

Example:
Suppose we have 100 seconds, can explore 10K nodes / sec
So can check 1M nodes per move
reaches about depth 8 decent chess program

Evaluation Functions
Function which scores non-terminals

Ideal
In

function: returns the utility of the position

practice: typically weighted linear sum of features:

e.g.

f1(s) = (num white queens num black queens), etc.

Evaluation Functions

Why Pacman starves

knows his score will

go up by eating the dot now

He knows his score will go up
just as much by eating the dot later on
There are no point- scoring
opportunities after eating the dot
Therefore, waiting seems
just as good as eating

- pruning: general principle

Player

Opponent

m
If
> v then MAX will chose m so
prune tree under n
Similar for

Opponent
Player

CS561 - Lecture 7-8 - Macskassy - Spring 2011

for MIN

- pruning: example 1
MAX

[-,+]

MIN

- pruning: example 1
[-,+]
[3,+]

MAX

MIN

[3,2]

[-,3]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 1
[-,+]
[3,+]

MAX

MIN

[3,2]

[-,3]

[3,14]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 1
[-,+]
[3,+]

MAX

MIN

[3,2]

[-,3]

[3,14] [3,5]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 1
[-,+]
[3,+]

MAX

MIN

[3,2]

[-,3]

[3,14] [3,5] [3,2]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 1
[-,+]
[3,+]

- pruning: example 2
MAX

[-,+]
[2,+]

MIN

[2,5]
[2,1]

[-,2]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 2
[-,+]
[2,+]

MAX

MIN

[2,8]

[-,2]

[2,5]
[2,1]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 2
[-,+]
[2,+]

MAX

MIN

[2,8]

[-,2]

[2,5]
[2,1]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 2
[-,+] [2,+] [3,
+]

MAX

MIN

[2,8]
[2,3]

[-,2]

[2,5]
[2,1]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 2
[-,+] [2,+] [3,
+]

MAX
Selected move
MIN

[2,8]
[2,3]

[-,2]

[2,5]
[2,1]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
MIN

MAX

[6, ]

MIN

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
[-,6]

MIN

MAX

[6, ]

[-,6]

MIN

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
[-,6]

MIN

MAX

[6, ]

MIN

[-,14]

[-,6]

14
CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
[-,6]

MIN

MAX

[6, ]

MIN

[-,6]

[-,14][-,5]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
[-,6]

MIN

MAX

[6, ]

MIN

[5,6]

[-,14][-,5]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 3
[-,6]

MIN

MAX

[5,6]

[6, ]

MIN

[5,1]

[5,4]

4
57

- pruning: example 4
MIN

MAX

[6, ]

MIN

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 4
[-,6]

MIN

MAX

[6, ]

[-,6]

MIN

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 4
[-,6]

MIN

MAX

[6, ]

MIN

[-,14]

[-,6]

14
CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 4
[-,6]

MIN

MAX

[6, ]

MIN

[-,6]

[-,14][-,7]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 4
[-,6]

MIN

MAX

[6, ]

MIN

[7,6]

[-,14][-,7]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: example 4
[-,6]

MIN
Selected move
MAX

[6, ]

MIN

[7,6]

[-,14][-,7]

CS561 - Lecture 7-8 - Macskassy - Spring 2011

- pruning: general principle

Player

Opponent

m
If
> v then MAX will chose m so
prune tree under n
Similar for

Opponent
Player

CS561 - Lecture 7-8 - Macskassy - Spring 2011

for MIN

The algorithm

Properties algorithm

Resource limits
Standard

approach:
Use CUTOFF-TEST instead of TERMINAL-TEST
e.g., depth limit (perhaps add quiescence search)

Use EVAL instead of UTILITY

i.e., evaluation function that estimates desirability of position

Suppose

we have 100 seconds, and can explore

104 nodes/second

106 nodes per move 358/2

reaches depth 8 pretty good
chess program

Evaluation Functions
Function which scores non-terminals

Ideal
In

function: returns the utility of the position

practice: typically weighted linear sum of features:

e.g.

f1(s) = (num white queens num black queens), etc.

Digression: Exact values don't

matter

Behavior is preserved under any monotonic transformation of

Eval
Only the order matters:

payoff in deterministic games acts as an ordinal utility function

STOCHASTIC GAMES
Dice

rolls increase b: 21 possible

rolls with 2 dice
Backgammon
20 legal
moves
Depth 4 = 20 x (21 x 20)3
x 109

1.2

depth increases, probability

of
reaching a given node shrinks

So value of lookahead is
diminished
So limiting depth is less
damaging
pruning is much less
effective
CS561 - Lecture 7-8 - Macskassy - Spring 2011
TDGammon

uses depth-2 search +

Nondeterministic games in general

In nondeterministic games, chance introduced by dice,
card-shuffling
Simplified example with coin-flipping:

Algorithm for nondeterministic

games
Expectiminimax gives perfect play
Just like Minimax, except we must also handle chance nodes:

if state is a Max node then

return the highest ExpectiMinimax-Value of Successors(state)

if state is a Min node then

return the lowest ExpectiMinimax-Value of Successors(state)

if state is a chance node then

return average of ExpectiMinimax-Value of Successors(state)

Expectiminimax

Digression: Exact values DO matter

Behavior is preserved only by positive linear

transformation of Eval
Hence Eval should be proportional to the expected
payoff

Games of imperfect information

E.g., card games, where opponent's initial cards are unknown
Typically we can calculate a probability for each possible deal
Seems just like having one big dice roll at the beginning of the
game
Idea: compute the minimax value of each action in each deal,
then choose the action with highest expected value over all deals
Special case: if an action is optimal for all deals, it's optimal.
GIB, current best bridge program, approximates this idea by
1) generating 100 deals consistent with bidding information
2) picking the action that wins most tricks on average

Example
Four-card bridge/whist/hearts hand, Max to play first

Commonsense example

Proper analysis
* Intuition that the value of an action is the average of
its values in all actual states is WRONG
With partial observability, value of an action depends on
the information state or belief state the agent is in
Can generate and search a tree of information states
Leads to rational behaviors such as
Acting to obtain information
Signalling to one's partner
Acting randomly to minimize information disclosure

Minimax Algorithm in Game AI
No ratings yet
Minimax Algorithm in Game AI
109 pages
Adversarial Search in Game Playing
No ratings yet
Adversarial Search in Game Playing
41 pages
2023 Lecture05 AdversarialSearch 1
No ratings yet
2023 Lecture05 AdversarialSearch 1
44 pages
Adversarial Search in Game Algorithms
No ratings yet
Adversarial Search in Game Algorithms
23 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
57 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
44 pages
Lecture 4 AI
No ratings yet
Lecture 4 AI
69 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
46 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
51 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
51 pages
Game Playing and AI Strategies
No ratings yet
Game Playing and AI Strategies
30 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
29 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
12 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
77 pages
2025 CSC14003 Lecture03 Minimax
No ratings yet
2025 CSC14003 Lecture03 Minimax
52 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
34 pages
Game Playing and Minimax in AI
No ratings yet
Game Playing and Minimax in AI
16 pages
4.1.L22 - 23 - Probelm Solving Agents - Adversarial Search
No ratings yet
4.1.L22 - 23 - Probelm Solving Agents - Adversarial Search
29 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
50 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
68 pages
AI Lecture: Minimax and Game Strategies
No ratings yet
AI Lecture: Minimax and Game Strategies
42 pages
Adversarial Search in AI: MiniMax Explained
No ratings yet
Adversarial Search in AI: MiniMax Explained
15 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
88 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
65 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
54 pages
Adversarial Search in Game AI
No ratings yet
Adversarial Search in Game AI
27 pages
AI Game Playing: Minimax & Nim
No ratings yet
AI Game Playing: Minimax & Nim
85 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
8 pages
Adversarial Search in AI: MiniMax Explained
No ratings yet
Adversarial Search in AI: MiniMax Explained
13 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
41 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
21 pages
Adversarial Search in AI Games
No ratings yet
Adversarial Search in AI Games
36 pages
AI in Games: Minimax & Alpha-Beta Pruning
No ratings yet
AI in Games: Minimax & Alpha-Beta Pruning
79 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
85 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
42 pages
Alpha-Beta Pruning in Game Theory
No ratings yet
Alpha-Beta Pruning in Game Theory
54 pages
Game Theory and Adversarial Search Techniques
No ratings yet
Game Theory and Adversarial Search Techniques
72 pages
AI Notes Unit - 2 FULL
No ratings yet
AI Notes Unit - 2 FULL
50 pages
Advanced Search Methods in AI
No ratings yet
Advanced Search Methods in AI
37 pages
Understanding Game Playing in AI
No ratings yet
Understanding Game Playing in AI
114 pages
Game Playing Strategies and AI Techniques
No ratings yet
Game Playing Strategies and AI Techniques
4 pages
Minimax Algorithm in Adversarial Games
No ratings yet
Minimax Algorithm in Adversarial Games
160 pages
Ch-6 Adverserial Search
No ratings yet
Ch-6 Adverserial Search
32 pages
AI Strategies in Game Theory
No ratings yet
AI Strategies in Game Theory
51 pages
Game Playing and AI Techniques
No ratings yet
Game Playing and AI Techniques
4 pages
Game Playing Strategies in AI
No ratings yet
Game Playing Strategies in AI
32 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
101 pages
Chapter 6 (CS-4011)
No ratings yet
Chapter 6 (CS-4011)
38 pages
Unit4 - AdversarialSearch, Various Types
No ratings yet
Unit4 - AdversarialSearch, Various Types
34 pages
Minimax Algorithm in Adversarial Games
No ratings yet
Minimax Algorithm in Adversarial Games
88 pages
Minimax and Alpha-Beta Pruning in AI
No ratings yet
Minimax and Alpha-Beta Pruning in AI
164 pages
AI Strategies in Game Playing
No ratings yet
AI Strategies in Game Playing
55 pages
Adversarial Search in Game Theory
No ratings yet
Adversarial Search in Game Theory
17 pages
Optimal Decisions in AI Multiplayer Games
No ratings yet
Optimal Decisions in AI Multiplayer Games
68 pages
Game Theory: Minimax & Alpha-Beta Pruning
No ratings yet
Game Theory: Minimax & Alpha-Beta Pruning
23 pages
AI Game Playing Techniques Overview
No ratings yet
AI Game Playing Techniques Overview
5 pages
Adversarial Search in Game AI
No ratings yet
Adversarial Search in Game AI
36 pages
Bayesian Belief Nets Overview
No ratings yet
Bayesian Belief Nets Overview
45 pages
Uninformed Search Strategies Explained
No ratings yet
Uninformed Search Strategies Explained
23 pages
Understanding KNNL in Regression Analysis
No ratings yet
Understanding KNNL in Regression Analysis
23 pages
Lec 2
No ratings yet
Lec 2
20 pages
FuzzyLogic Intro
No ratings yet
FuzzyLogic Intro
34 pages
JADE Platform Overview and Setup Guide
100% (1)
JADE Platform Overview and Setup Guide
50 pages
Fuzzy Logic Overview and Applications
No ratings yet
Fuzzy Logic Overview and Applications
15 pages
Flynn's Taxonomy of Computer Architectures
No ratings yet
Flynn's Taxonomy of Computer Architectures
13 pages
Survey of Steganography: With An Emphasis On Audio Techniques
No ratings yet
Survey of Steganography: With An Emphasis On Audio Techniques
34 pages
Cloud 2.0 for Game Development
No ratings yet
Cloud 2.0 for Game Development
33 pages
Error Detection and Correction Techniques
No ratings yet
Error Detection and Correction Techniques
76 pages
Understanding Automata Theory Basics
No ratings yet
Understanding Automata Theory Basics
37 pages
MarCode Installation Confirmation
No ratings yet
MarCode Installation Confirmation
3 pages
Indian Mobile Number Codes Overview
No ratings yet
Indian Mobile Number Codes Overview
353 pages
Ordinal Theory Handbook for Satoshis
No ratings yet
Ordinal Theory Handbook for Satoshis
108 pages
Ward Clerk Supervisor Job Overview
No ratings yet
Ward Clerk Supervisor Job Overview
4 pages
Algorithm Analysis Lab Manual
No ratings yet
Algorithm Analysis Lab Manual
85 pages
Pic16c505-04 P075
No ratings yet
Pic16c505-04 P075
80 pages
Columbia SEAS CS Major Guide
No ratings yet
Columbia SEAS CS Major Guide
9 pages
Discrete Mathematics Final Exam Solutions
No ratings yet
Discrete Mathematics Final Exam Solutions
4 pages
Flood Monitoring and Control System
No ratings yet
Flood Monitoring and Control System
16 pages
BLE Algorithms for OLL Cases
No ratings yet
BLE Algorithms for OLL Cases
2 pages
Shanne Herbal Products Overview
No ratings yet
Shanne Herbal Products Overview
15 pages
Hexapod Spider Robot Project Overview
No ratings yet
Hexapod Spider Robot Project Overview
15 pages
Intro to Algorithms & Complexity Basics
No ratings yet
Intro to Algorithms & Complexity Basics
2 pages
Intro to Programming and Data Structures
No ratings yet
Intro to Programming and Data Structures
18 pages
Laxmi Kant Tanwar's Employment Profile
No ratings yet
Laxmi Kant Tanwar's Employment Profile
1 page
Patterns in Algebraic Functions
No ratings yet
Patterns in Algebraic Functions
7 pages
Zenith Bank Account Statement 2025
No ratings yet
Zenith Bank Account Statement 2025
7 pages
Induction Exercises and Problems
No ratings yet
Induction Exercises and Problems
2 pages
BreezeCONFIG Manuale
No ratings yet
BreezeCONFIG Manuale
94 pages
FourNxt Business Analyst Role Overview
No ratings yet
FourNxt Business Analyst Role Overview
2 pages
Security Challenges in Ad Hoc Networks
No ratings yet
Security Challenges in Ad Hoc Networks
6 pages
Grade 8 Computer Science Study Plan
No ratings yet
Grade 8 Computer Science Study Plan
2 pages
Identifying Duplicates in Excel
No ratings yet
Identifying Duplicates in Excel
8 pages
Algorithm Exam Answers-1
No ratings yet
Algorithm Exam Answers-1
25 pages
Sap PP/QM User Manual: Published by Team of SAP Consultants at Saptopjobs
No ratings yet
Sap PP/QM User Manual: Published by Team of SAP Consultants at Saptopjobs
10 pages
WebLogic Server Administration Guide
No ratings yet
WebLogic Server Administration Guide
4 pages
RFID ID Scanner Implementation Proposal
No ratings yet
RFID ID Scanner Implementation Proposal
8 pages
JEE 2025-26 CBT Schedule and Guidelines
No ratings yet
JEE 2025-26 CBT Schedule and Guidelines
1 page
Python Notes 15 Pages
No ratings yet
Python Notes 15 Pages
15 pages
Tableau Performance Optimization Guide
No ratings yet
Tableau Performance Optimization Guide
2 pages