neurogammon -...

Neurogammon CJ Bell Matthew Maas Brian Suchland Joe Cartano

Upload: others

Post on 29-Oct-2019

6 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

NeurogammonCJ Bell

Matthew MaasBrian Suchland

Joe Cartano

Page 2: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Backgammon• A zero-sum board game between two players• Players roll dice and choose which checkers to

move• Players can also choose to use the doubling-

cube

Page 3: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Backgammon (continued)• An excellent candidate for an AI program• BUT, the game involves a large element of

chance• Traditional search methods are inefficient• Expert human players rely on judgment, not

search.

Page 4: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Neurogammon• Developed by Gerald Tesauro of IBM• Relies on neural-networks instead of search

Page 5: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Implementation

• Six neural-networks for six different stages of the game. (289-?-?)

• One additional neural-network to determine whether to use the doubling-cube. (best setup: 243-24-9)

Page 6: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Training

• Input: Initial board position and transition to next position

• The first six networks trained on a set of expert’s games, where each move was rated from -100 (worst) to 100.

Page 7: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Training

• The seventh network trained on a separate set of expert games

• 3000 positions covering 64 games (225 set aside for testing)

• Each position was categorized from 1 to 9 by an expert, indicating whether it was a good time to use the doubling-cube.

• The 9 outputs were summed

Page 8: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

First Computer Olympiad

• Held in 1989• Pitted the six premier computer

backgammon programs of the time against each other in a round-robin tournament

• The first serious test of Neurogammon’sabilities

• All five other programs relied on traditional, human-defined board evaluations

Page 9: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

Results of the First Computer Olympiad

COMPUTER OPPONENT RESULTS(FIRST TO 11 POINTS)

Saitek Backgammon 12-9, won by Neurogammon

Mephisto Backgammon 12-5, won by Neurogammon

Backbrain 11-4, won by Neurogammon

AI Backgammon 16-1, won by Neurogammon

Video Gammon 12-7, won by Neurogammon

Page 10: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

TD-GammonVersion Training

GamesOpponents Results

TD-Gammon 0.0 300,000 Computer Programs

Tied for Best

TD-Gammon 1.0 300,000 Various Human Experts

-13 Points / 51 Games

TD-Gammon 2.0 800,000 Various Human Experts

-7 Points / 38 Games

TD-Gammon 2.1 1,500,000 Robertie(Grandmaster)

-1 Point / 40 Games

TD-Gammon 3.0 1,500,000 Kazaros(Grandmaster)

+6 Points / 20 Games

Page 11: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

TD-Gammon is Used to Reevaluate Board Positions

White has just rolled two 4’s, giving it 4 moves of 4 spaces each

Page 12: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

The Traditional Move

The traditionally accepted move in this situation is 8-4, 8-4, 11-7, 11-7

Page 13: Neurogammon - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/07au/notes/neuroslides.pdf · Backgammon • A zero-sum board game between two players • Players

TD-Gammon’s Move

TD-Gammon’s move in this situation is 8-4, 8-4, 21-17, 21-17

Players dribble around area, on coaches command players

Ray Tracing - courses.cs.washington.educourses.cs.washington.edu/courses/cse557/15au/lectures/ray-tracin… · Whitted ray-tracing algorithm In 1980, Turner Whitted introduced ray

Reflection in Java - courses.cs.washington.educourses.cs.washington.edu/.../reflection.pdf · Programming Reflection To program with reflection, we must put on our meta-thinking caps

Building Java Programs - courses.cs.washington.educourses.cs.washington.edu/courses/cse142/12su/lectures/07-02/7-r… · Math.floor(value) rounds down Math.log10(value) logarithm,

courses.cs.washington.educourses.cs.washington.edu/courses/csep561/08au/slides/p561.3.ink.pdfInternetworks Set of interconnected networks, e.g., the Internet Scale and heterogeneity

Replication - courses.cs.washington.educourses.cs.washington.edu/courses/csep545/12wi/slides/10_Replication.pdfthe same effect as a serial execution on a one-copy database. • Readset

A Survey on Virtualization Technologiescourses.cs.washington.edu/courses/cse451/07au/lectures/18-virt.pdf · Virtualization: What is it, really? Real vs. Virtual Similar essence,

Semantic Parsing with CCG - courses.cs.washington.educourses.cs.washington.edu/courses/csep517/13au/... · Logical expression List of logical expressions. ... Constructing Lambda

cse403 scale - courses.cs.washington.educourses.cs.washington.edu/courses/cse403/12au/lectures/cse403_s… · Scaling up through caching database server/ database web servers application

The Google File System - courses.cs.washington.educourses.cs.washington.edu/courses/cse490h/08au/lectures/490h-gfs.pdf · Seagate Barracuda 7200.11 ... transfer: transferring data

Image Stitching - courses.cs.washington.educourses.cs.washington.edu/courses/cse576/16sp/Slides/10_ImageStitching.pdfImage Stitching Ali Farhadi CSE 576 Several slides from Rick Szeliski,

courses.cs.washington.educourses.cs.washington.edu/courses/cse454/10au/student... · Web viewBook verification and deselecting: This second page would be an intermediate page, containing

Introduction to Data Programming - courses.cs.washington.educourses.cs.washington.edu/courses/cse140/14wi/lectures/00-introduction.pdf · programming language –…and you will gain

LASSO Regression - courses.cs.washington.educourses.cs.washington.edu/.../slides/LARS-fusedlasso-annotated.pdfAxel Gandy LASSO and related algorithms 34 LARS – Illustration for p=2

Presentation2 - courses.cs.washington.educourses.cs.washington.edu/courses/csep557/13wi/lectures/markup/… · Bernstein polynomials, cont'd For degree 3, the Bernstein polynomials

Why estimate visual motion? - courses.cs.washington.educourses.cs.washington.edu/courses/cse576/05sp/... · 1 Motion estimation Computer Vision CSE576, Spring 2005 Richard Szeliski

What is ICT for Development - courses.cs.washington.educourses.cs.washington.edu/courses/csep590/04au/lectures/slides/... · What is ICT for Development? ... curriculum with rural

Chapter 4: Greedy Algorithms - courses.cs.washington.educourses.cs.washington.edu/courses/cse417/14sp/slides/04greed.pdf · Interval Scheduling: Greedy Algorithms! Greedy template

Sam Cook Photography - courses.cs.washington.educourses.cs.washington.edu/courses/cse131/12sp/EC_Projects/Sam C… · Sam Cook Photography | 425.381.6417. 611 9Million in Unmarked

ECG filtering - courses.cs.washington.educourses.cs.washington.edu/courses/cse466/13au/pdfs/lectures/ECG... · ECG Filtering Willem Einthoven’s EKG machine, 1903 ... lines as far

Architecture - courses.cs.washington.educourses.cs.washington.edu/.../lectures/lecture-07-architecture.pdfWhy architecture? “Good software architecture makes the rest of the project

Slides by Alex Mariakakis - courses.cs.washington.educourses.cs.washington.edu/.../14sp/sections/section10-designpatter… · Design Patterns 2 . List of Design Patterns We discussed

Python Evaluation Rules - courses.cs.washington.educourses.cs.washington.edu/courses/cse140/13wi/eval_rules.pdf · 2016-08-02 · 3.2.1 Rules for Evaluation To evaluate a compound

Building Java Programs - courses.cs.washington.educourses.cs.washington.edu/courses/cse142/08au/lectures/2008-10-13... · Building Java Programs Chapter 3 ... Java class libraries:

Ruby (on Rails) - courses.cs.washington.educourses.cs.washington.edu/courses/cse190m/09sp/ruby/week1/week1.pdf · About the Section • Introduce the Ruby programming language •

Pipelining vs. Parallel processingcourses.cs.washington.edu/courses/cse378/07au/lectures/L...2 Pipelining vs. Parallel processing In both cases, multiple “things” processed by

cse403 testing part 2 - courses.cs.washington.educourses.cs.washington.edu/courses/cse403/12au/... · Announcements • Deliverables for 11/19 release (due 11:59PM) • Documents

courses.cs.washington.educourses.cs.washington.edu/courses/cse458/05au/help/mayaguide/Com… · Character Setup 3 Table of Contents 1 Character Setup overview . . . . . . . . .

CSE 473: Artificial Intelligence - courses.cs.washington.educourses.cs.washington.edu/courses/cse473/14sp/slides/22-BN-inference.pdfCSE 473: Artificial Intelligence Bayesian Networks:

Card 3000 - courses.cs.washington.educourses.cs.washington.edu/.../final-presentation.pdfStarbucks - 10/20 $1.23 $38.56 $700.89 $2.75 $4.00 RECENT TRANSACTIONS McDonald's rispy Kreme

SECTION 2 - courses.cs.washington.educourses.cs.washington.edu/courses/cse331/15au/sections/sec02.pdf · SECTION 2: HW3 Setup cse331 ... NetBeans, Visual Studio, IntelliJIDEA . ECLIPSE

RNA Secondary Structure - courses.cs.washington.educourses.cs.washington.edu/courses/cse417/06wi/slides/06dp-rna.pdf · •E.g. “riboswitches”: thousands in bacteria. DNA structure:

courses.cs.washington.educourses.cs.washington.edu/.../04sp/pdfs/lectures/L4-MicroBlaze.pdf · Simple MicroBlaze System Block Diagram External to FPGA MicroBlaze LM B_BRAM IF_CN TLR

courses.cs.washington.educourses.cs.washington.edu/courses/cse458/08au/... · Animation 3 Table of Contents 1 Animation Basics . . . . . . . . . . . . . . . . . . . . . . . . .

Computer Animation - courses.cs.washington.educourses.cs.washington.edu/.../animation/Computer_Animation.pdfBasic Animation •What’s the simplest kind of animation you can think