Chess as a testing grounds for the oracle approach to AI safety

James D. Miller; Roman V. Yampolskiy; Olle Häggström; Stuart Armstrong

Chess as a testing grounds for the oracle approach to AI safety
Paper i proceeding, 2021

To reduce the danger of powerful super-intelligent AIs, we might make the first such AIs oracles that can only send and receive messages. This paper proposes a possibly practical means of using machine learning to create two classes of narrow AI oracles that would provide chess advice: those aligned with the player's interest, and those that want the player to lose and give deceptively bad advice. The player would be uncertain which type of oracle it was interacting with. As the oracles would be vastly more intelligent than the player in the domain of chess, experience with these oracles might help us prepare for future artificial general intelligence oracles.

Författare

James D. Miller

Smith College

Roman V. Yampolskiy

University of Louisville

Olle Häggström

Chalmers, Matematiska vetenskaper, Tillämpad matematik och statistik

Forskning Andra publikationer

Stuart Armstrong

University of Oxford

CEUR Workshop Proceedings

16130073 (ISSN)

Vol. 2916

2021 Workshop on Artificial Intelligence Safety, AISafety 2021
Virtual, Online, ,

Ämneskategorier

Medieteknik

Språkteknologi (språkvetenskaplig databehandling)

Mediateknik

Mer information

Senast uppdaterat

2021-08-27

Om du har frågor, behöver hjälp, hittar en bugg eller vill ge feedback kan du göra det här nedan. Du når oss också direkt per e-post research.lib@chalmers.se.

Meddelande

Din e-postadress

Research.chalmers.se innehåller information om forskning på Chalmers, publikationer och projekt inklusive information om finansiärer och samarbetspartners.

Läs mer om tjänsten, täckningsgrad och vilka som kan se informationen

Personuppgifter och cookies

Tillgänglighet

Citation Style Language
citeproc-js (Frank Bennett)

Chalmers bibliotek

Chalmers forskning

Chalmers examensarbeten

412 96 GÖTEBORG
TELEFON: 031-772 10 00
WWW.CHALMERS.SE