Prisoner's Dilemma: Introducing a player that follows a Reinforcement Learning algorithm
The Prisoner's Dilemma, formulated in 1950 by Albert W. Tucker at Princeton, characterises a situation in game theory where two players have an interest in cooperating but, in the absence of communication between them, each chooses to betray the other. In 1984, Robert Axelrod published ‘The Evolution of Cooperation’, in which he examined the biological and sociological foundations of cooperation.
Axelrod suggested that reciprocity, i.e. responding to positive actions with positive actions and to negative actions with negative actions, was an essential element of cooperation.
This principle, embodied in the tit-for-tat strategy, proved remarkably effective in Axelrod's computer simulations of the iterated Prisoner's Dilemma. The aim of our project is to introduce an agent playing with a Deep Q-Network (DQN) algorithm against different types of player (a random player, a player following the ‘Tit For Tat’ strategy).
This will enable us to study the DQN player's ability to learn and adapt to different player profiles. Finally, we will be able to see whether a reinforcement learning algorithm can do better than the ‘Tit For Tat’ strategy that won Axelrod's tournaments.
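To make the setting concrete, here is a minimal sketch of the iterated game and the two baseline opponents mentioned above. It assumes the standard payoff values (reward 3, temptation 5, sucker 0, punishment 1); the project's actual payoffs and class names may differ.

```python
import random

# Standard payoff matrix (an assumption; the project may use other values).
# Entry (my_move, opponent_move) -> (my_payoff, opponent_payoff),
# where "C" = cooperate and "D" = defect.
PAYOFFS = {
    ("C", "C"): (3, 3),  # mutual cooperation: reward R = 3
    ("C", "D"): (0, 5),  # sucker's payoff S = 0 vs temptation T = 5
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),  # mutual defection: punishment P = 1
}

class TitForTat:
    """Cooperates on the first round, then mirrors the opponent's last move."""
    def __init__(self):
        self.opponent_last = None

    def play(self):
        return "C" if self.opponent_last is None else self.opponent_last

    def observe(self, opponent_move):
        self.opponent_last = opponent_move

class RandomPlayer:
    """Cooperates or defects with equal probability."""
    def play(self):
        return random.choice(["C", "D"])

    def observe(self, opponent_move):
        pass  # a random player ignores history

def play_rounds(p1, p2, n_rounds):
    """Play n_rounds of the iterated dilemma; return cumulative scores."""
    s1 = s2 = 0
    for _ in range(n_rounds):
        m1, m2 = p1.play(), p2.play()
        r1, r2 = PAYOFFS[(m1, m2)]
        s1, s2 = s1 + r1, s2 + r2
        p1.observe(m2)
        p2.observe(m1)
    return s1, s2
```

A DQN agent would slot into this loop in place of one player, taking the recent history of moves as its state and choosing "C" or "D" as its action. Note that two Tit For Tat players cooperate forever, so over 10 rounds each scores 30.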