Project: Reinforcement Learning

This project implements model-based and model-free reinforcement learning algorithms.

Value Iteration Agent: Takes an MDP and runs value iteration for a fixed number of iterations before the constructor returns. Asynchronous and prioritized sweeping variants are also implemented.
Q-Learning: An RL agent that learns by trial and error from its interactions with the environment through its update(state, action, nextState, reward) method. Approximate Q-learning is also implemented.
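
The difference between the two agents above can be sketched on a toy problem. This is a minimal illustration, not the code in valueIterationAgents.py or qlearningAgents.py: the two-state MDP, the function names value_iteration and q_update, and the default alpha and discount values are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical two-state MDP (not part of the project):
# TRANSITIONS[state][action] = [(probability, next_state, reward), ...]
TRANSITIONS = {
    "A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 1.0)]},
    "B": {"stay": [(1.0, "B", 2.0)], "go": [(1.0, "A", 0.0)]},
}


def value_iteration(transitions, discount=0.9, iterations=100):
    """Model-based: batch value iteration for a fixed number of sweeps."""
    values = {state: 0.0 for state in transitions}
    for _ in range(iterations):
        # The comprehension reads the previous sweep's values throughout,
        # because `values` is only rebound after it has been fully built.
        values = {
            state: max(
                sum(p * (r + discount * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions.values()
            )
            for state, actions in transitions.items()
        }
    return values


def q_update(q_values, state, action, next_state, reward, legal_actions,
             alpha=0.5, discount=0.9):
    """Model-free: one tabular Q-learning update from a single observed transition."""
    next_value = max((q_values[(next_state, a)] for a in legal_actions), default=0.0)
    sample = reward + discount * next_value
    q_values[(state, action)] += alpha * (sample - q_values[(state, action)])


if __name__ == "__main__":
    print(value_iteration(TRANSITIONS))

    q_values = defaultdict(float)  # unseen (state, action) pairs start at 0.0
    q_update(q_values, "A", "go", "B", 1.0, ["stay", "go"])
    print(dict(q_values))
```

Value iteration needs the full transition table, while q_update only sees one (state, action, nextState, reward) sample at a time, which is why the Q-learning agent can learn by playing games.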

How to Run

In each command, -x 50 sets the number of training episodes and -n 60 the total number of games, so the results reported below cover the 10 games played after training. The -a extractor=... argument selects the feature extractor and -l selects the layout.

python pacman.py -p ApproximateQAgent -a extractor=myExtractor -x 50 -n 60 -l mediumClassic
python pacman.py -p ApproximateQAgent -a extractor=myExtractor -x 50 -n 60 -l mediumGrid
python pacman.py -p ApproximateQAgent -a extractor=myExtractor -x 50 -n 60 -l mylayout

Default result

python pacman.py -p ApproximateQAgent -a extractor=SimpleExtractor -x 50 -n 60 -l mediumClassic

Average Score: 1352.4
Scores: 1346.0, 1549.0, 1320.0, 1340.0, 1320.0, 1322.0, 1321.0, 1343.0, 1338.0, 1325.0
Win Rate: 10/10 (1.00)
Record: Win, Win, Win, Win, Win, Win, Win, Win, Win, Win

Part 1

Develop a sophisticated feature extractor

Implemented feature: Don't run away from ghosts if they are scared.
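
One way such a feature could work is to count only non-scared ghosts as threats, so that the learned penalty for nearby ghosts does not apply while they are scared. The sketch below is an assumption about the idea, not the contents of myExtractor: the ghost representation (position plus scared timer), the feature name, and the helper functions are hypothetical and do not use the project's game state API. It also shows the standard approximate Q-learning weight update that consumes such features.

```python
def scared_aware_features(next_pos, ghosts):
    """Build a feature dict for the position Pacman would move to.

    ghosts: list of ((x, y), scared_timer) pairs, a hypothetical stand-in
    for the information a real extractor would read from the game state.
    """
    def adjacent(a, b):
        # Manhattan distance of at most one step (same cell or a neighbour).
        return abs(a[0] - b[0]) + abs(a[1] - b[1]) <= 1

    features = {"bias": 1.0}
    # Scared ghosts (timer > 0) are ignored, so they do not push Pacman away.
    features["#-of-active-ghosts-1-step-away"] = float(sum(
        1 for pos, scared_timer in ghosts
        if scared_timer == 0 and adjacent(next_pos, pos)
    ))
    return features


def q_value(weights, features):
    """Approximate Q-value: dot product of weights and features."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


def update_weights(weights, features, reward, next_max_q, alpha=0.2, discount=0.8):
    """Approximate Q-learning update: w_i += alpha * difference * f_i."""
    difference = (reward + discount * next_max_q) - q_value(weights, features)
    for name, value in features.items():
        weights[name] = weights.get(name, 0.0) + alpha * difference * value


if __name__ == "__main__":
    ghosts = [((1, 1), 0), ((1, 2), 5)]  # one active ghost, one scared ghost
    feats = scared_aware_features((1, 2), ghosts)
    weights = {}
    update_weights(weights, feats, reward=-10.0, next_max_q=0.0)
    print(feats, weights)
```

Because the scared ghost contributes nothing to the threat feature, a negative weight learned for active ghosts leaves the Q-value of moving toward a scared ghost unchanged.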

python pacman.py -p ApproximateQAgent -a extractor=myExtractor -x 50 -n 60 -l mediumClassic

Average Score: 1595.5
Scores: 1747.0, 1335.0, 1347.0, 1937.0, 1919.0, 1727.0, 1533.0, 1330.0, 1540.0, 1540.0
Win Rate: 10/10 (1.00)
Record: Win, Win, Win, Win, Win, Win, Win, Win, Win, Win

Part 2

Make a layout

The mylayout.lay file is designed to demonstrate the benefit of the feature extractor developed for Part 1. With the new extractor, Pacman eats all the food without running away from scared ghosts and losing points unnecessarily.

python pacman.py -p ApproximateQAgent -a extractor=SimpleExtractor -x 50 -n 60 -l mylayout

Average Score: 525.0
Scores: 529.0, 523.0, 528.0, 525.0, 521.0, 521.0, 528.0, 525.0, 525.0, 525.0
Win Rate: 10/10 (1.00)
Record: Win, Win, Win, Win, Win, Win, Win, Win, Win, Win

python pacman.py -p ApproximateQAgent -a extractor=myExtractor -x 50 -n 60 -l mylayout

Average Score: 778.0
Scores: 913.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0
Win Rate: 10/10 (1.00)
Record: Win, Win, Win, Win, Win, Win, Win, Win, Win, Win
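
The two mylayout runs above can be compared directly from their score lists. The snippet below is only a convenience check on the numbers reported in this README; the variable names are made up for the example.

```python
from statistics import mean

# Scores copied from the two mylayout runs reported above.
simple_scores = [529.0, 523.0, 528.0, 525.0, 521.0, 521.0, 528.0, 525.0, 525.0, 525.0]
my_scores = [913.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0, 763.0]

simple_avg = mean(simple_scores)
my_avg = mean(my_scores)

print(f"SimpleExtractor average: {simple_avg:.1f}")
print(f"myExtractor average:     {my_avg:.1f}")
print(f"Difference:              {my_avg - simple_avg:.1f}")
```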
