This is repo is the process of learning about SARSA based reinforcement learning.
In the agent-environment relationship, the environment is a matrix with external and internal boundaries and the agent is a 1x1 unit within that maze that starts at a consistent point and most reach a consistent location.