GaTech Reinforcement Learning

HW1: Value Iteration

HW2: TD($$\lambda$$) -- n-step TD

HW3: SARSA

HW4: Q-Learning

HW5: KWIK

HW6: Game Theory/LP for Rock-paper-stone

Project1: reproduce TD($$\lambda$$)

Project2: Deep Q-Learning for LunaLander

Project3: Q-Learning, Friend/Foe-Q, CE-Q for soccer gamer

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
HW1		HW1
HW2		HW2
HW3		HW3
HW4		HW4
HW5		HW5
HW6		HW6
project1		project1
project2		project2
project3		project3
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md

Provide feedback