Foundations Of Intelligent Learning Agents (FILA) Assignments
reinforcement-learning
monte-carlo
linear-programming
thompson-sampling
ucb
bootstrapping
multi-armed-bandits
bellman-equation
temporal-differencing-learning
howards-pi
sarsa-learning
kl-ucb
windy-gridworld
intelligent-learning-agents
-
Updated
Nov 8, 2019 - Python