temporal-difference-learning

Here are 16 public repositories matching this topic...

K-Winkles / Multi-stage-TDL-2048

This repository contains my undergraduate thesis source code for Multi-stage Temporal Difference Learning with 2048 as an AI testbed. I reimagined my original C++ implementation in Qt for visualisation purposes.

qt 2048 temporal-difference-learning n-tuple-networks

Updated May 25, 2019
C++

u84819482 / Nano-TD

Star

Tabular TD control in MAZE environment using Q-Learning, SARSA, and Expected SARSA

reinforcement-learning maze q-learning sarsa expected-sarsa temporal-difference-learning

Updated Aug 3, 2024
Jupyter Notebook

tensor-mutator / Symbiosis

Star

A Reinforcement Learning library for solving custom environments

reinforcement-learning deep-learning artificial-intelligence mcts optical-flow deep-q-network model-based-rl prioritized-experience-replay alphazero experience-replay flappy-bird-agent model-free-rl temporal-difference-learning enduro-agent

Updated May 27, 2021
Python

enstit / Racetrack

Star

Temporal Difference Learning for the Racetrack problem

reinforcement-learning temporal-difference-learning

Updated Nov 5, 2023
Jupyter Notebook

Mahsatajik / Reinforcement-Learning

Star

University of Tehran-Reinforcement Learning Fall 2022

python reinforcement-learning monte-carlo deep-reinforcement-learning dqn reinforcement-learning-algorithms dynamic-programming markov-decision-processes policy-iteration value-iteration object-oriented-programming gym-environment temporal-difference-learning sarsa-algorithm q-learning-algorithm

Updated May 25, 2024
Jupyter Notebook

sohaila-ahmed3011 / RobotNavigation

Star

This RL project aims to make robot navigate from start to an end goal. Two cases for continuous and discrete action spaces are implemented.

reinforcement-learning dqn ddpg-algorithm temporal-difference-learning

Updated Nov 21, 2022
Jupyter Notebook

CrosleyZack / cse574

Star

Course work for CSE 574 Planning and Learning Methods in AI

markov-chain q-learning capture-the-flag dynamic-programming hidden-markov-model markov-decision-processes temporal-difference-learning partially-observable-markov-model

Updated Nov 1, 2021
Python

HridayM25 / ReinforcementLearning

Star

Some algorithms of Reinforcement Learning implemented by me, in accordance to "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.

monte-carlo policy-gradient reinforcement-learning-algorithms markov-decision-processes bandit-algorithms temporal-difference-learning policy-control

Updated Sep 8, 2024
Jupyter Notebook

VEXLife / Accelerated-TD

Star

My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

reinforcement-learning reinforcement-learning-algorithms td atd random-walk temporal-differencing-learning temporal-difference temporal-difference-algorithms temporal-difference-learning accelerated-td

Updated Jan 25, 2024
Python

Develop-Packt / Introduction-to-Temporal-Difference-Learning

Star

This module introduces temporal-difference learning and focuses on how it develops over the ideas of both Monte Carlo methods, and dynamic programming.

python reinforcement-learning ai tensorflow-2 temporal-difference-learning

Updated Jun 11, 2020
Jupyter Notebook

ziap / 2048-tdl

Star

Temporal difference learning for 2048

agent reinforcement-learning expectimax 2048-ai 2048-solver temporal-difference-learning n-tuple-networks

Updated Jul 13, 2023
HTML

ZikangZhou / nim_rl

Star

A reinforcement learning framework for the game of Nim.

reinforcement-learning q-learning dqn sarsa dynamic-programming policy-iteration value-iteration expected-sarsa monte-carlo-methods double-q-learning temporal-difference-learning double-sarsa double-expected-sarsa n-step-bootstrapping n-step-sarsa n-step-expected-sarsa off-policy-n-step-sarsa off-policy-n-step-expected-sarsa n-step-tree-backup

Updated Nov 21, 2020
C++

UniBwTAS / CollisionPro

Star

A framework for collision probability distribution estimation via deep temporal difference learning

python reinforcement-learning robotics tensorflow end-to-end deep-reinforcement-learning collision-detection autonomous-driving xai temporal-difference-learning

Updated Sep 14, 2024
Python

NikolaZubic / AppliedGameTheoryHomeworkSolutions

Star

Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences

tic-tac-toe blackjack q-learning game-theory minimax-algorithm multi-armed-bandit softmax monte-carlo-methods bellman-ford-algorithm evolutionary-game-theory sarsa-learning temporal-difference-learning markov-decision-process softmax-policy cournot-competition instigation-game applied-game-theory

Updated Jan 18, 2021
Jupyter Notebook

moporgic / TDL2048

Star

The Most Efficient Temporal Difference Learning Framework for 2048

machine-learning framework machine-learning-algorithms 2048 2048-game 2048-ai temporal-difference-algorithms 2048-solver temporal-difference-learning n-tuple-networks

Updated Sep 4, 2024
C++

qpwoeirut / 2048-solver

Star

A set of AIs for the 2048 tile-merging game. Includes an expectimax strategy that reaches 16384 with 34.6% success and an ML model trained with temporal difference learning.

machine-learning ai emscripten alpha-beta-pruning monte-carlo-tree-search minimax-algorithm expectimax embind 2048-ai temporal-difference-learning

Updated Mar 20, 2023
C++

Improve this page

Add a description, image, and links to the temporal-difference-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the temporal-difference-learning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

temporal-difference-learning

Here are 16 public repositories matching this topic...

K-Winkles / Multi-stage-TDL-2048

u84819482 / Nano-TD

tensor-mutator / Symbiosis

enstit / Racetrack

Mahsatajik / Reinforcement-Learning

sohaila-ahmed3011 / RobotNavigation

CrosleyZack / cse574

HridayM25 / ReinforcementLearning

VEXLife / Accelerated-TD

Develop-Packt / Introduction-to-Temporal-Difference-Learning

ziap / 2048-tdl

ZikangZhou / nim_rl

UniBwTAS / CollisionPro

NikolaZubic / AppliedGameTheoryHomeworkSolutions

moporgic / TDL2048

qpwoeirut / 2048-solver

Improve this page

Add this topic to your repo