Implementation of "A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes" (ICML)
-
Updated
Jun 18, 2022 - Python
Implementation of "A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes" (ICML)
cleaning agent where the environment is partially observable
Virtual reality (VR) environment for studying human-agent decision making.
Forward A* and Adaptive A* for navigation in partially known environments
Grids, mountains, and mysterious problems. Solved with Partially-Observable Markov Decision Procesees. Created at Stanford University, by Pablo Rodriguez Bertorello
Recurrent Policies for Handling Partially Observable Environments
Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games
Scallable partially observable and/or non-Markovian gridworld for planning or reinforcement learning
Multi-agent pursuit in matrix world (pursuitMW)
A robot motion planning simulator that can efficiently navigate partially observable environments using deep learning
Certainty-Equivalent Expectation Maximization: a scalable algorithm for system identification of partially observed systems
Predator-Prey-Grass gridworld environment using PettingZoo, with dynamic deletion and spawning of partially observant agents.
POMDP wrappers for OpenAI Gym
Partially Observable Process Gym
Add a description, image, and links to the partially-observable-environment topic page so that developers can more easily learn about it.
To associate your repository with the partially-observable-environment topic, visit your repo's landing page and select "manage topics."