v1.4.0: Agents define their own rewards #125
stephane-caron announced in Announcements
This release brings a great deal of fixes and build system improvements. Notably, agents can now be run interchangeably from either Python or Bazel. The `Reward` class in environments was also refactored: agents can now define their own task-specific rewards. The standing reward is now part of the PPO balancer, while at the environment level the default is simply a survival reward (+1 at each non-failing step). Thanks to @boragokbakan for contributing to this release 👍
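To illustrate the idea of agent-defined rewards, here is a minimal sketch. The class and method names below (`get_reward`, the observation keys, the `pitch_weight` parameter) are assumptions for illustration, not the actual API of this release; we define a stand-in base class rather than importing the real one.

```python
class Reward:
    """Stand-in for the environment-level reward interface (hypothetical)."""

    def get_reward(self, observation: dict) -> float:
        # Environment default: survival reward, +1 at each non-failing step.
        return 1.0


class StandingReward(Reward):
    """Hypothetical task-specific reward an agent could define for itself.

    Penalizes deviation of the base pitch from upright.
    """

    def __init__(self, pitch_weight: float = 1.0):
        self.pitch_weight = pitch_weight

    def get_reward(self, observation: dict) -> float:
        pitch = observation.get("pitch", 0.0)  # assumed observation key
        return 1.0 - self.pitch_weight * pitch**2
```

An agent would then pass its own `Reward` instance to its environment instead of relying on the default survival reward.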
Added

- `--show` CLI argument to the wheel balancer's Bullet target
- `Reward` class for rewards

Changed

- `main.py`
- Moved `StandingReward` to the PPO balancer
- `utils.raspi` function call
- `realtime` submodule in favor of `raspi`

Fixed

- `vcgencheck`
This discussion was created from the release v1.4.0.