Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Difference between Breakout-v0, Breakout-v4 and BreakoutDeterministic-v4? #1280

Closed
tianlinxu312 opened this issue Jan 8, 2019 · 8 comments
Closed

Comments

@tianlinxu312
Copy link

Can anyone explain the difference between Breakout-v0, Breakout-v4 and BreakoutDeterministic-v4?

@xysun
Copy link

xysun commented Feb 24, 2019

If you look at the atari_env source code, essentially:

  • v0 vs v4: v0 has repeat_action_probability of 0.25 (meaning 25% of the time the previous action will be used instead of the new action), while v4 has 0 (always follow your issued action)
  • Deterministic: a fixed frameskip of 4, while for the env without Deterministic, frameskip is sampled from (2,5) (code here)

There is also NoFrameskip-v4 with no frame skip and no action repeat stochasticity.

@tianlinxu312
Copy link
Author

If you look at the atari_env source code, essentially:

  • v0 vs v4: v0 has repeat_action_probability of 0.25 (meaning 25% of the time the previous action will be used instead of the new action), while v4 has 0 (always follow your issued action)
  • Deterministic: a fixed frameskip of 4, while for the env without Deterministic, frameskip is sampled from (2,5) (code here)

There is also NoFrameskip-v4 with no frame skip and no action repeat stochasticity.

Thank you very much. That's very clear.

@xysun
Copy link

xysun commented Feb 28, 2019

Glad to help! Can you close the issue now it's solved?

@christopherhesse
Copy link
Contributor

Thanks for answering @xysun!

@rfali
Copy link

rfali commented Dec 22, 2021

Breakout-v0 vs Breakout-v4
v0: repeat_action_probability of 0.25 (meaning 25% of the time the previous action will be used instead of the new action)
v4: repeat_action_probability of 0 (always follow issued action)

Breakout-v4 vs BreakoutDeterministic-v4 vs BreakoutNoFrameskip-v4
game-vX: frameskip is sampled from (2,5), meaning either 2, 3 or 4 frames are skipped [low: inclusive, high: exclusive]
game-Deterministic-vX: a fixed frame skip of 4
game-NoFrameskip-vX: with no frame skip

Breakout-v4 vs Breakout-ram-v4
game-ram-vX: Observation Space (128,). Observation is given as a vector of size 128.
game-vX: Observation Space (210,160,3). Observation is given as pixels of Width:210, Height:160, Depth:3

The atari environment source code has been removed from Gym [AFAIK] and you can see it on the ALE's GitHub. See source code here

Other useful references:
This video
ALE v0.7 Blog which introduces the v5 versions of the game [Sep 2021].

@alexxcollins
Copy link

This thread has helped me out a lot.

@rfali - I think you may have a typo and your post should read:

Breakout-v4 vs Breakout-ram-v4
game-ram-vX: Observation Space (128,). Observation is given as a vector of size 128.
game-vX: Observation Space (210,160,3). Observation is given as pixels of Width:210, Height:160, Depth:3

@basictask
Copy link

Dear rfali,
I have checked, what you have written in game-v4 / game-ram-v4 is swapped up as alexxcollins said

@rfali
Copy link

rfali commented May 1, 2022

@alexxcollins @kknydnai thanks for noticing that typo, I have updated my comment!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants