| Name | Observation Space | Action Space | Paper |
|---|---|---|---|
| SARSA | discrete or continuous | discrete | Sutton and Barto, 2011, Blog Post |
| DQN | discrete or continuous | discrete | [MKSG+13], [MKSR+15], [HGS15] |
| CEM | discrete or continuous | discrete | Szita et al., 2006, Schulman, 2016 |
| DDPG | discrete or continuous | continuous | [LHPH+15] |
| NAF | discrete or continuous | continuous | [GLSL16] |
Twitter-Style Explanations
- DQN
- Like Q-Learning, but represent the current q-function by a neural network as function approximator.
- SARSA
- Initialize the Q-Function $Q: \mathcal{X} \times \mathcal{A} \rightarrow \mathbb{R}$ randomly, adjust it by time. See [Pseudocode](https://martin-thoma.com/probabilistische-planung/#sarsa)
- DDPG
- ?
- NAF
- ?
Comparisons
The Code is on Github.
CartPole-v0
The CartPole-v0 environemnt has
2 actions: move the paddle to the right or to the left. A reward of +1 is given
is the pole is upright. The episode is finished when the pole is more than
15 degrees from vertical or moves more than 2.4 units from the center.
CartPole-v0 defines "solving" as getting average reward of 195.0 over 100 consecutive trials.
| Agent | NN Parameters | Configuration | Time | Test reward |
|---|---|---|---|---|
| CEM | 10 | steps=1000 | 9s | mean= 9.49, std= 0.79, min= 8.00, max=11.00 |
| CEM | 10 | (default, steps=10000) | 39s | mean=77.14, std=44.18, min=41.00, max=200.00 |
| CEM | 10 | steps=100000 | 284s | mean=106.21, std=19.99, min=71.00, max=185.00 |
| CEM | 658 | bigger NN | 60s | mean=42.61, std=36.36, min=10.00, max=200.00 |
| CEM | 658 | steps=10000, bigger NN | 60s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
| DQN | (default) | 40s | mean=42.61, std=36.36, min=10.00, max=200.00 |
The bigger NN is
model = Sequential()
model.add(Flatten(input_shape=input_shape))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(nb_actions))
model.add(Activation("softmax"))
Ok, so the bigger network is important. Also, 1000 training steps are not enough, but 10000 are. Let's see if we can reduce the episode memory. The episode memory is what is used for training.
| Agent | EpisodeParameterMemory | Time | Test reward |
|---|---|---|---|
| CEM | 1000 | 60s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
| CEM | 500 | 65s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
| CEM | 450 | 68s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
| CEM | 400 | 59s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
| CEM | 300 | 51s | mean=103.73, std=37.07, min=55.00, max=200.00 |
| CEM | 200 | 38s | mean=34.22, std= 6.75, min=17.00, max=52.00 |
| CEM | 100 | 56s | mean=82.77, std=25.05, min=32.00, max=172.00 |
CartPole-v1
| Agent | Config | Time | Test reward |
|---|---|---|---|
| CEM | (default) | 100s | mean=461.70, std=66.26, min=264.00, max=500.00 |
| DQN | (default) | 30s | mean=10.62, std= 4.31, min= 8.00, max=30.00 |
All environments
You can list all environments with
#!/usr/bin/env python
"""Print OpenAI Gym Environment data."""
import gym
from gym import envs
envids = [spec.id for spec in envs.registry.all()]
print('<table class="table">')
for i, envid in enumerate(sorted(envids), start=1):
try:
env = gym.make(envid)
observations = env.observation_space
actions = env.action_space
except:
observations = "Error"
actions = "Error"
print(
'<tr><td id="env-{i}">{i}</td>'
'<td><a href="https://gym.openai.com/envs/{envid}/" id="{envid}">'
"{envid}</a></td>"
"<td>{observations}</td>"
"<td>{actions}</td></tr>".format(
i=i, envid=envid, observations=str(observations), actions=str(actions)
)
)
print("</table>")
which gives
| # | Environment | Observation Space | Action Space | Reward Range |
|---|---|---|---|---|
| 1 | Acrobot-v1 | Box(6,) | Discrete(3) | (-inf, inf) |
| 2 | AirRaid-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 3 | AirRaid-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 4 | AirRaid-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 5 | AirRaid-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 6 | AirRaid-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 7 | AirRaid-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 8 | AirRaid-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 9 | AirRaid-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 10 | AirRaidDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 11 | AirRaidDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 12 | AirRaidNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 13 | AirRaidNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 14 | Alien-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 15 | Alien-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 16 | Alien-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 17 | Alien-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 18 | Alien-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 19 | Alien-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 20 | Alien-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 21 | Alien-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 22 | AlienDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 23 | AlienDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 24 | AlienNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 25 | AlienNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 26 | Amidar-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 27 | Amidar-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 28 | Amidar-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 29 | Amidar-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 30 | Amidar-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 31 | Amidar-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 32 | Amidar-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 33 | Amidar-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 34 | AmidarDeterministic-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 35 | AmidarDeterministic-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 36 | AmidarNoFrameskip-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 37 | AmidarNoFrameskip-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 38 | Ant-v1 | Error | Error | Error |
| 39 | Assault-ram-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
| 40 | Assault-ram-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
| 41 | Assault-ramDeterministic-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
| 42 | Assault-ramDeterministic-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
| 43 | Assault-ramNoFrameskip-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
| 44 | Assault-ramNoFrameskip-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
| 45 | Assault-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 46 | Assault-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 47 | AssaultDeterministic-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 48 | AssaultDeterministic-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 49 | AssaultNoFrameskip-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 50 | AssaultNoFrameskip-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
| 51 | Asterix-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 52 | Asterix-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 53 | Asterix-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 54 | Asterix-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 55 | Asterix-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 56 | Asterix-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 57 | Asterix-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 58 | Asterix-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 59 | AsterixDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 60 | AsterixDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 61 | AsterixNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 62 | AsterixNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 63 | Asteroids-ram-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 64 | Asteroids-ram-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 65 | Asteroids-ramDeterministic-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 66 | Asteroids-ramDeterministic-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 67 | Asteroids-ramNoFrameskip-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 68 | Asteroids-ramNoFrameskip-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 69 | Asteroids-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 70 | Asteroids-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 71 | AsteroidsDeterministic-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 72 | AsteroidsDeterministic-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 73 | AsteroidsNoFrameskip-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 74 | AsteroidsNoFrameskip-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 75 | Atlantis-ram-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
| 76 | Atlantis-ram-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
| 77 | Atlantis-ramDeterministic-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
| 78 | Atlantis-ramDeterministic-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
| 79 | Atlantis-ramNoFrameskip-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
| 80 | Atlantis-ramNoFrameskip-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
| 81 | Atlantis-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 82 | Atlantis-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 83 | AtlantisDeterministic-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 84 | AtlantisDeterministic-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 85 | AtlantisNoFrameskip-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 86 | AtlantisNoFrameskip-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
| 87 | BankHeist-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 88 | BankHeist-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 89 | BankHeist-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 90 | BankHeist-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 91 | BankHeist-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 92 | BankHeist-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 93 | BankHeist-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 94 | BankHeist-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 95 | BankHeistDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 96 | BankHeistDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 97 | BankHeistNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 98 | BankHeistNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 99 | BattleZone-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 100 | BattleZone-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 101 | BattleZone-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 102 | BattleZone-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 103 | BattleZone-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 104 | BattleZone-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 105 | BattleZone-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 106 | BattleZone-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 107 | BattleZoneDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 108 | BattleZoneDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 109 | BattleZoneNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 110 | BattleZoneNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 111 | BeamRider-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 112 | BeamRider-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 113 | BeamRider-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 114 | BeamRider-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 115 | BeamRider-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 116 | BeamRider-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 117 | BeamRider-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 118 | BeamRider-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 119 | BeamRiderDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 120 | BeamRiderDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 121 | BeamRiderNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 122 | BeamRiderNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 123 | Berzerk-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 124 | Berzerk-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 125 | Berzerk-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 126 | Berzerk-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 127 | Berzerk-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 128 | Berzerk-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 129 | Berzerk-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 130 | Berzerk-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 131 | BerzerkDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 132 | BerzerkDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 133 | BerzerkNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 134 | BerzerkNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 135 | BipedalWalker-v2 | Error | Error | Error |
| 136 | BipedalWalkerHardcore-v2 | Error | Error | Error |
| 137 | Blackjack-v0 | Tuple(Discrete(32), Discrete(11), Discrete(2)) | Discrete(2) | (-inf, inf) |
| 138 | Bowling-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 139 | Bowling-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 140 | Bowling-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 141 | Bowling-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 142 | Bowling-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 143 | Bowling-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 144 | Bowling-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 145 | Bowling-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 146 | BowlingDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 147 | BowlingDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 148 | BowlingNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 149 | BowlingNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 150 | Boxing-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 151 | Boxing-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 152 | Boxing-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 153 | Boxing-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 154 | Boxing-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 155 | Boxing-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 156 | Boxing-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 157 | Boxing-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 158 | BoxingDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 159 | BoxingDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 160 | BoxingNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 161 | BoxingNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 162 | Breakout-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 163 | Breakout-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 164 | Breakout-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 165 | Breakout-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 166 | Breakout-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 167 | Breakout-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 168 | Breakout-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 169 | Breakout-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 170 | BreakoutDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 171 | BreakoutDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 172 | BreakoutNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 173 | BreakoutNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 174 | CNNClassifierTraining-v0 | Error | Error | Error |
| 175 | CarRacing-v0 | Error | Error | Error |
| 176 | Carnival-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 177 | Carnival-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 178 | Carnival-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 179 | Carnival-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 180 | Carnival-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 181 | Carnival-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 182 | Carnival-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 183 | Carnival-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 184 | CarnivalDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 185 | CarnivalDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 186 | CarnivalNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 187 | CarnivalNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 188 | CartPole-v0 | Box(4,) | Discrete(2) | (-inf, inf) |
| 189 | CartPole-v1 | Box(4,) | Discrete(2) | (-inf, inf) |
| 190 | Centipede-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 191 | Centipede-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 192 | Centipede-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 193 | Centipede-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 194 | Centipede-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 195 | Centipede-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 196 | Centipede-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 197 | Centipede-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 198 | CentipedeDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 199 | CentipedeDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 200 | CentipedeNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 201 | CentipedeNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 202 | ChopperCommand-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 203 | ChopperCommand-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 204 | ChopperCommand-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 205 | ChopperCommand-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 206 | ChopperCommand-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 207 | ChopperCommand-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 208 | ChopperCommand-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 209 | ChopperCommand-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 210 | ChopperCommandDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 211 | ChopperCommandDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 212 | ChopperCommandNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 213 | ChopperCommandNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 214 | CliffWalking-v0 | Discrete(48) | Discrete(4) | (-inf, inf) |
| 215 | ConvergenceControl-v0 | Error | Error | Error |
| 216 | Copy-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
| 217 | CrazyClimber-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 218 | CrazyClimber-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 219 | CrazyClimber-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 220 | CrazyClimber-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 221 | CrazyClimber-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 222 | CrazyClimber-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 223 | CrazyClimber-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 224 | CrazyClimber-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 225 | CrazyClimberDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 226 | CrazyClimberDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 227 | CrazyClimberNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 228 | CrazyClimberNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 229 | DemonAttack-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 230 | DemonAttack-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 231 | DemonAttack-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 232 | DemonAttack-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 233 | DemonAttack-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 234 | DemonAttack-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 235 | DemonAttack-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 236 | DemonAttack-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 237 | DemonAttackDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 238 | DemonAttackDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 239 | DemonAttackNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 240 | DemonAttackNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 241 | DoubleDunk-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 242 | DoubleDunk-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 243 | DoubleDunk-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 244 | DoubleDunk-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 245 | DoubleDunk-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 246 | DoubleDunk-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 247 | DoubleDunk-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 248 | DoubleDunk-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 249 | DoubleDunkDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 250 | DoubleDunkDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 251 | DoubleDunkNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 252 | DoubleDunkNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 253 | DuplicatedInput-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
| 254 | ElevatorAction-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 255 | ElevatorAction-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 256 | ElevatorAction-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 257 | ElevatorAction-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 258 | ElevatorAction-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 259 | ElevatorAction-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 260 | ElevatorAction-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 261 | ElevatorAction-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 262 | ElevatorActionDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 263 | ElevatorActionDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 264 | ElevatorActionNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 265 | ElevatorActionNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 266 | Enduro-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 267 | Enduro-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 268 | Enduro-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 269 | Enduro-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 270 | Enduro-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 271 | Enduro-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 272 | Enduro-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 273 | Enduro-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 274 | EnduroDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 275 | EnduroDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 276 | EnduroNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 277 | EnduroNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 278 | FishingDerby-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 279 | FishingDerby-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 280 | FishingDerby-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 281 | FishingDerby-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 282 | FishingDerby-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 283 | FishingDerby-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 284 | FishingDerby-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 285 | FishingDerby-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 286 | FishingDerbyDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 287 | FishingDerbyDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 288 | FishingDerbyNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 289 | FishingDerbyNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 290 | Freeway-ram-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 291 | Freeway-ram-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 292 | Freeway-ramDeterministic-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 293 | Freeway-ramDeterministic-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 294 | Freeway-ramNoFrameskip-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 295 | Freeway-ramNoFrameskip-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 296 | Freeway-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 297 | Freeway-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 298 | FreewayDeterministic-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 299 | FreewayDeterministic-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 300 | FreewayNoFrameskip-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 301 | FreewayNoFrameskip-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
| 302 | Frostbite-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 303 | Frostbite-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 304 | Frostbite-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 305 | Frostbite-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 306 | Frostbite-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 307 | Frostbite-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 308 | Frostbite-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 309 | Frostbite-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 310 | FrostbiteDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 311 | FrostbiteDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 312 | FrostbiteNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 313 | FrostbiteNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 314 | FrozenLake-v0 | Discrete(16) | Discrete(4) | (-inf, inf) |
| 315 | FrozenLake8x8-v0 | Discrete(64) | Discrete(4) | (-inf, inf) |
| 316 | Go19x19-v0 | Error | Error | Error |
| 317 | Go9x9-v0 | Error | Error | Error |
| 318 | Gopher-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 319 | Gopher-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 320 | Gopher-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 321 | Gopher-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 322 | Gopher-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 323 | Gopher-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 324 | Gopher-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 325 | Gopher-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 326 | GopherDeterministic-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 327 | GopherDeterministic-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 328 | GopherNoFrameskip-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 329 | GopherNoFrameskip-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 330 | Gravitar-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 331 | Gravitar-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 332 | Gravitar-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 333 | Gravitar-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 334 | Gravitar-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 335 | Gravitar-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 336 | Gravitar-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 337 | Gravitar-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 338 | GravitarDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 339 | GravitarDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 340 | GravitarNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 341 | GravitarNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 342 | GuessingGame-v0 | Discrete(4) | Box(1,) | (-inf, inf) |
| 343 | HalfCheetah-v1 | Error | Error | Error |
| 344 | Hero-ram-v0 | Error | Error | Error |
| 345 | Hero-ram-v4 | Error | Error | Error |
| 346 | Hero-ramDeterministic-v0 | Error | Error | Error |
| 347 | Hero-ramDeterministic-v4 | Error | Error | Error |
| 348 | Hero-ramNoFrameskip-v0 | Error | Error | Error |
| 349 | Hero-ramNoFrameskip-v4 | Error | Error | Error |
| 350 | Hero-v0 | Error | Error | Error |
| 351 | Hero-v4 | Error | Error | Error |
| 352 | HeroDeterministic-v0 | Error | Error | Error |
| 353 | HeroDeterministic-v4 | Error | Error | Error |
| 354 | HeroNoFrameskip-v0 | Error | Error | Error |
| 355 | HeroNoFrameskip-v4 | Error | Error | Error |
| 356 | Hex9x9-v0 | Error | Error | Error |
| 357 | Hopper-v1 | Error | Error | Error |
| 358 | HotterColder-v0 | Discrete(4) | Box(1,) | (-inf, inf) |
| 359 | Humanoid-v1 | Error | Error | Error |
| 360 | HumanoidStandup-v1 | Error | Error | Error |
| 361 | IceHockey-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 362 | IceHockey-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 363 | IceHockey-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 364 | IceHockey-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 365 | IceHockey-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 366 | IceHockey-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 367 | IceHockey-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 368 | IceHockey-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 369 | IceHockeyDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 370 | IceHockeyDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 371 | IceHockeyNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 372 | IceHockeyNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 373 | InvertedDoublePendulum-v1 | Error | Error | Error |
| 374 | InvertedPendulum-v1 | Error | Error | Error |
| 375 | Jamesbond-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 376 | Jamesbond-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 377 | Jamesbond-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 378 | Jamesbond-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 379 | Jamesbond-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 380 | Jamesbond-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 381 | Jamesbond-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 382 | Jamesbond-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 383 | JamesbondDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 384 | JamesbondDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 385 | JamesbondNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 386 | JamesbondNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 387 | JourneyEscape-ram-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
| 388 | JourneyEscape-ram-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
| 389 | JourneyEscape-ramDeterministic-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
| 390 | JourneyEscape-ramDeterministic-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
| 391 | JourneyEscape-ramNoFrameskip-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
| 392 | JourneyEscape-ramNoFrameskip-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
| 393 | JourneyEscape-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 394 | JourneyEscape-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 395 | JourneyEscapeDeterministic-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 396 | JourneyEscapeDeterministic-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 397 | JourneyEscapeNoFrameskip-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 398 | JourneyEscapeNoFrameskip-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
| 399 | Kangaroo-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 400 | Kangaroo-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 401 | Kangaroo-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 402 | Kangaroo-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 403 | Kangaroo-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 404 | Kangaroo-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 405 | Kangaroo-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 406 | Kangaroo-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 407 | KangarooDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 408 | KangarooDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 409 | KangarooNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 410 | KangarooNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 411 | KellyCoinflip-v0 | Tuple(Box(1,), Discrete(301)) | Discrete(25000) | (0, 250.0) |
| 412 | KellyCoinflipGeneralized-v0 | Tuple(Box(1,), Discrete(280), Discrete(280), Discrete(280), Box(1,)) | Discrete(20300) | (0, 203.0) |
| 413 | Krull-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 414 | Krull-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 415 | Krull-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 416 | Krull-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 417 | Krull-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 418 | Krull-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 419 | Krull-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 420 | Krull-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 421 | KrullDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 422 | KrullDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 423 | KrullNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 424 | KrullNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 425 | KungFuMaster-ram-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 426 | KungFuMaster-ram-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 427 | KungFuMaster-ramDeterministic-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 428 | KungFuMaster-ramDeterministic-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 429 | KungFuMaster-ramNoFrameskip-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
| 430 | KungFuMaster-ramNoFrameskip-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
| 431 | KungFuMaster-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 432 | KungFuMaster-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 433 | KungFuMasterDeterministic-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 434 | KungFuMasterDeterministic-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 435 | KungFuMasterNoFrameskip-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 436 | KungFuMasterNoFrameskip-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
| 437 | LunarLander-v2 | Error | Error | Error |
| 438 | LunarLanderContinuous-v2 | Error | Error | Error |
| 439 | MontezumaRevenge-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 440 | MontezumaRevenge-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 441 | MontezumaRevenge-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 442 | MontezumaRevenge-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 443 | MontezumaRevenge-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 444 | MontezumaRevenge-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 445 | MontezumaRevenge-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 446 | MontezumaRevenge-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 447 | MontezumaRevengeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 448 | MontezumaRevengeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 449 | MontezumaRevengeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 450 | MontezumaRevengeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 451 | MountainCar-v0 | Box(2,) | Discrete(3) | (-inf, inf) |
| 452 | MountainCarContinuous-v0 | Box(2,) | Box(1,) | (-inf, inf) |
| 453 | MsPacman-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 454 | MsPacman-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 455 | MsPacman-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 456 | MsPacman-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 457 | MsPacman-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 458 | MsPacman-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 459 | MsPacman-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 460 | MsPacman-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 461 | MsPacmanDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 462 | MsPacmanDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 463 | MsPacmanNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 464 | MsPacmanNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
| 465 | NChain-v0 | Discrete(5) | Discrete(2) | (-inf, inf) |
| 466 | NameThisGame-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 467 | NameThisGame-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 468 | NameThisGame-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 469 | NameThisGame-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 470 | NameThisGame-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 471 | NameThisGame-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 472 | NameThisGame-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 473 | NameThisGame-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 474 | NameThisGameDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 475 | NameThisGameDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 476 | NameThisGameNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 477 | NameThisGameNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 478 | OffSwitchCartpole-v0 | Tuple(Discrete(2), Box(4,)) | Discrete(2) | (-inf, inf) |
| 479 | OffSwitchCartpoleProb-v0 | Tuple(Discrete(2), Box(4,)) | Discrete(2) | (-inf, inf) |
| 480 | OneRoundDeterministicReward-v0 | Discrete(1) | Discrete(2) | (-inf, inf) |
| 481 | OneRoundNondeterministicReward-v0 | Discrete(1) | Discrete(2) | (-inf, inf) |
| 482 | Pendulum-v0 | Box(3,) | Box(1,) | (-inf, inf) |
| 483 | Phoenix-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 484 | Phoenix-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 485 | Phoenix-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 486 | Phoenix-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 487 | Phoenix-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 488 | Phoenix-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 489 | Phoenix-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 490 | Phoenix-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 491 | PhoenixDeterministic-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 492 | PhoenixDeterministic-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 493 | PhoenixNoFrameskip-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 494 | PhoenixNoFrameskip-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
| 495 | Pitfall-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 496 | Pitfall-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 497 | Pitfall-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 498 | Pitfall-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 499 | Pitfall-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 500 | Pitfall-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 501 | Pitfall-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 502 | Pitfall-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 503 | PitfallDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 504 | PitfallDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 505 | PitfallNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 506 | PitfallNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 507 | Pong-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 508 | Pong-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 509 | Pong-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 510 | Pong-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 511 | Pong-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 512 | Pong-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 513 | Pong-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 514 | Pong-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 515 | PongDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 516 | PongDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 517 | PongNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 518 | PongNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 519 | Pooyan-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 520 | Pooyan-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 521 | Pooyan-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 522 | Pooyan-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 523 | Pooyan-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 524 | Pooyan-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 525 | Pooyan-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 526 | Pooyan-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 527 | PooyanDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 528 | PooyanDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 529 | PooyanNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 530 | PooyanNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
| 531 | PredictActionsCartpole-v0 | Box(4,) | Tuple(Discrete(2), Discrete(2), Discrete(2), Discrete(2), Discrete(2), Discrete(2)) | (-inf, inf) |
| 532 | PredictObsCartpole-v0 | Box(4,) | Tuple(Discrete(2), Box(4,), Box(4,), Box(4,), Box(4,), Box(4,)) | (-inf, inf) |
| 533 | PrivateEye-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 534 | PrivateEye-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 535 | PrivateEye-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 536 | PrivateEye-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 537 | PrivateEye-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 538 | PrivateEye-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 539 | PrivateEye-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 540 | PrivateEye-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 541 | PrivateEyeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 542 | PrivateEyeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 543 | PrivateEyeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 544 | PrivateEyeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 545 | Pusher-v0 | Error | Error | Error |
| 546 | Qbert-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 547 | Qbert-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 548 | Qbert-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 549 | Qbert-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 550 | Qbert-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 551 | Qbert-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 552 | Qbert-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 553 | Qbert-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 554 | QbertDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 555 | QbertDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 556 | QbertNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 557 | QbertNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 558 | Reacher-v1 | Error | Error | Error |
| 559 | RepeatCopy-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
| 560 | Reverse-v0 | Discrete(3) | Tuple(Discrete(2), Discrete(2), Discrete(2)) | (-inf, inf) |
| 561 | ReversedAddition-v0 | Discrete(4) | Tuple(Discrete(4), Discrete(2), Discrete(3)) | (-inf, inf) |
| 562 | ReversedAddition3-v0 | Discrete(4) | Tuple(Discrete(4), Discrete(2), Discrete(3)) | (-inf, inf) |
| 563 | Riverraid-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 564 | Riverraid-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 565 | Riverraid-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 566 | Riverraid-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 567 | Riverraid-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 568 | Riverraid-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 569 | Riverraid-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 570 | Riverraid-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 571 | RiverraidDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 572 | RiverraidDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 573 | RiverraidNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 574 | RiverraidNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 575 | RoadRunner-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 576 | RoadRunner-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 577 | RoadRunner-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 578 | RoadRunner-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 579 | RoadRunner-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 580 | RoadRunner-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 581 | RoadRunner-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 582 | RoadRunner-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 583 | RoadRunnerDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 584 | RoadRunnerDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 585 | RoadRunnerNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 586 | RoadRunnerNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 587 | Robotank-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 588 | Robotank-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 589 | Robotank-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 590 | Robotank-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 591 | Robotank-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 592 | Robotank-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 593 | Robotank-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 594 | Robotank-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 595 | RobotankDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 596 | RobotankDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 597 | RobotankNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 598 | RobotankNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 599 | Roulette-v0 | Discrete(1) | Discrete(38) | (-inf, inf) |
| 600 | Seaquest-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 601 | Seaquest-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 602 | Seaquest-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 603 | Seaquest-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 604 | Seaquest-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 605 | Seaquest-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 606 | Seaquest-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 607 | Seaquest-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 608 | SeaquestDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 609 | SeaquestDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 610 | SeaquestNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 611 | SeaquestNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 612 | SemisuperPendulumDecay-v0 | Box(3,) | Box(1,) | (-inf, inf) |
| 613 | SemisuperPendulumNoise-v0 | Box(3,) | Box(1,) | (-inf, inf) |
| 614 | SemisuperPendulumRandom-v0 | Box(3,) | Box(1,) | (-inf, inf) |
| 615 | Skiing-ram-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 616 | Skiing-ram-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 617 | Skiing-ramDeterministic-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 618 | Skiing-ramDeterministic-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 619 | Skiing-ramNoFrameskip-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
| 620 | Skiing-ramNoFrameskip-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
| 621 | Skiing-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 622 | Skiing-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 623 | SkiingDeterministic-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 624 | SkiingDeterministic-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 625 | SkiingNoFrameskip-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 626 | SkiingNoFrameskip-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
| 627 | Solaris-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 628 | Solaris-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 629 | Solaris-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 630 | Solaris-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 631 | Solaris-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 632 | Solaris-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 633 | Solaris-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 634 | Solaris-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 635 | SolarisDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 636 | SolarisDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 637 | SolarisNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 638 | SolarisNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 639 | SpaceInvaders-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 640 | SpaceInvaders-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 641 | SpaceInvaders-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 642 | SpaceInvaders-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 643 | SpaceInvaders-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 644 | SpaceInvaders-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 645 | SpaceInvaders-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 646 | SpaceInvaders-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 647 | SpaceInvadersDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 648 | SpaceInvadersDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 649 | SpaceInvadersNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 650 | SpaceInvadersNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 651 | StarGunner-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 652 | StarGunner-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 653 | StarGunner-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 654 | StarGunner-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 655 | StarGunner-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 656 | StarGunner-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 657 | StarGunner-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 658 | StarGunner-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 659 | StarGunnerDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 660 | StarGunnerDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 661 | StarGunnerNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 662 | StarGunnerNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 663 | Striker-v0 | Error | Error | Error |
| 664 | Swimmer-v1 | Error | Error | Error |
| 665 | Taxi-v2 | Discrete(500) | Discrete(6) | (-inf, inf) |
| 666 | Tennis-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 667 | Tennis-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 668 | Tennis-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 669 | Tennis-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 670 | Tennis-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 671 | Tennis-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 672 | Tennis-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 673 | Tennis-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 674 | TennisDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 675 | TennisDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 676 | TennisNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 677 | TennisNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
| 678 | Thrower-v0 | Error | Error | Error |
| 679 | TimePilot-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 680 | TimePilot-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 681 | TimePilot-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 682 | TimePilot-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 683 | TimePilot-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 684 | TimePilot-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 685 | TimePilot-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 686 | TimePilot-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 687 | TimePilotDeterministic-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 688 | TimePilotDeterministic-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 689 | TimePilotNoFrameskip-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 690 | TimePilotNoFrameskip-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
| 691 | Tutankham-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 692 | Tutankham-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 693 | Tutankham-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 694 | Tutankham-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 695 | Tutankham-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
| 696 | Tutankham-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
| 697 | Tutankham-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 698 | Tutankham-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 699 | TutankhamDeterministic-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 700 | TutankhamDeterministic-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 701 | TutankhamNoFrameskip-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 702 | TutankhamNoFrameskip-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
| 703 | TwoRoundDeterministicReward-v0 | Discrete(3) | Discrete(2) | (-inf, inf) |
| 704 | TwoRoundNondeterministicReward-v0 | Discrete(3) | Discrete(2) | (-inf, inf) |
| 705 | UpNDown-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 706 | UpNDown-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 707 | UpNDown-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 708 | UpNDown-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 709 | UpNDown-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
| 710 | UpNDown-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
| 711 | UpNDown-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 712 | UpNDown-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 713 | UpNDownDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 714 | UpNDownDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 715 | UpNDownNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 716 | UpNDownNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
| 717 | Venture-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 718 | Venture-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 719 | Venture-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 720 | Venture-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 721 | Venture-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 722 | Venture-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 723 | Venture-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 724 | Venture-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 725 | VentureDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 726 | VentureDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 727 | VentureNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 728 | VentureNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 729 | VideoPinball-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 730 | VideoPinball-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 731 | VideoPinball-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 732 | VideoPinball-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 733 | VideoPinball-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
| 734 | VideoPinball-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
| 735 | VideoPinball-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 736 | VideoPinball-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 737 | VideoPinballDeterministic-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 738 | VideoPinballDeterministic-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 739 | VideoPinballNoFrameskip-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 740 | VideoPinballNoFrameskip-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
| 741 | Walker2d-v1 | Error | Error | Error |
| 742 | WizardOfWor-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 743 | WizardOfWor-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 744 | WizardOfWor-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 745 | WizardOfWor-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 746 | WizardOfWor-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
| 747 | WizardOfWor-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
| 748 | WizardOfWor-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 749 | WizardOfWor-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 750 | WizardOfWorDeterministic-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 751 | WizardOfWorDeterministic-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 752 | WizardOfWorNoFrameskip-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 753 | WizardOfWorNoFrameskip-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
| 754 | YarsRevenge-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 755 | YarsRevenge-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 756 | YarsRevenge-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 757 | YarsRevenge-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 758 | YarsRevenge-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 759 | YarsRevenge-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 760 | YarsRevenge-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 761 | YarsRevenge-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 762 | YarsRevengeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 763 | YarsRevengeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 764 | YarsRevengeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 765 | YarsRevengeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 766 | Zaxxon-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 767 | Zaxxon-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 768 | Zaxxon-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 769 | Zaxxon-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 770 | Zaxxon-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
| 771 | Zaxxon-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
| 772 | Zaxxon-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 773 | Zaxxon-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 774 | ZaxxonDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 775 | ZaxxonDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 776 | ZaxxonNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
| 777 | ZaxxonNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
To summarize it:
- Most environment have an continous observation space and a discrete action space
- The action space often has about 20 possibilities. The maximum is
KellyCoinflip-v0with 25000 actions. - The observation space has at least one input (
OneRoundNondeterministicReward-v0) and never more than 500 (Taxi-v2) - The observation space shape of
(210, 160, 3)is so common (264 times!) because it is the screen of an Atari game.