Name | Observation Space | Action Space | Paper |
---|---|---|---|
SARSA | discrete or continuous | discrete | Sutton and Barto, 2011, Blog Post |
DQN | discrete or continuous | discrete | [MKSG+13], [MKSR+15], [HGS15] |
CEM | discrete or continuous | discrete | Szita et al., 2006, Schulman, 2016 |
DDPG | discrete or continuous | continuous | [LHPH+15] |
NAF | discrete or continuous | continuous | [GLSL16] |
Twitter-Style Explanations
- DQN
- Like Q-Learning, but represent the current q-function by a neural network as function approximator.
- SARSA
- Initialize the Q-Function $Q: \mathcal{X} \times \mathcal{A} \rightarrow \mathbb{R}$ randomly, adjust it by time. See [Pseudocode](https://martin-thoma.com/probabilistische-planung/#sarsa)
- DDPG
- ?
- NAF
- ?
Comparisons
The Code is on Github.
CartPole-v0
The CartPole-v0
environemnt has
2 actions: move the paddle to the right or to the left. A reward of +1 is given
is the pole is upright. The episode is finished when the pole is more than
15 degrees from vertical or moves more than 2.4 units from the center.
CartPole-v0 defines "solving" as getting average reward of 195.0 over 100 consecutive trials.
Agent | NN Parameters | Configuration | Time | Test reward |
---|---|---|---|---|
CEM | 10 | steps=1000 | 9s | mean= 9.49, std= 0.79, min= 8.00, max=11.00 |
CEM | 10 | (default, steps=10000) | 39s | mean=77.14, std=44.18, min=41.00, max=200.00 |
CEM | 10 | steps=100000 | 284s | mean=106.21, std=19.99, min=71.00, max=185.00 |
CEM | 658 | bigger NN | 60s | mean=42.61, std=36.36, min=10.00, max=200.00 |
CEM | 658 | steps=10000, bigger NN | 60s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
DQN | (default) | 40s | mean=42.61, std=36.36, min=10.00, max=200.00 |
The bigger NN is
model = Sequential()
model.add(Flatten(input_shape=input_shape))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(16))
model.add(Activation("relu"))
model.add(Dense(nb_actions))
model.add(Activation("softmax"))
Ok, so the bigger network is important. Also, 1000 training steps are not enough, but 10000 are. Let's see if we can reduce the episode memory. The episode memory is what is used for training.
Agent | EpisodeParameterMemory | Time | Test reward |
---|---|---|---|
CEM | 1000 | 60s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
CEM | 500 | 65s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
CEM | 450 | 68s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
CEM | 400 | 59s | mean=200.00, std= 0.00, min=200.00, max=200.00 |
CEM | 300 | 51s | mean=103.73, std=37.07, min=55.00, max=200.00 |
CEM | 200 | 38s | mean=34.22, std= 6.75, min=17.00, max=52.00 |
CEM | 100 | 56s | mean=82.77, std=25.05, min=32.00, max=172.00 |
CartPole-v1
Agent | Config | Time | Test reward |
---|---|---|---|
CEM | (default) | 100s | mean=461.70, std=66.26, min=264.00, max=500.00 |
DQN | (default) | 30s | mean=10.62, std= 4.31, min= 8.00, max=30.00 |
All environments
You can list all environments with
#!/usr/bin/env python
"""Print OpenAI Gym Environment data."""
import gym
from gym import envs
envids = [spec.id for spec in envs.registry.all()]
print('<table class="table">')
for i, envid in enumerate(sorted(envids), start=1):
try:
env = gym.make(envid)
observations = env.observation_space
actions = env.action_space
except:
observations = "Error"
actions = "Error"
print(
'<tr><td id="env-{i}">{i}</td>'
'<td><a href="https://gym.openai.com/envs/{envid}/" id="{envid}">'
"{envid}</a></td>"
"<td>{observations}</td>"
"<td>{actions}</td></tr>".format(
i=i, envid=envid, observations=str(observations), actions=str(actions)
)
)
print("</table>")
which gives
# | Environment | Observation Space | Action Space | Reward Range |
---|---|---|---|---|
1 | Acrobot-v1 | Box(6,) | Discrete(3) | (-inf, inf) |
2 | AirRaid-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
3 | AirRaid-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
4 | AirRaid-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
5 | AirRaid-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
6 | AirRaid-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
7 | AirRaid-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
8 | AirRaid-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
9 | AirRaid-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
10 | AirRaidDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
11 | AirRaidDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
12 | AirRaidNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
13 | AirRaidNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
14 | Alien-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
15 | Alien-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
16 | Alien-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
17 | Alien-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
18 | Alien-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
19 | Alien-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
20 | Alien-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
21 | Alien-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
22 | AlienDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
23 | AlienDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
24 | AlienNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
25 | AlienNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
26 | Amidar-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
27 | Amidar-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
28 | Amidar-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
29 | Amidar-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
30 | Amidar-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
31 | Amidar-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
32 | Amidar-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
33 | Amidar-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
34 | AmidarDeterministic-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
35 | AmidarDeterministic-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
36 | AmidarNoFrameskip-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
37 | AmidarNoFrameskip-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
38 | Ant-v1 | Error | Error | Error |
39 | Assault-ram-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
40 | Assault-ram-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
41 | Assault-ramDeterministic-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
42 | Assault-ramDeterministic-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
43 | Assault-ramNoFrameskip-v0 | Box(128,) | Discrete(7) | (-inf, inf) |
44 | Assault-ramNoFrameskip-v4 | Box(128,) | Discrete(7) | (-inf, inf) |
45 | Assault-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
46 | Assault-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
47 | AssaultDeterministic-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
48 | AssaultDeterministic-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
49 | AssaultNoFrameskip-v0 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
50 | AssaultNoFrameskip-v4 | Box(250, 160, 3) | Discrete(7) | (-inf, inf) |
51 | Asterix-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
52 | Asterix-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
53 | Asterix-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
54 | Asterix-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
55 | Asterix-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
56 | Asterix-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
57 | Asterix-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
58 | Asterix-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
59 | AsterixDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
60 | AsterixDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
61 | AsterixNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
62 | AsterixNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
63 | Asteroids-ram-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
64 | Asteroids-ram-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
65 | Asteroids-ramDeterministic-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
66 | Asteroids-ramDeterministic-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
67 | Asteroids-ramNoFrameskip-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
68 | Asteroids-ramNoFrameskip-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
69 | Asteroids-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
70 | Asteroids-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
71 | AsteroidsDeterministic-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
72 | AsteroidsDeterministic-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
73 | AsteroidsNoFrameskip-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
74 | AsteroidsNoFrameskip-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
75 | Atlantis-ram-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
76 | Atlantis-ram-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
77 | Atlantis-ramDeterministic-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
78 | Atlantis-ramDeterministic-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
79 | Atlantis-ramNoFrameskip-v0 | Box(128,) | Discrete(4) | (-inf, inf) |
80 | Atlantis-ramNoFrameskip-v4 | Box(128,) | Discrete(4) | (-inf, inf) |
81 | Atlantis-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
82 | Atlantis-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
83 | AtlantisDeterministic-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
84 | AtlantisDeterministic-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
85 | AtlantisNoFrameskip-v0 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
86 | AtlantisNoFrameskip-v4 | Box(210, 160, 3) | Discrete(4) | (-inf, inf) |
87 | BankHeist-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
88 | BankHeist-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
89 | BankHeist-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
90 | BankHeist-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
91 | BankHeist-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
92 | BankHeist-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
93 | BankHeist-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
94 | BankHeist-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
95 | BankHeistDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
96 | BankHeistDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
97 | BankHeistNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
98 | BankHeistNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
99 | BattleZone-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
100 | BattleZone-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
101 | BattleZone-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
102 | BattleZone-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
103 | BattleZone-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
104 | BattleZone-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
105 | BattleZone-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
106 | BattleZone-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
107 | BattleZoneDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
108 | BattleZoneDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
109 | BattleZoneNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
110 | BattleZoneNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
111 | BeamRider-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
112 | BeamRider-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
113 | BeamRider-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
114 | BeamRider-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
115 | BeamRider-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
116 | BeamRider-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
117 | BeamRider-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
118 | BeamRider-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
119 | BeamRiderDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
120 | BeamRiderDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
121 | BeamRiderNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
122 | BeamRiderNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
123 | Berzerk-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
124 | Berzerk-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
125 | Berzerk-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
126 | Berzerk-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
127 | Berzerk-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
128 | Berzerk-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
129 | Berzerk-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
130 | Berzerk-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
131 | BerzerkDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
132 | BerzerkDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
133 | BerzerkNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
134 | BerzerkNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
135 | BipedalWalker-v2 | Error | Error | Error |
136 | BipedalWalkerHardcore-v2 | Error | Error | Error |
137 | Blackjack-v0 | Tuple(Discrete(32), Discrete(11), Discrete(2)) | Discrete(2) | (-inf, inf) |
138 | Bowling-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
139 | Bowling-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
140 | Bowling-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
141 | Bowling-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
142 | Bowling-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
143 | Bowling-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
144 | Bowling-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
145 | Bowling-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
146 | BowlingDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
147 | BowlingDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
148 | BowlingNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
149 | BowlingNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
150 | Boxing-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
151 | Boxing-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
152 | Boxing-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
153 | Boxing-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
154 | Boxing-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
155 | Boxing-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
156 | Boxing-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
157 | Boxing-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
158 | BoxingDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
159 | BoxingDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
160 | BoxingNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
161 | BoxingNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
162 | Breakout-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
163 | Breakout-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
164 | Breakout-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
165 | Breakout-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
166 | Breakout-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
167 | Breakout-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
168 | Breakout-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
169 | Breakout-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
170 | BreakoutDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
171 | BreakoutDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
172 | BreakoutNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
173 | BreakoutNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
174 | CNNClassifierTraining-v0 | Error | Error | Error |
175 | CarRacing-v0 | Error | Error | Error |
176 | Carnival-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
177 | Carnival-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
178 | Carnival-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
179 | Carnival-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
180 | Carnival-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
181 | Carnival-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
182 | Carnival-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
183 | Carnival-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
184 | CarnivalDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
185 | CarnivalDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
186 | CarnivalNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
187 | CarnivalNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
188 | CartPole-v0 | Box(4,) | Discrete(2) | (-inf, inf) |
189 | CartPole-v1 | Box(4,) | Discrete(2) | (-inf, inf) |
190 | Centipede-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
191 | Centipede-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
192 | Centipede-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
193 | Centipede-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
194 | Centipede-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
195 | Centipede-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
196 | Centipede-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
197 | Centipede-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
198 | CentipedeDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
199 | CentipedeDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
200 | CentipedeNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
201 | CentipedeNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
202 | ChopperCommand-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
203 | ChopperCommand-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
204 | ChopperCommand-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
205 | ChopperCommand-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
206 | ChopperCommand-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
207 | ChopperCommand-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
208 | ChopperCommand-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
209 | ChopperCommand-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
210 | ChopperCommandDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
211 | ChopperCommandDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
212 | ChopperCommandNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
213 | ChopperCommandNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
214 | CliffWalking-v0 | Discrete(48) | Discrete(4) | (-inf, inf) |
215 | ConvergenceControl-v0 | Error | Error | Error |
216 | Copy-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
217 | CrazyClimber-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
218 | CrazyClimber-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
219 | CrazyClimber-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
220 | CrazyClimber-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
221 | CrazyClimber-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
222 | CrazyClimber-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
223 | CrazyClimber-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
224 | CrazyClimber-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
225 | CrazyClimberDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
226 | CrazyClimberDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
227 | CrazyClimberNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
228 | CrazyClimberNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
229 | DemonAttack-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
230 | DemonAttack-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
231 | DemonAttack-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
232 | DemonAttack-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
233 | DemonAttack-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
234 | DemonAttack-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
235 | DemonAttack-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
236 | DemonAttack-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
237 | DemonAttackDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
238 | DemonAttackDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
239 | DemonAttackNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
240 | DemonAttackNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
241 | DoubleDunk-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
242 | DoubleDunk-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
243 | DoubleDunk-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
244 | DoubleDunk-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
245 | DoubleDunk-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
246 | DoubleDunk-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
247 | DoubleDunk-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
248 | DoubleDunk-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
249 | DoubleDunkDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
250 | DoubleDunkDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
251 | DoubleDunkNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
252 | DoubleDunkNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
253 | DuplicatedInput-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
254 | ElevatorAction-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
255 | ElevatorAction-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
256 | ElevatorAction-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
257 | ElevatorAction-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
258 | ElevatorAction-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
259 | ElevatorAction-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
260 | ElevatorAction-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
261 | ElevatorAction-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
262 | ElevatorActionDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
263 | ElevatorActionDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
264 | ElevatorActionNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
265 | ElevatorActionNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
266 | Enduro-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
267 | Enduro-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
268 | Enduro-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
269 | Enduro-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
270 | Enduro-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
271 | Enduro-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
272 | Enduro-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
273 | Enduro-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
274 | EnduroDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
275 | EnduroDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
276 | EnduroNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
277 | EnduroNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
278 | FishingDerby-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
279 | FishingDerby-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
280 | FishingDerby-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
281 | FishingDerby-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
282 | FishingDerby-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
283 | FishingDerby-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
284 | FishingDerby-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
285 | FishingDerby-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
286 | FishingDerbyDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
287 | FishingDerbyDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
288 | FishingDerbyNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
289 | FishingDerbyNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
290 | Freeway-ram-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
291 | Freeway-ram-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
292 | Freeway-ramDeterministic-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
293 | Freeway-ramDeterministic-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
294 | Freeway-ramNoFrameskip-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
295 | Freeway-ramNoFrameskip-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
296 | Freeway-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
297 | Freeway-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
298 | FreewayDeterministic-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
299 | FreewayDeterministic-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
300 | FreewayNoFrameskip-v0 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
301 | FreewayNoFrameskip-v4 | Box(210, 160, 3) | Discrete(3) | (-inf, inf) |
302 | Frostbite-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
303 | Frostbite-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
304 | Frostbite-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
305 | Frostbite-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
306 | Frostbite-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
307 | Frostbite-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
308 | Frostbite-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
309 | Frostbite-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
310 | FrostbiteDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
311 | FrostbiteDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
312 | FrostbiteNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
313 | FrostbiteNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
314 | FrozenLake-v0 | Discrete(16) | Discrete(4) | (-inf, inf) |
315 | FrozenLake8x8-v0 | Discrete(64) | Discrete(4) | (-inf, inf) |
316 | Go19x19-v0 | Error | Error | Error |
317 | Go9x9-v0 | Error | Error | Error |
318 | Gopher-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
319 | Gopher-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
320 | Gopher-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
321 | Gopher-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
322 | Gopher-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
323 | Gopher-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
324 | Gopher-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
325 | Gopher-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
326 | GopherDeterministic-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
327 | GopherDeterministic-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
328 | GopherNoFrameskip-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
329 | GopherNoFrameskip-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
330 | Gravitar-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
331 | Gravitar-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
332 | Gravitar-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
333 | Gravitar-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
334 | Gravitar-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
335 | Gravitar-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
336 | Gravitar-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
337 | Gravitar-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
338 | GravitarDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
339 | GravitarDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
340 | GravitarNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
341 | GravitarNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
342 | GuessingGame-v0 | Discrete(4) | Box(1,) | (-inf, inf) |
343 | HalfCheetah-v1 | Error | Error | Error |
344 | Hero-ram-v0 | Error | Error | Error |
345 | Hero-ram-v4 | Error | Error | Error |
346 | Hero-ramDeterministic-v0 | Error | Error | Error |
347 | Hero-ramDeterministic-v4 | Error | Error | Error |
348 | Hero-ramNoFrameskip-v0 | Error | Error | Error |
349 | Hero-ramNoFrameskip-v4 | Error | Error | Error |
350 | Hero-v0 | Error | Error | Error |
351 | Hero-v4 | Error | Error | Error |
352 | HeroDeterministic-v0 | Error | Error | Error |
353 | HeroDeterministic-v4 | Error | Error | Error |
354 | HeroNoFrameskip-v0 | Error | Error | Error |
355 | HeroNoFrameskip-v4 | Error | Error | Error |
356 | Hex9x9-v0 | Error | Error | Error |
357 | Hopper-v1 | Error | Error | Error |
358 | HotterColder-v0 | Discrete(4) | Box(1,) | (-inf, inf) |
359 | Humanoid-v1 | Error | Error | Error |
360 | HumanoidStandup-v1 | Error | Error | Error |
361 | IceHockey-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
362 | IceHockey-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
363 | IceHockey-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
364 | IceHockey-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
365 | IceHockey-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
366 | IceHockey-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
367 | IceHockey-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
368 | IceHockey-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
369 | IceHockeyDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
370 | IceHockeyDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
371 | IceHockeyNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
372 | IceHockeyNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
373 | InvertedDoublePendulum-v1 | Error | Error | Error |
374 | InvertedPendulum-v1 | Error | Error | Error |
375 | Jamesbond-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
376 | Jamesbond-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
377 | Jamesbond-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
378 | Jamesbond-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
379 | Jamesbond-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
380 | Jamesbond-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
381 | Jamesbond-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
382 | Jamesbond-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
383 | JamesbondDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
384 | JamesbondDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
385 | JamesbondNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
386 | JamesbondNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
387 | JourneyEscape-ram-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
388 | JourneyEscape-ram-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
389 | JourneyEscape-ramDeterministic-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
390 | JourneyEscape-ramDeterministic-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
391 | JourneyEscape-ramNoFrameskip-v0 | Box(128,) | Discrete(16) | (-inf, inf) |
392 | JourneyEscape-ramNoFrameskip-v4 | Box(128,) | Discrete(16) | (-inf, inf) |
393 | JourneyEscape-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
394 | JourneyEscape-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
395 | JourneyEscapeDeterministic-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
396 | JourneyEscapeDeterministic-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
397 | JourneyEscapeNoFrameskip-v0 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
398 | JourneyEscapeNoFrameskip-v4 | Box(230, 160, 3) | Discrete(16) | (-inf, inf) |
399 | Kangaroo-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
400 | Kangaroo-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
401 | Kangaroo-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
402 | Kangaroo-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
403 | Kangaroo-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
404 | Kangaroo-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
405 | Kangaroo-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
406 | Kangaroo-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
407 | KangarooDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
408 | KangarooDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
409 | KangarooNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
410 | KangarooNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
411 | KellyCoinflip-v0 | Tuple(Box(1,), Discrete(301)) | Discrete(25000) | (0, 250.0) |
412 | KellyCoinflipGeneralized-v0 | Tuple(Box(1,), Discrete(280), Discrete(280), Discrete(280), Box(1,)) | Discrete(20300) | (0, 203.0) |
413 | Krull-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
414 | Krull-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
415 | Krull-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
416 | Krull-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
417 | Krull-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
418 | Krull-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
419 | Krull-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
420 | Krull-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
421 | KrullDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
422 | KrullDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
423 | KrullNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
424 | KrullNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
425 | KungFuMaster-ram-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
426 | KungFuMaster-ram-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
427 | KungFuMaster-ramDeterministic-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
428 | KungFuMaster-ramDeterministic-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
429 | KungFuMaster-ramNoFrameskip-v0 | Box(128,) | Discrete(14) | (-inf, inf) |
430 | KungFuMaster-ramNoFrameskip-v4 | Box(128,) | Discrete(14) | (-inf, inf) |
431 | KungFuMaster-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
432 | KungFuMaster-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
433 | KungFuMasterDeterministic-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
434 | KungFuMasterDeterministic-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
435 | KungFuMasterNoFrameskip-v0 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
436 | KungFuMasterNoFrameskip-v4 | Box(210, 160, 3) | Discrete(14) | (-inf, inf) |
437 | LunarLander-v2 | Error | Error | Error |
438 | LunarLanderContinuous-v2 | Error | Error | Error |
439 | MontezumaRevenge-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
440 | MontezumaRevenge-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
441 | MontezumaRevenge-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
442 | MontezumaRevenge-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
443 | MontezumaRevenge-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
444 | MontezumaRevenge-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
445 | MontezumaRevenge-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
446 | MontezumaRevenge-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
447 | MontezumaRevengeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
448 | MontezumaRevengeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
449 | MontezumaRevengeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
450 | MontezumaRevengeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
451 | MountainCar-v0 | Box(2,) | Discrete(3) | (-inf, inf) |
452 | MountainCarContinuous-v0 | Box(2,) | Box(1,) | (-inf, inf) |
453 | MsPacman-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
454 | MsPacman-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
455 | MsPacman-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
456 | MsPacman-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
457 | MsPacman-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
458 | MsPacman-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
459 | MsPacman-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
460 | MsPacman-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
461 | MsPacmanDeterministic-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
462 | MsPacmanDeterministic-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
463 | MsPacmanNoFrameskip-v0 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
464 | MsPacmanNoFrameskip-v4 | Box(210, 160, 3) | Discrete(9) | (-inf, inf) |
465 | NChain-v0 | Discrete(5) | Discrete(2) | (-inf, inf) |
466 | NameThisGame-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
467 | NameThisGame-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
468 | NameThisGame-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
469 | NameThisGame-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
470 | NameThisGame-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
471 | NameThisGame-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
472 | NameThisGame-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
473 | NameThisGame-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
474 | NameThisGameDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
475 | NameThisGameDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
476 | NameThisGameNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
477 | NameThisGameNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
478 | OffSwitchCartpole-v0 | Tuple(Discrete(2), Box(4,)) | Discrete(2) | (-inf, inf) |
479 | OffSwitchCartpoleProb-v0 | Tuple(Discrete(2), Box(4,)) | Discrete(2) | (-inf, inf) |
480 | OneRoundDeterministicReward-v0 | Discrete(1) | Discrete(2) | (-inf, inf) |
481 | OneRoundNondeterministicReward-v0 | Discrete(1) | Discrete(2) | (-inf, inf) |
482 | Pendulum-v0 | Box(3,) | Box(1,) | (-inf, inf) |
483 | Phoenix-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
484 | Phoenix-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
485 | Phoenix-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
486 | Phoenix-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
487 | Phoenix-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
488 | Phoenix-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
489 | Phoenix-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
490 | Phoenix-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
491 | PhoenixDeterministic-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
492 | PhoenixDeterministic-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
493 | PhoenixNoFrameskip-v0 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
494 | PhoenixNoFrameskip-v4 | Box(210, 160, 3) | Discrete(8) | (-inf, inf) |
495 | Pitfall-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
496 | Pitfall-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
497 | Pitfall-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
498 | Pitfall-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
499 | Pitfall-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
500 | Pitfall-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
501 | Pitfall-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
502 | Pitfall-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
503 | PitfallDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
504 | PitfallDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
505 | PitfallNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
506 | PitfallNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
507 | Pong-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
508 | Pong-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
509 | Pong-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
510 | Pong-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
511 | Pong-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
512 | Pong-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
513 | Pong-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
514 | Pong-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
515 | PongDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
516 | PongDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
517 | PongNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
518 | PongNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
519 | Pooyan-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
520 | Pooyan-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
521 | Pooyan-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
522 | Pooyan-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
523 | Pooyan-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
524 | Pooyan-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
525 | Pooyan-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
526 | Pooyan-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
527 | PooyanDeterministic-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
528 | PooyanDeterministic-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
529 | PooyanNoFrameskip-v0 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
530 | PooyanNoFrameskip-v4 | Box(250, 160, 3) | Discrete(6) | (-inf, inf) |
531 | PredictActionsCartpole-v0 | Box(4,) | Tuple(Discrete(2), Discrete(2), Discrete(2), Discrete(2), Discrete(2), Discrete(2)) | (-inf, inf) |
532 | PredictObsCartpole-v0 | Box(4,) | Tuple(Discrete(2), Box(4,), Box(4,), Box(4,), Box(4,), Box(4,)) | (-inf, inf) |
533 | PrivateEye-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
534 | PrivateEye-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
535 | PrivateEye-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
536 | PrivateEye-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
537 | PrivateEye-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
538 | PrivateEye-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
539 | PrivateEye-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
540 | PrivateEye-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
541 | PrivateEyeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
542 | PrivateEyeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
543 | PrivateEyeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
544 | PrivateEyeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
545 | Pusher-v0 | Error | Error | Error |
546 | Qbert-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
547 | Qbert-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
548 | Qbert-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
549 | Qbert-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
550 | Qbert-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
551 | Qbert-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
552 | Qbert-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
553 | Qbert-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
554 | QbertDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
555 | QbertDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
556 | QbertNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
557 | QbertNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
558 | Reacher-v1 | Error | Error | Error |
559 | RepeatCopy-v0 | Discrete(6) | Tuple(Discrete(2), Discrete(2), Discrete(5)) | (-inf, inf) |
560 | Reverse-v0 | Discrete(3) | Tuple(Discrete(2), Discrete(2), Discrete(2)) | (-inf, inf) |
561 | ReversedAddition-v0 | Discrete(4) | Tuple(Discrete(4), Discrete(2), Discrete(3)) | (-inf, inf) |
562 | ReversedAddition3-v0 | Discrete(4) | Tuple(Discrete(4), Discrete(2), Discrete(3)) | (-inf, inf) |
563 | Riverraid-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
564 | Riverraid-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
565 | Riverraid-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
566 | Riverraid-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
567 | Riverraid-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
568 | Riverraid-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
569 | Riverraid-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
570 | Riverraid-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
571 | RiverraidDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
572 | RiverraidDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
573 | RiverraidNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
574 | RiverraidNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
575 | RoadRunner-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
576 | RoadRunner-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
577 | RoadRunner-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
578 | RoadRunner-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
579 | RoadRunner-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
580 | RoadRunner-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
581 | RoadRunner-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
582 | RoadRunner-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
583 | RoadRunnerDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
584 | RoadRunnerDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
585 | RoadRunnerNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
586 | RoadRunnerNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
587 | Robotank-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
588 | Robotank-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
589 | Robotank-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
590 | Robotank-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
591 | Robotank-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
592 | Robotank-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
593 | Robotank-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
594 | Robotank-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
595 | RobotankDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
596 | RobotankDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
597 | RobotankNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
598 | RobotankNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
599 | Roulette-v0 | Discrete(1) | Discrete(38) | (-inf, inf) |
600 | Seaquest-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
601 | Seaquest-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
602 | Seaquest-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
603 | Seaquest-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
604 | Seaquest-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
605 | Seaquest-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
606 | Seaquest-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
607 | Seaquest-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
608 | SeaquestDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
609 | SeaquestDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
610 | SeaquestNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
611 | SeaquestNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
612 | SemisuperPendulumDecay-v0 | Box(3,) | Box(1,) | (-inf, inf) |
613 | SemisuperPendulumNoise-v0 | Box(3,) | Box(1,) | (-inf, inf) |
614 | SemisuperPendulumRandom-v0 | Box(3,) | Box(1,) | (-inf, inf) |
615 | Skiing-ram-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
616 | Skiing-ram-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
617 | Skiing-ramDeterministic-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
618 | Skiing-ramDeterministic-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
619 | Skiing-ramNoFrameskip-v0 | Box(128,) | Discrete(3) | (-inf, inf) |
620 | Skiing-ramNoFrameskip-v4 | Box(128,) | Discrete(3) | (-inf, inf) |
621 | Skiing-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
622 | Skiing-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
623 | SkiingDeterministic-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
624 | SkiingDeterministic-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
625 | SkiingNoFrameskip-v0 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
626 | SkiingNoFrameskip-v4 | Box(250, 160, 3) | Discrete(3) | (-inf, inf) |
627 | Solaris-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
628 | Solaris-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
629 | Solaris-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
630 | Solaris-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
631 | Solaris-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
632 | Solaris-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
633 | Solaris-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
634 | Solaris-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
635 | SolarisDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
636 | SolarisDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
637 | SolarisNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
638 | SolarisNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
639 | SpaceInvaders-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
640 | SpaceInvaders-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
641 | SpaceInvaders-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
642 | SpaceInvaders-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
643 | SpaceInvaders-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
644 | SpaceInvaders-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
645 | SpaceInvaders-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
646 | SpaceInvaders-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
647 | SpaceInvadersDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
648 | SpaceInvadersDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
649 | SpaceInvadersNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
650 | SpaceInvadersNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
651 | StarGunner-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
652 | StarGunner-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
653 | StarGunner-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
654 | StarGunner-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
655 | StarGunner-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
656 | StarGunner-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
657 | StarGunner-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
658 | StarGunner-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
659 | StarGunnerDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
660 | StarGunnerDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
661 | StarGunnerNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
662 | StarGunnerNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
663 | Striker-v0 | Error | Error | Error |
664 | Swimmer-v1 | Error | Error | Error |
665 | Taxi-v2 | Discrete(500) | Discrete(6) | (-inf, inf) |
666 | Tennis-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
667 | Tennis-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
668 | Tennis-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
669 | Tennis-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
670 | Tennis-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
671 | Tennis-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
672 | Tennis-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
673 | Tennis-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
674 | TennisDeterministic-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
675 | TennisDeterministic-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
676 | TennisNoFrameskip-v0 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
677 | TennisNoFrameskip-v4 | Box(250, 160, 3) | Discrete(18) | (-inf, inf) |
678 | Thrower-v0 | Error | Error | Error |
679 | TimePilot-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
680 | TimePilot-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
681 | TimePilot-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
682 | TimePilot-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
683 | TimePilot-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
684 | TimePilot-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
685 | TimePilot-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
686 | TimePilot-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
687 | TimePilotDeterministic-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
688 | TimePilotDeterministic-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
689 | TimePilotNoFrameskip-v0 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
690 | TimePilotNoFrameskip-v4 | Box(210, 160, 3) | Discrete(10) | (-inf, inf) |
691 | Tutankham-ram-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
692 | Tutankham-ram-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
693 | Tutankham-ramDeterministic-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
694 | Tutankham-ramDeterministic-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
695 | Tutankham-ramNoFrameskip-v0 | Box(128,) | Discrete(8) | (-inf, inf) |
696 | Tutankham-ramNoFrameskip-v4 | Box(128,) | Discrete(8) | (-inf, inf) |
697 | Tutankham-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
698 | Tutankham-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
699 | TutankhamDeterministic-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
700 | TutankhamDeterministic-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
701 | TutankhamNoFrameskip-v0 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
702 | TutankhamNoFrameskip-v4 | Box(250, 160, 3) | Discrete(8) | (-inf, inf) |
703 | TwoRoundDeterministicReward-v0 | Discrete(3) | Discrete(2) | (-inf, inf) |
704 | TwoRoundNondeterministicReward-v0 | Discrete(3) | Discrete(2) | (-inf, inf) |
705 | UpNDown-ram-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
706 | UpNDown-ram-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
707 | UpNDown-ramDeterministic-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
708 | UpNDown-ramDeterministic-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
709 | UpNDown-ramNoFrameskip-v0 | Box(128,) | Discrete(6) | (-inf, inf) |
710 | UpNDown-ramNoFrameskip-v4 | Box(128,) | Discrete(6) | (-inf, inf) |
711 | UpNDown-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
712 | UpNDown-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
713 | UpNDownDeterministic-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
714 | UpNDownDeterministic-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
715 | UpNDownNoFrameskip-v0 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
716 | UpNDownNoFrameskip-v4 | Box(210, 160, 3) | Discrete(6) | (-inf, inf) |
717 | Venture-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
718 | Venture-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
719 | Venture-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
720 | Venture-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
721 | Venture-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
722 | Venture-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
723 | Venture-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
724 | Venture-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
725 | VentureDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
726 | VentureDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
727 | VentureNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
728 | VentureNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
729 | VideoPinball-ram-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
730 | VideoPinball-ram-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
731 | VideoPinball-ramDeterministic-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
732 | VideoPinball-ramDeterministic-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
733 | VideoPinball-ramNoFrameskip-v0 | Box(128,) | Discrete(9) | (-inf, inf) |
734 | VideoPinball-ramNoFrameskip-v4 | Box(128,) | Discrete(9) | (-inf, inf) |
735 | VideoPinball-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
736 | VideoPinball-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
737 | VideoPinballDeterministic-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
738 | VideoPinballDeterministic-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
739 | VideoPinballNoFrameskip-v0 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
740 | VideoPinballNoFrameskip-v4 | Box(250, 160, 3) | Discrete(9) | (-inf, inf) |
741 | Walker2d-v1 | Error | Error | Error |
742 | WizardOfWor-ram-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
743 | WizardOfWor-ram-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
744 | WizardOfWor-ramDeterministic-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
745 | WizardOfWor-ramDeterministic-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
746 | WizardOfWor-ramNoFrameskip-v0 | Box(128,) | Discrete(10) | (-inf, inf) |
747 | WizardOfWor-ramNoFrameskip-v4 | Box(128,) | Discrete(10) | (-inf, inf) |
748 | WizardOfWor-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
749 | WizardOfWor-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
750 | WizardOfWorDeterministic-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
751 | WizardOfWorDeterministic-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
752 | WizardOfWorNoFrameskip-v0 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
753 | WizardOfWorNoFrameskip-v4 | Box(250, 160, 3) | Discrete(10) | (-inf, inf) |
754 | YarsRevenge-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
755 | YarsRevenge-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
756 | YarsRevenge-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
757 | YarsRevenge-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
758 | YarsRevenge-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
759 | YarsRevenge-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
760 | YarsRevenge-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
761 | YarsRevenge-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
762 | YarsRevengeDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
763 | YarsRevengeDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
764 | YarsRevengeNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
765 | YarsRevengeNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
766 | Zaxxon-ram-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
767 | Zaxxon-ram-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
768 | Zaxxon-ramDeterministic-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
769 | Zaxxon-ramDeterministic-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
770 | Zaxxon-ramNoFrameskip-v0 | Box(128,) | Discrete(18) | (-inf, inf) |
771 | Zaxxon-ramNoFrameskip-v4 | Box(128,) | Discrete(18) | (-inf, inf) |
772 | Zaxxon-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
773 | Zaxxon-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
774 | ZaxxonDeterministic-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
775 | ZaxxonDeterministic-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
776 | ZaxxonNoFrameskip-v0 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
777 | ZaxxonNoFrameskip-v4 | Box(210, 160, 3) | Discrete(18) | (-inf, inf) |
To summarize it:
- Most environment have an continous observation space and a discrete action space
- The action space often has about 20 possibilities. The maximum is
KellyCoinflip-v0
with 25000 actions. - The observation space has at least one input (
OneRoundNondeterministicReward-v0
) and never more than 500 (Taxi-v2
) - The observation space shape of
(210, 160, 3)
is so common (264 times!) because it is the screen of an Atari game.