LIVE TRAINING RUN · Humanoid-v4 · Episode 0R=5.0

Train. Break.
Retrain. Ship.

The standard Python API for reinforcement learning. 60+ environments. Three lines to your first agent.

gymnasium · training · humanoid-v4
LIVE
Cumulative Reward+5.0
ep 0ep 500ep 1000
0
Episodes
14.2k
Steps/s
168.4
Best R
Humanoid-v4
policy learning
Terminal
Open Source · MIT License
60+ Built-in Environments
Farama Foundation
Python 3.10–3.13
Environment Families

What will your agent learn?

Retro arcade game screen with colorful pixel graphics glowing in the dark
Discrete(18)
action space
Box(210,160,3)
obs space
57 games

Atari

Arcade Learning Environment

High

Classic Atari 2600 ROMs emulated at pixel level. From Pong to Montezuma's Revenge — the original benchmark for deep RL.

Pixel-basedDiscrete ActionsALE
57 envsClassic games
Robotic arm in a laboratory setting with precision mechanical joints
Box(n,)
action space
Box(m,)
obs space
11 environments

MuJoCo

Physics-Based Robotics

Very High

Multi-joint locomotion and manipulation in continuous state/action spaces. The gold standard for continuous control research.

ContinuousPhysics EngineLocomotion
11 envsPhysics sims
Electronic circuit board with glowing components representing control systems
Discrete / Box
action space
Box(2–6,)
obs space
5 environments

Classic Control

Real-world Physics Problems

Low–Medium

CartPole, Pendulum, MountainCar — small enough to understand completely, hard enough to build intuition about RL fundamentals.

Beginner-friendlyInterpretableFast
5 envsControl tasks
Also available:Box2DToy TextMinigridGymnasium-RoboticsViZDoomHighwayEnvStable-Retro+ many more via Farama Foundation
Install

Running in under 60 seconds

v1.2.3 LatestPython 3.10–3.13
$pip install gymnasium
quickstart.py
import gymnasium as gym
# Initialize any environment in one line
env = gym.make("CartPole-v1", render_mode="human")
# Standard loop — works for every environment
obs, info = env.reset(seed=42)
for _ in range(1000):
action = env.action_space.sample() # your policy here
obs, reward, terminated, truncated, info = env.step(action)
if terminated or truncated:
obs, info = env.reset()
env.close()
Farama Foundation
github.com/Farama-Foundation
11.3k
GitHub Stars
v1.2.3
Latest Release
Star on GitHub
Benchmarks

Gymnasium wins on every metric

MetricGymnasiumGym (legacy)Competitor B
CartPole-v1 wall-clock (1M steps)18s47s61s
MuJoCo Ant-v4 steps/sec (single CPU)1420089006300
Atari Pong env reset latency0.8ms2.1ms3.4ms
Vectorized CartPole (1024 envs) steps/sec2.1M890k540k
API compatibility (Gym drop-in)100%~60%~40%
Python 3.13 support

Measured on Ubuntu 22.04, Intel Core i9-13900K, Python 3.12. Results may vary.

Throughput

Steps / Second

CartPole-v12.1M
Humanoid-v414.2k
Atari Pong280k
LunarLander-v3890k
Vectorization
1000×

faster than Sync vector env using NumPy CartPole. Parallel environments maximize GPU utilization.

Hundreds of envs in parallel
Pro EnvironmentsNEW

Production-grade robotics environments

High-fidelity environments used in published robotics research. MuJoCo-powered, Gymnasium API-compatible.

Shadow HandRobotics

24-DoF anthropomorphic manipulation

Fetch Robot ArmManipulation

7-DoF arm: Reach, Push, Pick & Place

Dexterous HandDexterity

Object reorientation with 24 actuators

Bipedal Walker ProLocomotion

Terrain-adaptive locomotion

Multi-Agent AntMulti-Agent

Cooperative locomotion, 4 agents

Sim-to-Real BridgeSim2Real

Domain randomization built-in

Compatible with:Stable-Baselines3CleanRLRLlibTorchRLTianshou
Start Training Free
Hosted Colab · GitHub OAuth

Pre-loaded CartPole environment. Watch your first agent learn in under 3 minutes. No install required.

Open in Colab
GitHub OAuth only · no password stored
Get Early Access

Gymnasium Pro Environments

Production robotics environments for serious research. Two fields, no friction.

No spam. Unsubscribe anytime.

Star on GitHub · 11.3k stars