The business of physical AI.

GlossaryAI, models & control

Reinforcement learning

Reinforcement learning (RL) is a machine-learning paradigm where an agent learns by maximizing a reward signal through trial-and-error interaction with an environment. Distinct from imitation learning (learning from demonstrations) and from supervised learning (learning from labeled examples). The defining property: the agent must explore actions and observe consequences before it can learn what works.

The distinction matters because RL in the real world is data-hungry to the point of impracticality for most robot tasks — a humanoid trying to learn manipulation by trial-and-error would damage itself before learning anything useful. Most RL in robotics happens in simulation, with sim-to-real transfer for deployment. Boston Dynamics' Atlas uses RL for locomotion training; Figure's manipulation policies use a combination of RL and imitation. Pure-RL approaches haven't crossed into deployed humanoid programs at scale yet.

Canonical reference: registry.deploy.report/glossary#reinforcement-learning

Relevant DEPLOY coverage

By topic: embodied aiai infrastructurehumanoids

By company: boston dynamicsfigure aitesla

Related terms

More in ai, models & control

← All glossary terms