Natasha Jaques
Inverse Reinforcement Learning
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Using inverse reinforcement learning to infer human preferences is challenging because the problem is underspecified. We use multi-task RL pre-training and successor features to learn a strong prior over the space of reasonable goals in an environment, which we call a basis, that enables rapidly inferring an expert’s reward function in only 100 samples.
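To make the successor-feature idea concrete, here is a minimal sketch. It is not the paper's algorithm. It assumes rewards are linear in some features, so that action values can be written as Q(s, a) = psi(s, a) · w, and it fits the weight vector w by maximum likelihood from 100 demonstrations of a softmax-rational expert. The array `psi`, the vector `w_true`, the sizes, and the function names are all hypothetical stand-ins; in practice the successor features would come from pre-training rather than from a random generator.

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, d = 50, 5, 4

# Hypothetical successor features psi[s, a] in R^d (random stand-in for a
# pre-trained basis), and an assumed "true" reward weight vector.
psi = rng.uniform(-1.0, 1.0, size=(n_states, n_actions, d))
w_true = np.array([1.0, -2.0, 0.5, 1.5])


def action_log_probs(w):
    """Log-probabilities of a softmax policy over Q(s, a) = psi(s, a) . w."""
    q = psi @ w                                  # shape (n_states, n_actions)
    q = q - q.max(axis=1, keepdims=True)         # stabilise the softmax
    return q - np.log(np.exp(q).sum(axis=1, keepdims=True))


# Simulate 100 expert demonstrations as (state, action) pairs.
states = rng.integers(0, n_states, size=100)
expert_probs = np.exp(action_log_probs(w_true))
actions = np.array([rng.choice(n_actions, p=expert_probs[s]) for s in states])


def mean_log_likelihood(w):
    """Average log-likelihood of the demonstrations under weights w."""
    return action_log_probs(w)[states, actions].mean()


def fit_reward_weights(steps=2000, lr=0.1):
    """Gradient ascent on the (concave) demonstration log-likelihood."""
    w = np.zeros(d)
    for _ in range(steps):
        p = np.exp(action_log_probs(w))[states]              # (100, n_actions)
        expected = np.einsum("na,nad->nd", p, psi[states])   # E_pi[psi | s]
        grad = (psi[states, actions] - expected).mean(axis=0)
        w += lr * grad
    return w


w_hat = fit_reward_weights()
cosine = w_hat @ w_true / (np.linalg.norm(w_hat) * np.linalg.norm(w_true))
print("recovered direction (cosine similarity):", round(float(cosine), 3))
```

Because only d weights are unknown once the features are fixed, a small number of demonstrations can already pin down the direction of the reward, which is the intuition behind learning the basis ahead of time.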
M. Abdulhai, Natasha Jaques, S. Levine
2022. Preprint.