Natasha Jaques
Natasha Jaques
Awards
Press
Featured
Publications
Topics
Talks
Communities
Light
Dark
Automatic
Responsible AI
Moral Foundations of Large Language Models
Moral Foundations theory decomposes human moral reasoning into five factors, which vary reliably across different human populations and political affiliations. We use moral foundations to analyze large language models like GPT-3 to determine what, if any, consistent moral values it brings to conversations, whether these can be deliberately manipulated, and whether holding a particular moral stance affects downstream tasks.
M. Abdulhai
,
C. Crepy
,
D. Valter
,
J. Canny
,
S. Levine
,
Natasha Jaques
2022
In
Preprint
Cite
Cite
×