Natasha Jaques
Natasha Jaques
Awards
Press
Featured
Publications
Topics
Talks
Communities
Light
Dark
Automatic
Human-AI Interaction
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Using inverse reinforcement learning to infer human preferences is challenging, because it is an underspecified problem. We use multi-task RL pre-training and successor features to learn a strong prior over the space of reasonable goals in an environment—which we call a
basis
—that enables rapidly inferring an expert’s reward function in only 100 samples.
M. Abdulhai
,
Natasha Jaques
,
S. Levine
2022
In
Preprint
PDF
Cite
Code
Project
Less is More: Generating Grounded Navigation Instructions from Landmarks
We study the automatic generation of natural language navigation instructions in visually realistic indoor environments. Existing generators suffer from poor visual grounding, skip steps, and hallucinate objects. We address this using a large language model which incorporates visual landmark detection.. The model dramatically increases the quality of generated instructions, such that humans can follow them with a 71% success rate (SR); just shy of the 75% SR of real human instructions.
S. Wang
,
C. Montgomery
,
J. Orbay
,
V. Birodkar
,
A. Faust
,
I. Gur
,
Natasha Jaques
,
A. Waters
,
J. Baldridge
,
P. Anderson
2022
In
Computer Vision and Pattern Recognition (CVPR)
PDF
Cite
Dataset
Human-Centric Dialog Training via Offline Reinforcement Learning
We train dialog models with interactive data from conversations with real humans, using a novel Offline RL technique based on KL-control. Rather than rely on manual ratings, we learn from implicit signals like sentiment, and show that this results in better performance.
Natasha Jaques
*
,
J. H. Shen
*
,
A. Ghandeharioun
,
C. Ferguson
,
A. Lapedriza
,
N. Jones
,
S. Gu
,
R. Picard
2020
In
Empirical Methods in Natural Language Processing (EMNLP)
PDF
Cite
Code
Dataset
Slides
EMNLP Talk
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Existing metrics for automatically evaluating dialog models correlate poorly with human judgements, and are evaluated on static conversation snippets. Instead, we deploy bots to interact live with humans, then approximate human ratings with state-of-the-art accuracy using conversations generated with self-play.
A. Ghandeharioun
*
,
J. H. Shen
*
,
Natasha Jaques
*
,
C. Ferguson
,
N. Jones
,
A. Lapedriza
,
R. Picard
2019
In
Neural Information Processing Systems (NeurIPS)
PDF
Cite
Code
Dataset
Poster
Social and Affective Machine Learning
My PhD Thesis spans both Social Reinforcement Learning and Affective Computing, investigating how affective and social intelligence can enhance machine learning algorithms, and how machine learning can enhance our ability to predict and understand human affective and social phenomena.
Natasha Jaques
2019
In
Massachusetts Institute of Technology
PDF
Cite
Thesis Defense
CV News write-up
Learning via Social Awareness: Improving a Deep Generative Sketching Model with Facial Feedback
We show the outputs of a generative model of sketches to human observers and record their facial expressions. Using only a small number of facial expression samples, we are able to tune the model to produce drawings that are significantly better rated by humans.
Natasha Jaques
,
J. McCleary
,
J. Engel
,
D. Ha
,
F. Bertsch
,
D. Eck
,
R. Picard
2018
In
International Conference on Learning Representations (ICLR) workshop
PDF
Cite
Slides
Quartz article
Interactive Musical Improvisation with Magenta
This demo deployed RL Tuner and other Magenta music generation models into an interactive interface in which users can collaborate creatively with a machine learning model. The interface supports call and response interaction, automatically generating an accompaniment to the user’s melody, or melody morphing: responding both with variations on the user’s melody and a bass accompaniment.
A. Roberts
,
J. Engel
,
C. Hawthorne
,
I. Simon
,
E. Waite
,
S. Oore
,
Natasha Jaques
,
C. Resnick
,
D. Eck
2016
In
Neural Information Processing Systems (NeurIPS)
Best Demo
Cite
Code
Video
NeurIPS Demo
Magenta
Blog post
Personality, Attitudes, and Bonding in Conversations
We collect observational data from real human conversations, and develop a measure of how much participants experienced bonding or chemistry. We analyze the effects of personality and attitudes on bonding, and find that attentiveness and excitement are more effective at promoting bonding than traits like attractiveness and humour.
Natasha Jaques
,
Y. K. Kim
,
\& Picard R. Picard R
2016
In
Intelligent Virtual Agents (IVA)
PDF
Cite
Understanding and Predicting Bonding in Conversations Using Thin Slices of Facial Expressions and Body Language
Given only one-minute slices of facial expressions and body language, we use machine learning to accurately predict whether two humans having a conversation will bond with each other. We analyze factors which lead to bonding and discover that synchrony in body language and appropriate, empathetic facial expressions lead to higher bonding.
Natasha Jaques
,
D. McDuff
,
Y. K. Kim
,
\& Picard R. Picard R
2016
In
Intelligent Virtual Agents (IVA)
PDF
Cite
SmileTracker: Automatically and Unobtrusively Recording Smiles and their Context.
SmileTracker is an app that uses facial expression recognition to take a screenshot of the user’s screen whenever they smile. The screenshot and image of the user’s face are saved, to help them remember positive content they encountered during the day. Users can opt to share their images to a public gallery.
Natasha Jaques
,
W. V. Chen
,
R. Picard
2015
In
Proceedings of the CHI Conference Extended Abstracts
PDF
Cite
Slides
Boston Magazine article
CBC news
»
Cite
×