Home Lex Fridman Notes
Lex Fridman · 2020-03-19 · 1h 38m

Anca Dragan: Human-Robot Interaction and Reward Engineering | Lex Fridman Podcast #81

Berkeley roboticist Anca Dragan on why robots must model messy humans, learn hidden rewards, and treat human-robot interaction as a shared game.

Anca Dragan: Human-Robot Interaction and Reward Engineering | Lex Fridman Podcast #81
The guest

Anca Dragan — A professor at UC Berkeley working on human-robot interaction and reward engineering algorithms, who also consults at Waymo. She studies how robots can generate behavior that accounts for coordinating with people.

The gist

Anca Dragan explains her work on human-robot interaction, where the robot's job is to optimize for what people actually want rather than what a programmer literally specified. She argues that humans who look irrational may simply be operating under different assumptions or simpler internal models, and shows how robots can use their own actions to gather information about human intent. The conversation covers inverse reinforcement learning, the difficulty of designing reward functions, autonomous driving as a game-theoretic problem with humans, and semi-autonomous driving's risks. It closes on mortality, the meaning of life, and how finiteness might belong in our reward functions.

Big reveals

  • Anca reveals her husband proposed to her by building a seven-degree-of-freedom WALL-E robot that opened a Lego box.
  • Robots can deliberately nudge forward to probe a human driver's aggressiveness and update their model based on the reaction.
  • By modeling a human's intuitive (wrong) physics model, her team got people to actually land the Lunar Lander game.
  • She claims if you removed all humans from downtown San Francisco, autonomous driving would essentially be a solved problem.
  • She calls it irresponsible not to use lidar, while sympathizing with Musk's view of lidar as a crutch.
  • Lex pushes back on human-factors orthodoxy, saying drivers can be more energized as observers in some semi-autonomous setups.
  • The very state of the world (e.g. shoes lined up on the floor) leaks information about human preferences to a robot.

Things worth remembering

  • Inverse reinforcement learning infers what reward function a human's behavior is optimal with respect to.
  • Boltzmann rationality models humans as choosing options stochastically in proportion to their utility.
  • She frames human-robot interaction as an 'under-actuated' system where you influence but cannot directly control people.
  • Anca's hobby is watching hundreds of hours of pedestrian video to learn about human behavior.
  • 'Civil inattention': if you avoid eye contact while running, people move out of your way.
  • Goodhart's law: once a metric becomes a target, it stops being a good metric.
  • Robots should interpret specified rewards as evidence of intent, not as literal universal laws.
  • Anca says we love existing so much precisely because it ends, and that finiteness should inform reward functions.

Recommended in this episode

Books, products and media the guest or host genuinely endorsed here — with the buy link.

Affiliate link — we may earn a commission at no extra cost to you.

RecommendedMedia

WALL-E

Pixar

“my favorite fictional robot is Wally and I love how amazingly expressive it is some personal things a little bit about expressive motion” — guest 00:07:52
Find it on Amazon
RecommendedBook

Artificial Intelligence: A Modern Approach

Stuart Russell and Peter Norvig

“I got my hands on a PDF copy in Romania of Russell Norvig a I modern approach... it was so captivating” — guest 01:26:44
Find it on Amazon