Home Lex Fridman Notes
Lex Fridman · 2019-10-12 · 32m

David Ferrucci: The Story of IBM Watson Winning in Jeopardy | AI Podcast Clips

IBM Watson's lead engineer David Ferrucci explains how a self-contained machine beat the best humans at Jeopardy.

David Ferrucci: The Story of IBM Watson Winning in Jeopardy | AI Podcast Clips
The guest

David Ferrucci — AI researcher who led the IBM Watson team that won at Jeopardy in 2011. He was a senior leader in IBM Research specializing in open-domain question answering and natural language understanding.

The gist

David Ferrucci tells Lex Fridman the inside story of building IBM Watson to compete on Jeopardy against human champions. He explains why Jeopardy was uniquely hard: questions are witty and indirect, answers span tens of thousands of types, and the machine had to answer accurately, fast, and with calibrated confidence under three seconds. Ferrucci describes his pragmatic engineering philosophy of integrating existing NLP and machine learning rather than waiting for a fundamental breakthrough, and the divide-and-conquer architecture that let many researchers improve independent components fused by machine learning. He reflects on the pressure of a public challenge, his pride in staying true to the science, and the difference between winning a game and solving language understanding.

Big reveals

  • Watson never came close to knowing all of Jeopardy; the answer could only be found anywhere about 85% of the time even at the peak.
  • For several years IBM execs said no to the Jeopardy challenge, fearing reputational risk, until Ferrucci was 'crazy enough' to say yes.
  • Ferrucci deliberately committed to NOT actually understanding language, choosing 'whatever works' over solving the general NLU problem.
  • Watson was a self-contained device with no internet access, running everything in-memory across roughly 2,000-3,000 connected cores.
  • The key enabler was using machine learning to fuse hundreds of component scores, letting researchers work independently via divide-and-conquer.
  • Ferrucci admits the win 'was not a success in natural language understanding' but that was never the goal.

Things worth remembering

  • Watson targeted answering and buzzing in under three seconds on average to be competitive.
  • Even a full web search only surfaced the answer in the top 10-20 documents about 65% of the time, not good enough to win.
  • To reach the winner's circle you have to be accurate over 70% and do it very quickly.
  • Watson had to find the right answer among many documents AND compute a confidence score for each candidate.
  • Watson's knowledge base included Wikipedia plus expanded web crawls, totaling roughly two to five million books' worth of content.
  • Watson generated thousands of candidate answers per question and applied around 200 different scoring algorithms to each.
  • Every component had to demonstrate measurable impact on end-to-end performance before it was allowed into the system.
  • A researcher warned they would need 'Maxwell's equations for question-answering'; Ferrucci bet instead on integrating existing technology.