Home Andrew Huberman Notes
Andrew Huberman · 2026-02-02 · 2h 41m

How Dopamine & Serotonin Shape Decisions, Motivation & Learning | Dr. Read Montague

Neuroscientist Read Montague rewrites the dopamine story, showing it as a moment-to-moment learning currency that mirrors the same AI algorithms now beating humans.

How Dopamine & Serotonin Shape Decisions, Motivation & Learning | Dr. Read Montague
The guest

Dr. Read Montague — Director of the Center for Human Neuroscience Research at Virginia Tech and a computational neuroscientist who pioneered methods to measure dopamine, serotonin and other neuromodulators in real time in conscious humans. An expert on motivation, decision-making and learning who connects brain chemistry to reinforcement-learning algorithms.

The gist

Huberman and Montague dismantle the popular 'dopamine equals pleasure' idea, explaining that dopamine is really a learning signal that encodes the difference between one expectation and the next, not just expectation versus reward. They show this temporal-difference reinforcement-learning rule is the same algorithm DeepMind used for AlphaGo and AlphaFold, and that it is installed in brains from honeybees to humans. The conversation covers serotonin as dopamine's opponent, how SSRIs may push serotonin into dopamine terminals and dull reward, and how stress and hunger can flip dopamine from coding rewards to coding threats. Montague describes recording neurotransmitters directly from the human brain and even through probes up the nose, and how AI may soon let people read their own dopamine and serotonin in real time. Threaded throughout are practical themes: foraging in dating, screen time and ADHD-like states, the value of effort and sports, and using deliberate delays to learn better.

Big reveals

  • Montague says dopamine is most clearly a learning signal, not a pleasure signal, and that the best modern work on it came out of artificial intelligence.
  • The core claim: dopamine encodes the difference between successive expectations (temporal-difference error), not just expectation versus reward.
  • The exact reinforcement-learning algorithm dopamine runs is what DeepMind used to build AlphaGo Zero, which 'beats the history of Go' and has never been beaten.
  • Across all their human recordings, dopamine and serotonin are opponent: when one goes up, the other goes down.
  • SSRIs push serotonin into dopamine terminals, which can blunt reward and may teach people to negatively condition on things they should pursue.
  • When animals (and likely people) are made hungry or stressed, dopamine flips role and starts encoding punishment-prediction errors instead of reward.
  • His team can record dopamine and serotonin from healthy people via a minimally invasive probe snaked up the nose to the olfactory epithelium.
  • Montague endorses Claude AI ('I love Claude'), using it to compare and contrast scientists' theories in ways PubMed cannot.

Things worth remembering

  • The same learning rule appears in B. subtilis brains, sea-slug brains, songbirds and humans; in honeybee brains it is probably octopamine rather than dopamine.
  • By the time Parkinson's symptoms appear, a person has already lost 70 to 75 percent of the dopamine neurons in their brain stem.
  • Only about 80,000 dopamine neurons per side (160,000 total) wire the entire brain and spinal cord through hundreds of millions of connections.
  • Honeybees fall on an 'ADD bee' to 'concentration bee' axis set by the ratio of octopamine to tyramine, the bee analogue of dopamine to serotonin.
  • A study found that a phone merely sitting face-down in the same room lowers cognitive performance, which recovers when the phone is in another room.
  • An Israeli study of judges found rulings differed sharply depending on whether the judge had eaten; 'you really want a judge that's had a good lunch.'
  • A 2005 paper by John Dani in Neuron showed SSRIs route serotonin into the dopamine system via the dopamine transporter, a roughly 40 percent effect.
  • The human brain runs on about 23 watts, whereas a server chip would burst into flames within minutes if its cooling were turned off.
  • AlphaFold solved protein-structure prediction, a problem the NIH may have spent around 100 billion dollars on over 70 years.
  • For psychotropic medications, 50 to 80 percent of a good outcome can be a placebo effect, which Montague stresses is not a fake effect.

Recommended in this episode

Books, products and media the guest or host genuinely endorsed here — with the buy link.

Affiliate link — we may earn a commission at no extra cost to you.

RecommendedProduct

Claude (Claude AI)

Anthropic (inferred)

“I use Claude. I love Claude. I love Claude AI. I you know I love the interface... I use it more and more these days and I love it.” — Read Montague 02:13:22
Find it on Amazon