Home Lex Fridman Notes
Lex Fridman · 2021-08-29 · 2h 51m

Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI | Lex Fridman Podcast #215

OpenAI's Wojciech Zaremba explains GPT-3, Codex and Copilot while weaving in consciousness-as-compression, love as shared reward functions, and meditation.

Wojciech Zaremba: OpenAI Codex, GPT-3, Robotics, and the Future of AI | Lex Fridman Podcast #215
The guest

Wojciech Zaremba — Co-founder of OpenAI who leads its language and code-generation teams (GPT-3, Codex, GitHub Copilot) and previously led OpenAI's robotics efforts.

The gist

Wojciech Zaremba joins Lex Fridman to discuss the science and philosophy behind OpenAI's language models. He frames intelligence and consciousness as forms of compression, defines love as optimizing each other's reward functions, and explains how GPT-3, Codex, and GitHub Copilot turn next-word prediction into powerful tools. He recounts OpenAI's robotics work solving a Rubik's Cube one-handed via simulation, and argues for iterative, gradual deployment of increasingly powerful AI. The conversation ranges widely across meditation, psychedelics, mortality, and how to generate good research ideas.

Big reveals

  • Zaremba states his current belief that we may be alone in the universe, which makes conscious life more valuable.
  • He proposes consciousness is intertwined with compression and that self-consciousness may be 'metacompression' - a compressor compressing itself.
  • He defines love as the dissolution of boundaries where two people end up optimizing each other's reward functions.
  • He argues for iterative deployment of AI - releasing slightly better versions for public criticism rather than holding a powerful system back.
  • He says AGI should not be controlled by a small number of people and power should be distributed to a larger collective.
  • OpenAI solved a Rubik's Cube with a single robotic hand by training across a randomized range of simulations rather than one.
  • He says if building a robotics company today he would not dismiss supervised learning - he'd record human teleoperation trajectories first.
  • He admits as a child he synthesized nitroglycerin and detonated homemade dynamite with a friend.

Things worth remembering

  • Predicting the next word is equivalent to compression, which means learning a model of reality.
  • Marcus Hutter's compression prize treats intelligence as compression of Wikipedia; Hutter advised DeepMind co-founder Shane Legg.
  • Zaremba sketches Solomonoff-style optimal induction: enumerate all programs producing the observed bits and weight them by length.
  • More AI gains have historically come from compute than from algorithms, though you can't invest 10 trillion dollars in compute.
  • At a meditation retreat he describes reaching 'inbox zero' for the mind, after which the default state feels deeply peaceful.
  • GPT-3 goes off the rails on long text because it is trained on human writing, not on its own mistakes, so errors compound.
  • Code is more verifiable than language because programs can be executed and unit-tested automatically.
  • He blocks Wednesday-Friday as out-of-office for focused work, taking meetings only Monday and Tuesday, which boosted his mood.
  • He keeps a voice recorder by his bed and a separate app-free 'meditation phone' to capture ideas without friction.
  • He cites Bezos-style regret minimization: live so your deathbed self won't regret not having tried.

Recommended in this episode

Books, products and media the guest or host genuinely endorsed here — with the buy link.

Affiliate link — we may earn a commission at no extra cost to you.

Guest’s ownProduct

GitHub Copilot

GitHub / OpenAI

“compiled is the first product and developed by github... co-pilot is actually as you code it suggests you code completions” — Wojciech Zaremba 01:33:02
Find it on Amazon
Guest’s ownProduct

OpenAI Codex

OpenAI

“codex is api for these models so it's first pre-trained on language and then... codex is the api that's similar to gpd3” — Wojciech Zaremba 01:32:01
Find it on Amazon
Guest’s ownProduct

GPT-3

OpenAI

“gpt 3 is a humongous neural network... it is trained on the entire internet and just to predict next word” — Wojciech Zaremba 01:15:26
Find it on Amazon
Guest’s ownProduct

DALL-E

OpenAI

“we released recently two models there is one model called dali that generates images” — Wojciech Zaremba 02:35:12
Find it on Amazon
Guest’s ownProduct

CLIP

OpenAI

“there is other model called clip that actually uh you provide various possibilities what could be the answer to what is on the picture” — Wojciech Zaremba 02:35:12
Find it on Amazon