AGI Strategies

Jan Leike

Former co-lead of OpenAI's Superalignment team; now at Anthropic

Co-led OpenAI's Superalignment team with Ilya Sutskever. Resigned in May 2024, stating that 'safety culture and processes have taken a backseat to shiny products'. Now runs alignment science at Anthropic.

current: Head of Alignment Science, Anthropic
past: Co-lead of Superalignment, OpenAI

Profile

expertise

Frontier builder

Currently or recently led training, architecture, or safety work on a frontier model. Hands on the loss curve.

Former co-lead of OpenAI's Superalignment team; now Head of Alignment Science at Anthropic. Long publication record in scalable oversight, debate, and recursive reward modelling.

recognition

Field-leading

Widely known inside the AI and AI-safety community. Appears repeatedly in top venues, podcasts, or governance forums. Not a household name to outsiders.

Public resignation from OpenAI in May 2024 was widely covered. Known across the field; not a household name.

vintage

Deep-learning rise

Came up post-AlexNet. ImageNet, AlphaGo, transformer paper. DeepMind, Google Brain, FAIR establish the modern lab template.

DeepMind 2015–2021, OpenAI 2021–2024 (co-led Superalignment from the team's launch in 2023). Came up in the deep-learning era; his outlook matured during the scaling years.

Hand-classified. See the board for the criteria and the full grid.

Strategy positions

Alignment first · endorses

Solve technical alignment before capability thresholds close

Argues alignment research must scale with capabilities; publicly resigned when he felt that balance had been lost.

“Over the past years, safety culture and processes have taken a backseat to shiny products.”

Context: Resignation thread on X/Twitter.

tweet · Resignation thread · X/Twitter · 2024-05-17 · direct quote

Closest strategy neighbours

by Jaccard overlap

Other people whose strategy tags overlap with Jan Leike's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)
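
The J values in the list above are consistent with the standard Jaccard index: the number of tags two people share, divided by the number of distinct tags across both. The page does not show how its scores are actually computed, so the Python sketch below is an assumption, and the tag identifiers in it are hypothetical.

```python
def jaccard(tags_a: set[str], tags_b: set[str]) -> float:
    """Standard Jaccard index |A & B| / |A | B|.

    Returns 0.0 when both sets are empty (a convention chosen here,
    not something the page specifies).
    """
    union = tags_a | tags_b
    if not union:
        return 0.0
    return len(tags_a & tags_b) / len(union)


# Hypothetical tag sets; the identifiers are illustrative only.
leike = {"alignment-first"}
neighbour = {"alignment-first"}           # same single tag
broader = {"alignment-first", "pause"}    # one shared tag, one extra

shared = len(leike & neighbour)
print(f"shared {shared} · J={jaccard(leike, neighbour):.2f}")  # shared 1 · J=1.00
print(f"shared {len(leike & broader)} · J={jaccard(leike, broader):.2f}")  # shared 1 · J=0.50
```

Under this definition, anyone whose only tag matches a one-tag profile scores 1.00. That would explain why every neighbour listed shows "shared 1 · J=1.00", and why people with opposite stances on the same tag can still rank as closest.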

Record last updated 2026-04-24.