AGI Strategies

person

Tom Henighan

Anthropic; ex-OpenAI; physicist turned alignment researcher

Anthropic researcher with a physics background; co-author on multiple foundational scaling and alignment papers including the original GPT-3 paper.

current Member of Technical Staff, Anthropic
past Researcher, OpenAI

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Argues frontier-lab safety work needs people who came up through capability research; the technical understanding required for alignment is the same understanding required to push the frontier.

Language models are few-shot learners. The implications for both capabilities and alignment fall out of that single fact in ways that we are still working through.

Context: Co-author of the GPT-3 paper.

§ paperLanguage Models are Few-Shot Learners· arXiv / OpenAI· 2020-05· faithful paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Tom Henighan's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.