AGI Strategies

person

Nat McAleese

OpenAI researcher; ex-DeepMind reliability

AI reliability and alignment researcher at OpenAI; previously at DeepMind working on debate-style oversight and reward modelling.

current Researcher, OpenAI
past Research engineer, Google DeepMind

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Works on reward modelling and debate-style oversight; publicly engaged with alignment research.

Teaching language models to support answers with verified quotes is a concrete alignment sub-problem we can make progress on.
§ paperTeaching Language Models to Support Answers with Verified Quotes· arXiv· 2022· faithful paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Nat McAleese's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.