AGI Strategies

person

Dylan Hadfield-Menell

MIT professor; Stuart Russell student; assistance games

MIT assistant professor and former Stuart Russell PhD student who works on assistance games and practical alignment of AI systems.

current Assistant Professor, MIT CSAIL

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Works on assistance games, the Russellian proposal that AI should model and defer to human preferences rather than optimise fixed objectives.

Assistance games give us a framework where AI uncertainty about human preferences is a feature, not a bug.
articleDylan Hadfield-Menell homepage· MIT CSAIL· 2017· loose paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Dylan Hadfield-Menell's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.