person
Dylan Hadfield-Menell
MIT professor; Stuart Russell student; assistance games
MIT assistant professor and former Stuart Russell PhD student who works on assistance games and practical alignment of AI systems.
current Assistant Professor, MIT CSAIL
Strategy positions
Alignment first (endorses)
Solve technical alignment before capability thresholds close.
Works on assistance games, the Russellian proposal that AI should model and defer to human preferences rather than optimise fixed objectives.
Assistance games give us a framework where AI uncertainty about human preferences is a feature, not a bug.
Closest strategy neighbours
By Jaccard overlap
Other people whose strategy tags overlap with Dylan Hadfield-Menell's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.
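As a rough illustration of how a Jaccard overlap on tag sets behaves, here is a minimal sketch. It assumes each person is represented by a plain set of tag names; the function name, the tag names, and the two example people are hypothetical, and the site's actual computation may differ.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard overlap: size of intersection divided by size of union.

    Returns 0.0 when both sets are empty (a convention assumed here).
    """
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical tag sets. Only tag identity is compared, so whether a
# person endorses or opposes a tag plays no role in the score.
person_a = {"alignment-first", "assistance-games"}
person_b = {"alignment-first", "pause-advocacy"}

score = jaccard(person_a, person_b)  # 1 shared tag out of 3 distinct tags
```

Because stance is not part of the set, two people who take opposite positions on the same tag would still count as overlapping on it.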
Record last updated 2026-04-25.