AGI Strategies

person

Lilian Weng

Thinking Machines; former OpenAI VP of Research

Former VP of Research at OpenAI who helped lead safety research there. Wrote widely-read technical blog posts on RLHF and alignment. Joined Mira Murati's Thinking Machines Lab in 2024.

current Researcher, Thinking Machines Lab
past VP of Research, Safety Systems, OpenAI

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Public educator on alignment technical work. Ran OpenAI's Safety Systems team before leaving for Thinking Machines.

Reward modelling, RLHF, red-teaming, and eval frameworks are the concrete instruments of alignment today.
blogLil'Log· Lil'Log· 2023· loose paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Lilian Weng's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.