person

Lilian Weng

Thinking Machines; former OpenAI VP of Research

Former VP of Research at OpenAI who helped lead safety research there. Wrote widely-read technical blog posts on RLHF and alignment. Joined Mira Murati's Thinking Machines Lab in 2024.

current Researcher, Thinking Machines Lab

past VP of Research, Safety Systems, OpenAI

homepage @lilianweng

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Public educator on alignment technical work. Ran OpenAI's Safety Systems team before leaving for Thinking Machines.

Reward modelling, RLHF, red-teaming, and eval frameworks are the concrete instruments of alignment today.

✍ blogLil'Log· Lil'Log· 2023· loose paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Lilian Weng's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

Aaron Courville
shared 1 · J=1.00
Université de Montréal; Deep Learning textbook co-author
Adam Jermyn
shared 1 · J=1.00
Anthropic; previously astrophysics
Adam Kalai
shared 1 · J=1.00
Microsoft Research; AI fairness and safety
Agnes Callard
shared 1 · J=1.00
University of Chicago philosopher; aspiration theorist
Ajeya Cotra
shared 1 · J=1.00
Open Philanthropy researcher; 'biological anchors' forecaster
Alan Turing
shared 1 · J=1.00
Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.