AGI Strategies

person

Ethan Perez

Anthropic researcher; red-teaming language models

Anthropic research scientist focused on red-teaming and sycophancy. Has published foundational work on model evaluation and LM-generated evaluations.

Current: Research Scientist, Anthropic

Strategy positions

Alignment first · endorses

Solve technical alignment before capability thresholds close

Designs red-teaming protocols and model-evaluation frameworks. Significant empirical contributor to Anthropic's alignment work.

You can use a language model to red-team another language model, and that lets you scale evaluation in ways humans alone cannot.
§ paper · Red Teaming Language Models with Language Models · arXiv · 2022 · faithful paraphrase

Closest strategy neighbours

by Jaccard overlap

Other people whose strategy tags overlap with Ethan Perez's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.
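As a rough illustration of how a "shared" count and a J score like those listed below could arise, here is a minimal sketch that assumes the standard Jaccard index over sets of strategy tags. The page does not state its exact computation, and the tag names and the `jaccard` helper are hypothetical.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; treated as 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical tag sets; the real records may carry different tags.
person_tags = {"alignment-first"}
neighbour_tags = {"alignment-first"}
broader_tags = {"alignment-first", "some-other-strategy"}

shared = len(person_tags & neighbour_tags)          # number of tags in common
j_identical = jaccard(person_tags, neighbour_tags)  # identical tag sets
j_partial = jaccard(person_tags, broader_tags)      # one shared tag out of two total
```

Under this assumption, a score of 1.00 with a single shared tag means both records carry exactly the same one tag. Stance (endorses or opposes) plays no part in the score.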

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.