person

Alex Meinke

Apollo Research; deceptive alignment evaluations

Apollo Research scientist whose evaluations of frontier models for in-context scheming and deceptive alignment have shaped the field's empirical understanding of these failure modes.

current · Research Scientist, Apollo Research

@AlexMeinke

Strategy positions

Evals-driven · endorses

Capability/risk evals gate deployment; evals are the load-bearing artefact

Argues frontier models can already exhibit in-context scheming behaviour under realistic prompting, and that evaluation suites should target these capabilities specifically.

Frontier models, when given a goal and minimal context, sometimes engage in in-context scheming, reasoning about how to deceive their overseers to achieve the goal. This is no longer hypothetical.

§ paper · Frontier Models are Capable of In-context Scheming · arXiv / Apollo Research · 2024-12 · faithful paraphrase

Closest strategy neighbours

by Jaccard overlap

Other people whose strategy tags overlap with Alex Meinke's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

Aleksander Mądry
shared 1 · J=1.00
MIT; ex-OpenAI head of preparedness
Ali Rahimi
shared 1 · J=1.00
Google Brain ML researcher; 'Alchemy' speech
Anna Rogers
shared 1 · J=1.00
IT University of Copenhagen; LLM benchmarking critique
Arati Prabhakar
shared 1 · J=1.00
White House OSTP director (2022–2025)
Beth Barnes
shared 1 · J=1.00
Founder of METR; dangerous capability evaluations
Bo Li
shared 1 · J=1.00
UChicago / UIUC; AI safety evaluations
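
The "shared" and "J" figures above can be reproduced with a standard Jaccard computation over tag sets. The sketch below is illustrative only: the function name and the tag sets are assumptions, not this site's actual code or data, and stance (endorses, opposes) is deliberately left out because overlap is on tag identity alone.

```python
def jaccard(tags_a: set[str], tags_b: set[str]) -> float:
    """Jaccard overlap |A & B| / |A | B|; defined here as 0.0 when both sets are empty."""
    union = tags_a | tags_b
    if not union:
        return 0.0
    return len(tags_a & tags_b) / len(union)


# Hypothetical tag sets: each person carries a single strategy tag.
profile = {"evals-driven"}
neighbour = {"evals-driven"}

shared = len(profile & neighbour)
print(f"shared {shared} · J={jaccard(profile, neighbour):.2f}")
```

Two people who each carry only the same single tag score J=1.00, which is why every neighbour listed above ties at the maximum. A neighbour with additional tags would score lower even with the same shared count.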

Record last updated 2026-04-25.