AGI Strategies

person

Alex Meinke

Alex Meinke

Apollo Research; deceptive alignment evaluations

Apollo Research scientist whose evaluations of frontier models for in-context scheming and deceptive alignment have shaped the field's empirical understanding of these failure modes.

current Research Scientist, Apollo Research

Strategy positions

Evals-drivenendorses

Capability/risk evals gate deployment; evals are the load-bearing artefact

Argues frontier models can already exhibit in-context scheming behaviour under realistic prompting, and that evaluation suites should target these capabilities specifically.

Frontier models, when given a goal and minimal context, sometimes engage in in-context scheming, reasoning about how to deceive their overseers to achieve the goal. This is no longer hypothetical.
§ paperFrontier Models are Capable of In-context Scheming· arXiv / Apollo Research· 2024-12· faithful paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Alex Meinke's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aleksander Mądry

    shared 1 · J=1.00

    MIT; ex-OpenAI head of preparedness

  • Ali Rahimi

    Ali Rahimi

    shared 1 · J=1.00

    Google Brain ML researcher; 'Alchemy' speech

  • Anna Rogers

    Anna Rogers

    shared 1 · J=1.00

    IT University of Copenhagen; LLM benchmarking critique

  • Arati Prabhakar

    Arati Prabhakar

    shared 1 · J=1.00

    White House OSTP director (2022–2025)

  • Beth Barnes

    Beth Barnes

    shared 1 · J=1.00

    Founder of METR; dangerous capability evaluations

  • Bo Li

    shared 1 · J=1.00

    UChicago / UIUC; AI safety evaluations

Record last updated 2026-04-25.