AGI Strategies

person

Jacob Hilton

Alignment Research Center; Prover-Verifier Games

Alignment researcher at the Alignment Research Center and independent researcher. Has published influential work on prover-verifier games and eliciting latent knowledge.

current Researcher, Alignment Research Center

Strategy positions

Alignment firstendorses

Solve technical alignment before capability thresholds close

Technical alignment work on eliciting honest behaviour from more-capable models.

The alignment problem is about getting less-capable verifiers to reliably elicit truth from more-capable provers.
§ paperProver-Verifier Games Improve Legibility· arXiv· 2024-07· faithful paraphrase

Closest strategy neighbours

by jaccard overlap

Other people whose strategy tags overlap with Jacob Hilton's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.

  • Aaron Courville

    shared 1 · J=1.00

    Université de Montréal; Deep Learning textbook co-author

  • Adam Jermyn

    shared 1 · J=1.00

    Anthropic; previously astrophysics

  • Adam Kalai

    shared 1 · J=1.00

    Microsoft Research; AI fairness and safety

  • Agnes Callard

    Agnes Callard

    shared 1 · J=1.00

    University of Chicago philosopher; aspiration theorist

  • Ajeya Cotra

    shared 1 · J=1.00

    Open Philanthropy researcher; 'biological anchors' forecaster

  • Alan Turing

    Alan Turing

    shared 1 · J=1.00

    Founder of theoretical computer science (1912–1954)

Record last updated 2026-04-25.