person
Max Bartolo
Cohere; LLM evaluation researcher
Researcher at Cohere; previously a UCL DeepMind PhD student. Co-developed adversarial-training and evaluation methods for question-answering and instruction-following.
current Research Scientist, Cohere
Strategy positions
Evals-drivenendorses
Capability/risk evals gate deployment; evals are the load-bearing artefactArgues evaluation methods that adversarially probe model weaknesses are the only way to characterize what models will do in deployment; static benchmarks are insufficient.
Adversarial evaluation reveals failure modes that static benchmarks miss. As models become more capable, our evaluation has to become more adversarial too.
Closest strategy neighbours
by jaccard overlapOther people whose strategy tags overlap with Max Bartolo's. Overlap is on tag identity, not stance; opposites can show up if they reference the same tags.
Record last updated 2026-04-25.