Skip to content
@METR

METR

Model Evaluation and Threat Research

Model Evaluation and Threat Research (METR)

METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.

We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.

Read more about our work here.

Our Software

Popular repositories Loading

  1. eval-analysis-public eval-analysis-public Public

    Public repository containing METR's DVC pipeline for eval data analysis

    Python 204 40

  2. task-standard task-standard Public

    METR Task Standard

    TypeScript 174 36

  3. vivaria vivaria Public

    Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

    TypeScript 134 39

  4. RE-Bench RE-Bench Public

    Python 133 18

  5. public-tasks public-tasks Public

    HTML 118 18

  6. hcast-public hcast-public Public

    HTML 19 4

Repositories

Showing 10 of 55 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.