Skip to content

Popular repositories Loading

  1. sleeper-agents sleeper-agents Public

    Python 12 1

  2. cluster-normalization cluster-normalization Public

    Jupyter Notebook 4 1

  3. liars-bench liars-bench Public

    Jupyter Notebook 4 2

  4. elk_old elk_old Public archive

    Python 2

  5. elk elk Public

    Forked from EleutherAI/elk

    Keeping language models honest by directly eliciting knowledge encoded in their activations. Building on "Discovering latent knowledge in language models without supervision" (Burns et al. 2022)

    Python 2 1

  6. rl-inter-spar rl-inter-spar Public

    Python 1

Repositories

Showing 10 of 17 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…