Cadenza Labs

sleeper-agents Public

Python 12 1

cluster-normalization Public

Jupyter Notebook 4 1

liars-bench Public

Jupyter Notebook 4 2

elk_old Public archive

Python 2

elk Public

Forked from EleutherAI/elk

Keeping language models honest by directly eliciting knowledge encoded in their activations. Building on "Discovering latent knowledge in language models without supervision" (Burns et al. 2022)

Python 2 1

rl-inter-spar Public

Python 1
