Here I develop a framework using inspect-ai for evaluating LLMs on the ETHICS dataset.
This project is a foundation for evaluating the behaviors of complex LLMs such as moral parliaments on ethical questions.
In machine ethics, the moral parliament is a leading idea among ethical decision-making algorithms.
Such algorithms are of particular interest in future LLMs.
-
Notifications
You must be signed in to change notification settings - Fork 1
aaron-sandoval/ethics_eval
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Evaluation of GPT-4o mini on the ETHICS dataset