A toy Inspect implementation of the Bliss Attractor eval from the Claude 4 System Card welfare assessment
Updated Jun 5, 2025 - Python
A Responsible AI Stewardship License for AI Safety and Welfare
🤖 Foster responsible AI development with the SAFE-AI License, ensuring safety and fairness in ethical AI infrastructure and project accountability.
TriEthix is an evaluation framework that benchmarks frontier LLMs across three foundational ethical perspectives (virtue ethics, deontology, and consequentialism) in three steps: (Step 1) Moral Weights; (Step 2) Moral Consistency; and (Step 3) Moral Reasoning. According to its authors, TriEthix reveals robust moral profiles relevant to AI safety, governance, and welfare.