Pinned Loading
-
NYTConnectionsBench
NYTConnectionsBench PublicAn evaluation benchmarking LLMs on the New York Times Connections word puzzle
Python
-
rag-evaluation
rag-evaluation PublicA QA RAG system that uses a custom chromadb to retrieve relevant passages and then uses an LLM to generate the answer.
-
ParentingBench
ParentingBench PublicBenchmark for evaluating LLM parenting advice quality and safety
Python
-
blur-reader
blur-reader PublicBlur Paragraphs: A Chrome extension for focused reading by blurring non-hovered text.
JavaScript 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



