GitHub

CreditEval is a financial credit reasoning evaluation system. It provides:

A batch evaluation pipeline (main.py) for offline scoring of CoT outputs.
A Flask backend API (app.py) exposing locate / evaluate / prompt-config endpoints.
A web UI (Risk-COT-Tagger/) for meta-evaluation, prompt editing and few-shot example management.

Installation

pip install -r requirements.txt

Command-line evaluation

main.py runs the full multi-dimension evaluation pipeline (accuracy, logical consistency, compliance) over a CoT jsonl file with user metadata:

python main.py \
  --cust_desc_file data/dag_4.3.2/cust_desc_file_sample.json \
  --cot_file data/dag_4.3.2/cot_file_sample.jsonl \
  --output results \
  --max_workers 30

Backend API + Web UI

Start the Flask backend (locate / evaluate / meta-evaluate):
```
python app.py
```
This exposes endpoints such as:
- GET /api/prompts/<dimension>/<stage>: read prompt config from common/config/*_prompt_config.yaml.
- PUT /api/prompts/<dimension>/<stage>: update prompt content.
- POST /api/locate: single-user feature extraction (locate stage).
- POST /api/locate/batch: batch locate over files.
- POST /api/evaluate/batch: batch evaluation over files.
Start the front-end HTTP server for Risk-COT-Tagger:
```
cd Risk-COT-Tagger
python start_server.py
```
Then open the printed URL (typically http://localhost:8000/start.html) to access:
- Meta-evaluation workbench for reviewing and curating errors.
- Prompt configuration pages for Locate and Evaluate stages.
- Few-shot example repository and instruction builder.***

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
Risk-COT-Tagger		Risk-COT-Tagger
common		common
evaluators		evaluators
llm_client		llm_client
utils		utils
.gitignore		.gitignore
README.md		README.md
app.py		app.py
env.example		env.example
logging_config.yaml		logging_config.yaml
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Command-line evaluation

Backend API + Web UI

About

Uh oh!

Releases

Packages

Languages

mug2mag/CreditEval

Folders and files

Latest commit

History

Repository files navigation

Installation

Command-line evaluation

Backend API + Web UI

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages