Conversation

@ScooterStuff
Collaborator

Add a verify.py script that reads data.jsonl (a small sample of taco.jsonl) and checks whether the expected output matches the output produced by running the solution code.

Also, add summary_report.csv and overall_summary.csv files that list the requirements whose examples produced mismatched output and report the overall pass rate as a percentage.
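
For readers without the diff open, the check described above could be sketched roughly as follows. This is not the verify.py from this PR, only a minimal illustration: the record keys `requirement`, `solution`, and `examples` (each example with `input` and `output`) are hypothetical, as are the function names, and the real layout of data.jsonl may differ.

```python
import subprocess
import sys


def run_solution(code, stdin_text, timeout=5.0):
    """Run the solution source in a fresh interpreter; return stdout, or None on timeout."""
    try:
        result = subprocess.run(
            [sys.executable, "-c", code],
            input=stdin_text,
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return None
    return result.stdout


def outputs_match(expected, actual):
    """Compare line by line, ignoring trailing whitespace and surrounding blank lines."""
    def normalize(text):
        return [line.rstrip() for line in text.strip().splitlines()]
    return normalize(expected) == normalize(actual)


def verify_record(record):
    """Return one row per example whose actual output differs from the expected output."""
    mismatches = []
    for example in record["examples"]:  # hypothetical key
        actual = run_solution(record["solution"], example["input"])  # hypothetical keys
        if actual is None or not outputs_match(example["output"], actual):
            mismatches.append({
                "requirement": record["requirement"],  # hypothetical key
                "input": example["input"],
                "expected": example["output"],
                "actual": actual,
            })
    return mismatches


def pass_rate(records):
    """Percentage of records in which every example matched."""
    records = list(records)
    if not records:
        return 0.0
    passed = sum(1 for record in records if not verify_record(record))
    return 100.0 * passed / len(records)
```

Under these assumptions, the rows returned by `verify_record` would be what ends up in a per-requirement report, and `pass_rate` gives the single percentage for the overall summary. Each record would come from one line of data.jsonl parsed with `json.loads`.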

@feixiangdejiahao
Collaborator

Please re-extract the examples for the requirements that did not fully pass, and generate a new dataset from the result. In the new dataset, every requirement must have its examples extracted, and every example must pass.
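
To make the acceptance condition concrete, here is a small sketch of how records might be split into those that already satisfy it and those that need re-extraction. The function name and the `(record, flags)` input shape are assumptions for illustration, not code from this repository; `flags` is taken to hold one boolean per example, true when that example passed.

```python
def split_by_pass_status(results):
    """Split (record, flags) pairs into fully passing records and ones to re-extract.

    A record with no examples at all is treated as needing re-extraction,
    since the condition also requires that examples have been extracted.
    """
    fully_passed, needs_reextraction = [], []
    for record, flags in results:
        if flags and all(flags):
            fully_passed.append(record)
        else:
            needs_reextraction.append(record)
    return fully_passed, needs_reextraction
```

With this split, the new dataset is complete once `needs_reextraction` comes back empty after a re-run.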

3 participants