📑 Paper
The official release for the EMNLP 2025 Findings paper "CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning" by Jianfeng Pan, Senyou Deng, and Shaomang Huang.
The Comprehensive Reasoning Benchmark (CRB) dataset contains 205 professionally reviewed questions, each paired with its evaluation rules and total score; together these form one evaluation entry. Entries follow the design principles of subjective questions in the Chinese Gaokao examination. Each entry has three components: the Question, the Judge Rules, and the Score. The Judge Rules list the fundamental key points that an adequate response must address. Each key point carries a specific score, which is awarded when the answer includes that key point. The Judge Rules also include higher-level criteria that serve as bonus points. The Score is the maximum attainable score for the entry.
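The scoring scheme described above can be illustrated with a minimal Python sketch. It is not the official evaluation code: the class and field names (`KeyPoint`, `CRBEntry`, `judge_rules`, `is_bonus`), the sample entry, and the choice to cap the total at the entry's Score are all assumptions made for illustration. How a judge decides that an answer covers a key point is also outside this sketch, so the covered key points are passed in as indices.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class KeyPoint:
    """One rule from the Judge Rules (hypothetical structure)."""
    description: str
    points: float
    is_bonus: bool = False  # higher-level criteria that act as bonus points


@dataclass
class CRBEntry:
    """One evaluation entry: Question, Judge Rules, and Score."""
    question: str
    judge_rules: List[KeyPoint]
    score: float  # maximum attainable score for this entry


def score_answer(entry: CRBEntry, covered: Set[int]) -> float:
    """Sum the points of the covered key points.

    `covered` holds the indices of the key points that a judge found
    in the answer. Capping at `entry.score` is an assumption based on
    the Score being the maximum attainable score.
    """
    earned = sum(
        rule.points
        for index, rule in enumerate(entry.judge_rules)
        if index in covered
    )
    return min(earned, entry.score)


# Illustrative entry; the question and rules are made up, not taken from CRB.
entry = CRBEntry(
    question="Explain why the sky appears blue.",
    judge_rules=[
        KeyPoint("Mentions scattering of sunlight by the atmosphere", 4),
        KeyPoint("Notes that shorter wavelengths scatter more strongly", 4),
        KeyPoint("Relates the effect to sunset colors", 2, is_bonus=True),
    ],
    score=10,
)

partial = score_answer(entry, covered={0, 2})   # first key point plus the bonus
full = score_answer(entry, covered={0, 1, 2})   # every key point covered
```

In this sketch bonus points are summed the same way as basic key points; an actual evaluation may treat them differently.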
If you find this work useful for your research, please cite the following paper.
@inproceedings{pan-etal-2025-coat,
title = {{C}o{AT}: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning},
author = {Pan, Jianfeng and Deng, Senyou and Huang, Shaomang},
editor = {Christodoulopoulos, Christos and Chakraborty, Tanmoy and Rose, Carolyn and Peng, Violet},
booktitle = {Findings of the Association for Computational Linguistics: EMNLP 2025},
month = nov,
year = 2025,
address = {Suzhou, China},
publisher = {Association for Computational Linguistics},
note = {\url{https://aclanthology.org/2025.findings-emnlp.700/}},
pages = {13028--13045},
ISBN = {979-8-89176-335-7}
}