Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Published in Proceedings of the Association for Computational Linguistics (ACL), Main Conference, 2025

Recommended citation: Yancheng He*, Shilong Li*, Jiaheng Liu*, et al. "Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?" ACL 2025.
Download Paper