CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning
Published in arXiv, 2025
This paper introduces CoT-Space, a novel theoretical framework that recasts LLM reasoning as a continuous optimization problem and provides a coherent explanation for empirical phenomena such as overthinking.
Recommended citation: Zeyu Gan, Hao Yi, Yong Liu. CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning. arXiv preprint arXiv:2509.04027, 2025. https://arxiv.org/abs/2509.04027
