Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
Published in ArXiv, 2025
Recommended citation: Guoqing Wang, Sunhao Dai, Guangze Ye, Zeyu Gan, Wei Yao, Yong Deng, Xiaofeng Wu, Zhenzhe Ying. Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents. arXiv preprint arXiv:2510.14967, 2025. https://arxiv.org/abs/2510.14967
