YIFENG LIU (@YIFENGLIU_AI)
Excited to present our paper, "On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning," at ICLR 2026 (Pavilion 4 #4517) this afternoon! Project page: https://github.com/complex-reasoning/RPG Paper: https://arxiv.org/abs/2505.17508
