Story · Digg AI

From: Researchers present KL-regularized policy gradient paper at ICLR 2026

YIFENG LIU (@YIFENGLIU_AI) · 5h · Original post

Excited to present our paper, "On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning," at ICLR 2026 (Pavilion 4 #4517) this afternoon! Project page: https://github.com/complex-reasoning/RPG Paper: https://arxiv.org/abs/2505.17508

