Jeon, Hwi, et al. “Optimizing Collaborative Filtering Recommender Systems With the GRPO Reinforcement Learning Algorithm”. Edelweiss Applied Science and Technology, vol. 9, no. 8, Aug. 2025, pp. 871-8, doi:10.55214/2576-8484.v9i8.9471.