Luo, Jianing. “Policy Gradient Methods for Multi-Agent Reinforcement Learning: A Comparative Study.” Highlights in Science, Engineering and Technology, vol. 140, May 2025, pp. 378-83, https://doi.org/10.54097/58j7ca95.