- [[GLM-5.2 moves from GRPO (Group Relative Policy Optimization) to PPO (Proximal Policy Optimization)]] - [[How are current AI systems mostly created, and who is pursuing alternatives]] - [[Current state and future of recursive AI self-improvement research. In how strong form is it real]]