- [[GLM-5.2 moves from GRPO (Group Relative Policy Optimization) to PPO (Proximal Policy Optimization)]]
- [[How are current AI systems mostly created, and who is pursuing alternatives]]
- [[Current state and future of recursive AI self-improvement research. In how strong form is it real]]