LLM Year in Review
a day ago
- #2025
- #AI
- #LLM
- Reinforcement Learning from Verifiable Rewards (RLVR) emerged as a major stage in LLM training, enabling spontaneous development of reasoning-like strategies.
- LLMs in 2025 were seen as 'summoning ghosts' rather than 'evolving animals', highlighting their unique and jagged intelligence profiles.
- Benchmarks lost trust due to susceptibility to RLVR and synthetic data generation, leading to 'benchmaxxing' practices.
- Cursor introduced a new layer of LLM apps, bundling and orchestrating LLM calls for specific verticals with features like context engineering and autonomy sliders.
- Claude Code (CC) demonstrated the first convincing LLM Agent, running locally on users' computers and integrating private data and context.
- Vibe coding became prominent, allowing anyone to build programs via English, democratizing programming and altering software development practices.
- Google Gemini Nano banana hinted at the future of LLM GUIs, combining text generation, image generation, and world knowledge for more visual and spatial interactions.