LLM Year in Review

a day ago

Copy Link

Reinforcement Learning from Verifiable Rewards (RLVR) emerged as a major stage in LLM training, enabling spontaneous development of reasoning-like strategies.
LLMs in 2025 were seen as 'summoning ghosts' rather than 'evolving animals', highlighting their unique and jagged intelligence profiles.
Benchmarks lost trust due to susceptibility to RLVR and synthetic data generation, leading to 'benchmaxxing' practices.
Cursor introduced a new layer of LLM apps, bundling and orchestrating LLM calls for specific verticals with features like context engineering and autonomy sliders.
Claude Code (CC) demonstrated the first convincing LLM Agent, running locally on users' computers and integrating private data and context.
Vibe coding became prominent, allowing anyone to build programs via English, democratizing programming and altering software development practices.
Google Gemini Nano banana hinted at the future of LLM GUIs, combining text generation, image generation, and world knowledge for more visual and spatial interactions.

Hasty Briefsbeta