The LLM Lobotomy

5 hours ago

Copy Link

The author is working on a product using Azure for LLMs and Audio Models, testing conversational flows with system prompts.
Over six months, the same LLM model has shown decreasing accuracy in responses despite using identical messages and prompts.
The author notes that newer models like gpt-5-mini and nano are slower and less accurate compared to previous versions like gpt-4o-mini.
Microsoft is suspected of intentionally degrading older models to push users towards newer, albeit inferior, versions.
The author criticizes this strategy, emphasizing the importance of accuracy and consistency, and considers moving away from Azure due to instability.
The author has documented proof of these issues and urges Microsoft to either improve products or maintain stable, backward-compatible services.

Hasty Briefsbeta