DeepSeek v3.1 Is Not Having a Moment

20 hours ago

DeepSeek delayed the release of its new model because of technical issues with Huawei’s Ascend chips, which forced the company to use Nvidia chips for training.
DeepSeek v3.1 was introduced with hybrid inference modes, improved agent skills, and 128K context support, but received little attention.
Claims of strong performance (66 on SWE-bench, 71.6% on the Aider benchmark) were not widely corroborated, and user feedback was mixed.
The model’s open-source weights are available, but limited hosting options hinder widespread testing.
China’s push for domestic chips is slowing DeepSeek’s development relative to its rivals.
