DeepSeek v3.1 Is Not Having a Moment
20 hours ago
- #DeepSeek
- #AI Models
- #Chip Restrictions
- DeepSeek delayed its new model release due to technical issues with Huawei’s Ascend chips, forcing them to use Nvidia chips for training.
- DeepSeek v3.1 was introduced with hybrid inference modes, improved agent skills, and 128K context support, but received little attention.
- Claims of strong performance (66 on SWE, 71.6% on Aider benchmark) were not widely corroborated, with mixed user feedback.
- The model’s open-source weights are available, but limited hosting options hinder widespread testing.
- DeepSeek’s progress is impacted by China’s push for domestic chips, slowing development compared to rivals.