- Anthropic's Claude Code tool appears to inject around 20,000 extra tokens into requests starting from version 2.1.100, causing users to hit usage limits faster.
- Users, even on the $200 monthly Max plan, report depleting their quotas within hours, sometimes as quickly as 90 minutes, despite Anthropic acknowledging the issue without explanation.
- A developer's proxy test revealed that v2.1.100 billed 69,922 tokens for the same task that used 49,726 tokens in v2.1.98, with the inflation occurring server-side and invisible to users.
- The extra tokens may be linked to new session memory features, potentially diluting custom instructions and degrading output quality in long sessions.
- Community workaround: downgrade to v2.1.98 using 'npx [email protected]'. This issue adds to user frustration after Anthropic recently restricted subscription limits for third-party tools.
- RunPod advertises cheaper, on-demand GPU access for AI/ML tasks, offering hardware like H100 and RTX 4090, billed per second with quick setup.