Claude vs. Gemini: Testing on 1M Tokens of Context

12 days ago

Copy Link

Anthropic released Claude Sonnet 4 with a 1-million token context window, equivalent to the entire Harry Potter series in one prompt.
Three main tests were conducted: long context text analysis, long context code analysis, and AI Diplomacy.
In text analysis, Claude Sonnet 4 was faster and hallucinated less than Gemini models but provided less detailed answers.
For code analysis, Claude scored lower than Gemini models but was slightly faster.
In AI Diplomacy, Claude Sonnet 4 performed well, coming in second place with unoptimized prompts.
Claude Sonnet 4 is priced at $6 per 1 million tokens, while Gemini Pro and Flash are cheaper at $2.50 and $0.30 respectively.
The model is recommended for tasks requiring speed and reliability but Gemini is better for detailed analysis.

Hasty Briefsbeta