Claude vs. Gemini: Testing on 1M Tokens of Context

12 days ago
Tags: #AI, #Gemini, #Claude Sonnet 4

  • Anthropic released Claude Sonnet 4 with a 1-million-token context window, roughly enough to fit the entire Harry Potter series in a single prompt.
  • Three main tests were conducted: long context text analysis, long context code analysis, and AI Diplomacy.
  • In text analysis, Claude Sonnet 4 was faster and hallucinated less than the Gemini models, but its answers were less detailed.
  • In code analysis, Claude Sonnet 4 scored lower than the Gemini models but was slightly faster.
  • In AI Diplomacy, Claude Sonnet 4 performed well, coming in second place with unoptimized prompts.
  • Claude Sonnet 4 is priced at $6 per 1 million tokens, while Gemini Pro and Gemini Flash are cheaper at $2.50 and $0.30 per 1 million tokens, respectively.
  • Claude Sonnet 4 is recommended for tasks requiring speed and reliability, while Gemini is the better choice for detailed analysis.
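
To put the quoted prices in perspective, here is a minimal Python sketch of what a single full-context prompt would cost at each rate. It assumes a flat per-token rate taken directly from the figures above; the summary does not say whether these are input or output prices, or whether tiering applies, so treat the result as a rough comparison only. The names `PRICE_PER_MILLION_USD` and `prompt_cost_usd` are hypothetical and not part of any vendor API.

```python
# Prices in USD per 1 million tokens, as quoted in the summary above.
# Assumption: a single flat rate per model (no input/output split, no tiers).
PRICE_PER_MILLION_USD = {
    "Claude Sonnet 4": 6.00,
    "Gemini Pro": 2.50,
    "Gemini Flash": 0.30,
}


def prompt_cost_usd(model: str, tokens: int) -> float:
    """Return the cost of `tokens` tokens at the model's quoted flat rate."""
    return PRICE_PER_MILLION_USD[model] * tokens / 1_000_000


if __name__ == "__main__":
    # Compare the cost of one prompt that fills the 1M-token context window.
    full_context = 1_000_000
    for model in PRICE_PER_MILLION_USD:
        print(f"{model}: ${prompt_cost_usd(model, full_context):.2f}")
```

At these rates, filling the full 1M-token window once costs 2.4 times as much with Claude Sonnet 4 as with Gemini Pro, and 20 times as much as with Gemini Flash.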