Codex, Opus, Gemini Try to Build Counter Strike

13 days ago

Copy Link

Three major AI model updates were tested: Gemini 3 Pro, Codex Max 5.1, and Claude Opus 4.5.
The challenge was to build a basic multiplayer 3D UI version of Counter Strike.
Claude Opus 4.5 excelled in frontend tasks, creating visually appealing maps, characters, and guns.
Gemini 3 Pro performed best in backend tasks, handling multiplayer and persistence with fewer errors.
Codex Max 5.1 was a balanced performer, doing reasonably well in both frontend and backend tasks.
Each model was given 7 consecutive prompts divided into frontend (game mechanics) and backend (multiplayer functionality).
Claude's designs were more visually appealing, while Gemini's logical changes were more robust.
Codex had some initial bugs but fixed them quickly, though its visuals were less impressive.
All models successfully built a multiplayer FPS with zero hand-written code, showcasing their iterative capabilities.
The experiment highlighted areas for improvement, such as better handling of React hooks and reducing the steep learning curve for non-programmers.

Hasty Briefsbeta