Show HN: FLE v0.3 – Claude Code Plays Factorio

a day ago

The Factorio Learning Environment (FLE) v0.3.0 introduces major improvements for testing agents in long-term planning and world modeling.
FLE now supports headless operation, enabling scalable experimentation without the Factorio game client.
The environment conforms to the OpenAI Gym interface, simplifying integration with existing research codebases.
Agents can interact with Factorio via Python code, receiving structured observations about the game state.
Frontier models like Claude, GPT, Gemini, and Grok show varying performance levels in FLE, with Claude leading in pragmatic errors.
Common failure patterns include API misunderstandings, syntactic errors, and difficulties in maintaining mental models of the factory layout.
FLE serves as a benchmark for evaluating agent capabilities in system engineering, logistics, and long-horizon planning.
The environment is open-source, encouraging community contributions and collaboration.
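
To make the Gym-style loop described above concrete, here is a minimal sketch in which the action is a string of Python source and the observation is a structured dictionary. Everything in it is an assumption for illustration: `ToyFactoryEnv`, the `place` tool, and the observation keys are hypothetical stand-ins and are not taken from the FLE API. The `reset`/`step` return shapes follow the general Gym convention only.

```python
import contextlib
import io


class ToyFactoryEnv:
    """Hypothetical stand-in for a Gym-style environment; NOT the FLE API."""

    def __init__(self, max_steps=10):
        self.max_steps = max_steps
        self.entities = []
        self.steps = 0

    def _observation(self, stdout="", error=None):
        # Structured observation: world state plus the program's output and any error.
        return {"entities": list(self.entities), "stdout": stdout, "error": error}

    def reset(self):
        self.entities = []
        self.steps = 0
        return self._observation(), {}

    def step(self, action):
        """Run `action` (a string of Python source) against a tiny tool namespace."""
        self.steps += 1
        before = len(self.entities)

        def place(name, x, y):
            # Hypothetical tool exposed to the agent's program.
            self.entities.append({"name": name, "position": (x, y)})

        buffer = io.StringIO()
        error = None
        try:
            with contextlib.redirect_stdout(buffer):
                exec(action, {"place": place})
        except Exception as exc:
            # Syntax and runtime errors are returned to the agent, not raised.
            error = f"{type(exc).__name__}: {exc}"

        reward = float(len(self.entities) - before)  # toy reward: entities added
        terminated = False
        truncated = self.steps >= self.max_steps
        return self._observation(buffer.getvalue(), error), reward, terminated, truncated, {}


env = ToyFactoryEnv()
obs, info = env.reset()

# A well-formed program: places one entity and prints a message.
obs, reward, terminated, truncated, info = env.step(
    "place('burner-mining-drill', 0, 0)\nprint('placed')"
)

# A syntactically broken program: the error comes back inside the observation.
bad_obs, bad_reward, *_ = env.step("place('stone-furnace', 1,")
```

Returning errors as part of the observation, rather than raising them, lets an agent read its own syntactic or API mistakes and retry on the next step, which is one plausible way the failure patterns listed above could be measured.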
