Elevating Intelligence via Efficient Model and Tool Orchestration

14 days ago


ToolOrchestra is a method for training small orchestrator models that coordinate other models and tools, aiming to improve both intelligence and efficiency.
The method uses reinforcement learning with rewards based on outcomes, efficiency, and user preferences.
Orchestrator, an 8B model, outperforms GPT-5 in accuracy and cost-efficiency on tasks like HLE, tau2-Bench, and FRAMES.
Orchestrator achieves a 37.1% score on HLE, surpassing GPT-5's 35.1% while being 2.5x more efficient.
The model generalizes well to unseen tools, offering a robust performance-cost trade-off.
ToolOrchestra demonstrates that lightweight orchestration with diverse tools is more effective and scalable than existing methods.
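
As a rough illustration of how a reward could combine outcomes, efficiency, and user preferences, here is a minimal Python sketch. It is not the formula used by ToolOrchestra: the `Trajectory` fields, the `reward` function, the linear weighting, and all weight values are hypothetical, chosen only to show how the three signals might be folded into one scalar.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class Trajectory:
    """One orchestration episode (hypothetical fields, for illustration only)."""
    solved: bool           # outcome: did the final answer pass the task check?
    cost_usd: float        # efficiency: total spend on model and tool calls
    latency_s: float       # efficiency: wall-clock time of the episode
    tools_used: List[str]  # names of the tools or models that were called


def reward(
    traj: Trajectory,
    preferred_tools: Set[str],
    w_outcome: float = 1.0,   # assumed weight on task success
    w_cost: float = 0.1,      # assumed penalty per dollar spent
    w_latency: float = 0.01,  # assumed penalty per second elapsed
    w_pref: float = 0.2,      # assumed bonus for honoring user preferences
) -> float:
    """Combine outcome, efficiency, and preference signals into one scalar."""
    outcome = 1.0 if traj.solved else 0.0
    efficiency_penalty = w_cost * traj.cost_usd + w_latency * traj.latency_s
    if traj.tools_used:
        # Fraction of calls that went to tools the user said they prefer.
        hits = sum(1 for t in traj.tools_used if t in preferred_tools)
        preference = hits / len(traj.tools_used)
    else:
        preference = 0.0
    return w_outcome * outcome - efficiency_penalty + w_pref * preference


# Two episodes that both solve the task, at very different cost.
cheap = Trajectory(solved=True, cost_usd=0.5, latency_s=10.0,
                   tools_used=["local_search"])
expensive = Trajectory(solved=True, cost_usd=5.0, latency_s=60.0,
                       tools_used=["frontier_llm"])
preferred = {"local_search"}

print(reward(cheap, preferred), reward(expensive, preferred))
```

Under these assumed weights, the cheaper episode that respects the stated tool preference scores higher than the expensive one even though both solve the task, which is the kind of trade-off a cost-aware orchestrator would be pushed to learn.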
