Hasty Briefsbeta

Elevating Intelligence via Efficient Model and Tool Orchestration

14 days ago
  • #Efficiency Optimization
  • #AI Orchestration
  • #Reinforcement Learning
  • ToolOrchestra introduces small orchestrators to manage models and tools, improving intelligence and efficiency.
  • The method uses reinforcement learning with rewards based on outcomes, efficiency, and user preferences.
  • Orchestrator, an 8B model, outperforms GPT-5 in accuracy and cost-efficiency on tasks like HLE, tau2-Bench, and FRAMES.
  • Orchestrator achieves a 37.1% score on HLE, surpassing GPT-5's 35.1% while being 2.5x more efficient.
  • The model generalizes well to unseen tools, offering a robust performance-cost trade-off.
  • ToolOrchestra demonstrates that lightweight orchestration with diverse tools is more effective and scalable than existing methods.