Weave Router cuts AI coding costs 40% by routing tasks to optimal models
Weave has released an open-source model router called Weave Router, designed to plug into popular AI coding agents such as Claude Code, Codex, and Cursor. The tool acts as an Anthropic/OpenAI-compatible endpoint, analyzing each inference request and directing it to the most appropriate language model based on task complexity. Simpler tasks like codebase exploration are sent to faster, cheaper models such as DeepSeek V4 Flash, while complex planning or implementation work is routed to frontier models like Opus or GPT. The routing logic is powered by a reinforcement learning model trained on tens of thousands of agent traces, rewarding correct model selection when a task is completed successfully. Weave reports saving 40% on token costs during a month of internal use, with no observed drop in quality or development speed; the router is available for self-hosting under the Elastic License 2.0 or as a hosted service at weaverouter.com.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.
Discussion (0)
Log in to join the discussion and vote.
Log in