Skillware v0.4.0 Adds Real-Time Token Budget Control for AI Agents
Skillware, an open registry of installable agent capabilities, has released version 0.4.0 featuring a new skill called monitoring/token_limiter that lets developers cap token usage for AI agents in real time. The tool works across major model providers including OpenAI, Gemini, Claude, DeepSeek, and Ollama using a single, provider-neutral Python implementation. After each model turn, the host loop passes cumulative token usage to the skill, which returns one of three decisions: continue, warn at 80% of budget, or force-terminate when the hard ceiling is reached. The skill does not directly cancel API sessions but returns a structured decision that the orchestrator must act upon, keeping it portable and free of provider-specific dependencies. An optional model ID parameter maps to bundled pricing data to provide indicative cost estimates for operations dashboards, though the figures are not invoice-grade.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)
Log in to join the discussion and vote.
Log in