Skillware v0.4.0 Adds Real-Time Token Budget Control for AI Agents

·2 views

Skillware, an open registry of installable agent capabilities, has released version 0.4.0 featuring a new skill called monitoring/token_limiter that lets developers cap token usage for AI agents in real time. The tool works across major model providers including OpenAI, Gemini, Claude, DeepSeek, and Ollama using a single, provider-neutral Python implementation. After each model turn, the host loop passes cumulative token usage to the skill, which returns one of three decisions: continue, warn at 80% of budget, or force-terminate when the hard ceiling is reached. The skill does not directly cancel API sessions but returns a structured decision that the orchestrator must act upon, keeping it portable and free of provider-specific dependencies. An optional model ID parameter maps to bundled pricing data to provide indicative cost estimates for operations dashboards, though the figures are not invoice-grade.

Read the full story at DEV Community

This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)

Google DeepMind Launches Gemini Image Flash Lite Model

Google DeepMind has released a new model called Gemini Image Flash Lite, accessible via its official DeepMind website. The release was noted on Hacker News, where it attracted initial community attention. The model appears to be part of Google's expanding Gemini AI lineup, targeting lightweight or efficient image-related tasks. Details about the model's full capabilities and intended use cases are available through the official DeepMind product page.

0 comments Read more at Hacker News

ProgrammingDEV Community ·

IRIS 2025.2 Introduces Native OAuth2 Support for Web Application Authentication

InterSystems IRIS 2025.2 now supports OAuth2 as a native authentication method for web applications, eliminating the need for manual token-validation workarounds. OAuth2 allows third-party apps to access protected APIs using scoped, time-limited, and revocable access tokens instead of sharing user credentials. A typical setup involves four roles: a resource owner, a client app, an authorization server such as Keycloak, and IRIS acting as the resource server that validates tokens and enforces access rules. IRIS can now parse an incoming access token and automatically establish a user context, including username and roles, similar to how other authentication types work. A hands-on demo using Docker, Keycloak, and Postman is available on Open Exchange to help developers reproduce and explore the integration locally.

0 comments Read more at DEV Community

ProgrammingHacker News ·

Crypto industry pours $189M into 2026 US election cycle, report finds

Cryptocurrency companies have collectively spent $189 million on the 2026 US election cycle, according to a new report. The figure highlights the industry's growing political influence as it seeks favorable regulation from lawmakers. The spending marks a significant escalation of crypto-sector involvement in American electoral politics. This financial commitment reflects the industry's broader push to shape policy outcomes in Washington ahead of the midterm elections.

0 comments Read more at Hacker News

ProgrammingDEV Community ·

Developer Shares Why He Switched from Relational Databases to MongoDB

A software developer has shared his experience transitioning from relational databases like PostgreSQL to MongoDB after hitting scalability and flexibility limits. He found that managing schema migrations and horizontal scaling for write-heavy, globally distributed applications was consuming too much development time. MongoDB's document-based, flexible schema model allowed him to prototype faster and evolve his data structure alongside application code. He noted that MongoDB's built-in AI features and cost-effective infrastructure made it a better fit for modern application demands. The developer acknowledged that relational databases remain superior for complex multi-table joins, emphasizing that the choice should depend on the specific use case.

0 comments Read more at DEV Community