Green CI Tests Confirm Correctness, Not Improvement — Here's the Difference

·2 views

A widely shared developer essay argues that passing test suites only confirm code correctness, not whether a change actually improves the system. The green bar in CI verifies that inputs map to expected outputs, but it does not measure outcomes like latency, user activation, or agent behavior. The author distinguishes Spec-Driven Development, which ends at a passing build, from Hypothesis-Driven Development, which starts with a predicted outcome and ends with measured validation. Frameworks from Thoughtworks and PMI have long addressed outcome measurement, yet most engineering teams treat test conformance as a proxy for improvement. The core argument is that correctness is necessary but not sufficient for progress, and the two questions — did it work as written, and did it make things better — require separate, deliberate instrumentation.

Read the full story at DEV Community

This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)

Why Institutional Memory Is a Hidden but Critical Layer in Financial Systems

Distributed financial systems are typically described through code, databases, and architecture diagrams, but a significant portion of how they actually operate depends on knowledge held by engineers, analysts, and operators. This unwritten knowledge — covering edge cases, provider quirks, reconciliation anomalies, and recovery procedures — is what experts call institutional memory. When a system's correctness or recoverability relies on people remembering the right things at the right time, that memory effectively becomes part of the infrastructure. The danger arises when this knowledge is never formally encoded, leaving systems vulnerable whenever key personnel are unavailable or move on. The article argues that organizations must transform such fragile, undocumented assumptions into explicit, auditable, and resilient system design.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

Tutorial: Build a Box Plot Calculator Using Vanilla JavaScript and SVG

A developer tutorial published on DEV Community walks readers through creating a fully functional box plot calculator using only vanilla JavaScript and SVG, with no external libraries such as D3 or Chart.js. The guide covers computing quartiles via linear interpolation (method 7, consistent with NumPy and pandas), detecting outliers using Tukey's fences, and rendering the chart as raw SVG elements. The tutorial emphasizes that choosing the wrong quartile method is a common mistake, noting at least nine different percentile computation approaches exist. The author also references a free companion tool, aiboxplot.com, which extends the concept with AI analysis, multi-dataset comparison, and export options.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

NFL Teams Leave Wins on the Table by Ignoring 4th-Down Analytics, Data Shows

An analysis of 5,847 fourth-down situations across NFL seasons from 2018 to 2023 found that teams attempt conversions only 31% of the time in high-probability scenarios where math favors going for it. In situations such as 4th-and-2 or shorter within opponent territory before the fourth quarter, historical conversion rates range from 64% to 73%, making the expected value of attempting a conversion significantly higher than punting. Teams that follow data-backed conversion decisions gain approximately 2.3 additional wins per season, potentially turning an 8-9 record into a 10-7 finish. Across all 32 NFL franchises, analysts estimate this conservative play-calling costs the league roughly 40 wins per season in aggregate. Despite every NFL organization now employing dedicated analytics staff, a gap persists between what the data recommends and what coaches actually call on game day.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

Three MCP Servers That Extend Claude Desktop With Files, GitHub, and Databases

A developer has shared three Model Context Protocol (MCP) servers they actively use with Claude Desktop for daily workflows. The filesystem server allows Claude to read project files directly, eliminating the need to manually copy-paste code into the chat. The GitHub server enables Claude to review pull requests, check issues, and browse repositories within the same conversation. A PostgreSQL server grants Claude direct database query access, letting users retrieve data without switching tools. Each server can be installed via the mcp-hub CLI in roughly two minutes, though users are cautioned to restrict filesystem paths carefully to avoid exposing their entire disk.

0 comments Read more at DEV Community