SShortSingh.
0
ProgrammingDEV Community ·

Why LLMs Fail in the Real World: The Overfitting Problem in RAG Evaluation

Overfitting is a common machine learning issue where a model performs well on training data but poorly on new, unseen inputs — a problem that also affects large language models (LLMs). In Retrieval-Augmented Generation (RAG) evaluation, overfitting can cause models to memorize training examples rather than learning generalizable patterns. AI platform Narrivo highlights that overfit models are prone to failing on out-of-distribution data and can be overly sensitive to minor input variations. To counter this, experts recommend strategies such as regularization techniques like dropout, data augmentation, early stopping, and evaluating models on diverse test sets. Addressing overfitting is considered critical to building LLMs that perform reliably in real-world deployment scenarios.

0
ProgrammingHacker News ·

Data Project Maps 5,000 Restaurant Menus from the Late 19th and Early 20th Century

A data journalism project published by The Pudding explores a collection of approximately 5,000 historical restaurant menus spanning the years 1880 to 1920. The project offers a visual and analytical look at dining culture during that four-decade period. The collection provides insights into food trends, pricing, and culinary habits of the era. The interactive piece was shared on Hacker News in June 2026, drawing early attention from the tech and data community.

0
ProgrammingDEV Community ·

Dev builds bare-metal PCI, NVMe, and FAT32 drivers for custom OS from scratch

A developer working on V.E.L.O.C.I.T.Y.-OS, a bare-metal operating system designed to run inside the CPU's L3 cache, has published Part 9 of a 12-part build series. After transitioning the kernel to Ring 0 in the previous installment, the developer faced the challenge of having no drivers to read from storage or load files. To solve this, they wrote a PCI configuration space scanner in Rust that queries buses, slots, and functions via legacy I/O ports to detect attached hardware. Using the scanner's output, an NVMe block storage driver was built by locating the mass storage controller, mapping its MMIO registers, and implementing the full NVMe startup and read sequence. A FAT32 parser was also developed, giving the OS the ability to locate and read files directly from disk without relying on any existing driver stack.

0
IndiaNDTV ·

Eight Bangladeshi Nationals Held at Agartala Airport in Twin Police Operations

Tripura Police arrested eight Bangladeshi nationals at Maharaja Bir Bikram Airport in Agartala in two separate operations. The arrests were carried out based on prior intelligence inputs gathered by the security agencies. The suspects were believed to be attempting to travel to other parts of India illegally. Both operations were conducted at the airport in Tripura's state capital. The arrests highlight ongoing efforts to curb illegal cross-border movement in the northeastern region.

0
ProgrammingDEV Community ·

Using OpenAI's tiktoken for Claude Token Counts Can Skew Cost Estimates by 20%

Developers using OpenAI's tiktoken tokenizer to estimate token counts for Anthropic's Claude models risk cost and context budget errors of 15–20%, and more on code or non-English text. This happens because Claude uses its own tokenizer, which splits text differently than tiktoken, causing systematic undercounting. Anthropic provides a dedicated countTokens API endpoint in its SDK that returns accurate, model-specific token counts before inference is run. Token counts also vary across Claude model versions, meaning cached counts from older models should not be reused when switching versions. The recommended fix is to always call countTokens against the specific Claude model being used, and never apply a blanket multiplier to convert counts between models.

0
ProgrammingDEV Community ·

Why Blockchain Failures Often Stem From Cross-Layer Protocol Breakdowns

Blockchain outages are rarely caused by a single component failure; instead, they typically emerge from interactions between layers such as transaction decoding, execution, and consensus. A single malformed transaction can freeze a chain entirely if validators keep rejecting the same invalid block without purging it from the mempool, causing leadership rotation to stall progress without ever producing a fork. Consensus reaching unanimous agreement on rejection does not mean the protocol is functioning correctly, as it can indefinitely halt progress while preserving safety. Proper error classification is critical — distinguishing permanently invalid transactions from local infrastructure faults prevents operational errors from being broadcast as consensus decisions. Robust testing must simulate multi-node fault scenarios including process kills, disk failures, and mixed software versions to reveal whether local failures can cascade into a global network outage.

0
IndiaTimes of India ·

Ben Stokes, 35, retires from Test cricket, ending international career

England cricket captain Ben Stokes has announced his retirement from Test cricket, shocking the cricketing world. The announcement was made while Stokes was mid-bowling spell during an ongoing Test match. England Cricket released an official statement confirming the 35-year-old's decision. Stokes will step away from international cricket once the current Test concludes.

0
ProgrammingDEV Community ·

How Video Transcripts Improve Search, Accessibility, and Content Discovery

Video transcripts convert spoken audio into searchable text, making it easier to locate specific moments within lengthy recordings without manual scrubbing. They serve a range of practical purposes, from improving accessibility for deaf or hard-of-hearing viewers to boosting SEO by allowing search engines to index video content. Content creators can also repurpose transcripts into blog posts, social media quotes, or summaries, maximizing the value of existing recordings. For researchers and marketers, transcripts enable trend analysis and data extraction from spoken material. Organizations managing large video libraries benefit further by using transcripts for efficient cataloging and retrieval of media assets.

0
ProgrammingDEV Community ·

Engineer Maps 10 Levels of AI Skill Construction for Enterprise Workflows

A developer who spent a year building AI agent skills for enterprise clients has published a detailed framework outlining ten distinct levels of skill construction. The guide progresses from a basic single-prompt Markdown file to a complex multi-skill business closure system capable of orchestrating eight or more agents end-to-end. The author argues that most teams treat AI skills as little more than fancy prompts, which fails when real-world workflows require conditional logic, external data, or structured processes. Each level introduces new components such as knowledge bases, validation scripts, workflow routing, and inter-skill coordination. The framework is intended to help developers identify when their current approach is insufficient and what architectural step to take next.

0
ProgrammingHacker News ·

Flock surveillance cameras capture far more data than just license plates

Flock Safety cameras, widely used by law enforcement and private communities across the US, are expanding rapidly in deployment. Beyond reading license plates, the cameras collect additional vehicle and environmental data, raising privacy concerns. The technology is increasingly being adopted by neighborhoods, businesses, and police departments seeking automated surveillance tools. Critics argue the broad data collection goes beyond what most residents and users are aware of when these systems are installed.

0
ProgrammingDEV Community ·

Team Risers Builds AI Coding Assistant with Persistent Memory at Hackathon

A team called Risers developed MemoCode AI, an AI-powered software engineering assistant, during a recent hackathon. The tool is designed to retain memory of previous conversations and maintain long-term project context across sessions. It aims to help developers write, debug, and improve code more efficiently by understanding ongoing project details. The team used the experience to deepen their practical knowledge of AI agents, memory systems, and prompt engineering. They credited the hackathon organizers for providing the opportunity to build real-world AI applications.

0
ProgrammingDEV Community ·

Kiro IDE Promotes Spec-Driven Development to Cut Bugs and Rework

AWS instructor Luca D'Addeo argues that popular AI coding tools like Copilot and Cursor encourage developers to write code before thinking through requirements, leading to costly rework. He proposes Spec-Driven Development, a workflow that requires teams to define structured specifications — covering requirements, design rationale, and implementation tasks — before writing any code. The approach is implemented through Kiro, an IDE designed to enforce this planning-first philosophy. D'Addeo claims the method can reduce development time by 70%, cut production bugs by 60%, and slash necessary refactoring by 80%. The article walks through a real-world REST API example to illustrate how upfront specification prevents common pitfalls such as missing validation, poor error handling, and unclear authentication rules.

0
ProgrammingDEV Community ·

2026 AI Automation Tool Index: 12 Platforms Compared by Pricing Model and Self-Hosting

A vendor-independent reference index published on DEV Community compares 12 major automation platforms available in 2026, including Zapier, Make, n8n, Microsoft Power Automate, and MuleSoft. The index organizes tools by billing model — per-task, per-operation, per-execution, flat-rate, or enterprise quote — arguing that pricing structure affects costs more than brand choice. Self-hosting capability is also highlighted as a key differentiator, with platforms like n8n and Activepieces offering open-source, self-hostable options for teams with data residency requirements. Each platform is mapped to a specific use-case profile, ranging from non-technical teams needing simple app integrations to large enterprises requiring governance and API-led connectivity. The index was compiled by the operator of Aiprosol, an automation consultancy, and is positioned as a no-affiliate, no-paid-ranking resource.

0
ProgrammingDEV Community ·

Dev builds bare-metal UEFI bootloader and Ring 0 kernel in ongoing OS series

A developer working on a custom bare-metal operating system called V.E.L.O.C.I.T.Y.-OS published Part 8 of a 12-part series on June 27, detailing the transition from UEFI boot services to direct CPU Ring 0 control. The project aims to build a Single-Address-Space Operating System running entirely within the CPU's L3 cache, bypassing conventional OS layers for microsecond-level execution. To safely exit UEFI, the developer implemented three core modules: a pre-allocated 16MB heap allocator, a Global Descriptor Table with flat 64-bit kernel segments, and an Interrupt Descriptor Table for exception handling. A Task State Segment with an Interrupt Stack Table was also configured to handle double-fault exceptions without triggering CPU resets. The series is written in Rust using no-std conventions, with upcoming parts covering bare-metal drivers, a spatial GUI renderer, multi-agent scheduling, and eventual self-optimization via a local language model.

0
WorldBBC World ·

Australian man charged with murder over teen girl found dead in suitcase in Thailand

An Australian man has been charged with murder following the discovery of a teenage girl's body in Thailand. The 17-year-old's remains were found stuffed inside a suitcase near a railway track on Saturday. Thai police located the body after it was discarded at the site. Authorities have since apprehended and formally charged the Australian national in connection with the killing.

0
SportsESPNcricinfo ·

Ben Stokes to retire from international cricket after third Test against India

England Test captain Ben Stokes has announced his retirement from international cricket, set to take effect after the third Test of the current series. Stokes informed his teammates of his decision at the start of the fourth day of play in Nottingham. The announcement marks the end of an era for English cricket, as Stokes has been one of the most influential figures in the team's recent history. He has led England under the 'Bazball' philosophy alongside coach Brendon McCullum, transforming the side's approach to Test cricket.

0
ProgrammingDEV Community ·

US Government Orders OpenAI to Restrict GPT-5.6 Access to Approved Partners Only

The Trump administration directed OpenAI to limit the release of its latest model, GPT-5.6, granting access only to government-approved partners during the preview phase. OpenAI CEO Sam Altman informed staff of the restriction on June 25, and the company issued an official confirmation the following day. The request came from two federal bodies — the Office of the National Cyber Director (ONCD) and the Office of Science and Technology Policy (OSTP) — acting under an executive order Trump signed in early June authorizing government review of frontier AI models before release. Anthropic faced a similar intervention earlier, with its Claude Fable 5 model pulled shortly after launch over safety concerns. OpenAI stated it disagrees with such approval systems becoming a long-term default, while critics warn the move creates an opaque, de facto licensing regime with no clear safety benchmarks.

0
ProgrammingDEV Community ·

V.E.L.O.C.I.T.Y.-OS JIT Gains Compiler Optimization Passes in Series Part 7

The seventh installment of the V.E.L.O.C.I.T.Y.-OS development series details the addition of classic compiler optimization passes to the project's JIT compiler. Four passes were implemented directly on the AST in src/compiler/nda_jit.rs before machine code emission, targeting redundant or unnecessary operations. These include constant folding, which evaluates static expressions like Add(5,3) at compile time, and constant propagation, which replaces variable loads with known constant values. Loop unrolling for small static-count loops was also added, alongside dead code elimination to remove unused nodes. The goal is to produce smaller and cleaner x86-64 machine code output from the JIT pipeline.

0
ProgrammingHacker News ·

Freelancers Seek Shared Database to Track Employers with Unpaid Contract History

A Hacker News user has raised a question about whether a publicly accessible list or website exists to track employers who have failed to pay contractors. The post was prompted by the user's personal experience working with multiple clients who delayed or withheld contract payments. The query has attracted 14 comments from the community, suggesting the topic resonates with other freelancers and independent contractors. No definitive platform was identified in the original post, highlighting a potential gap in resources available to protect contract workers from non-paying clients.

← NewerPage 100 of 183Older →