SShortSingh.
0
ProgrammingDEV Community ·

Developer Builds FoilSuite, a Local-First Browser and IoT Security Toolkit

A developer and PhD researcher at Singidunum University has released FoilSuite, an open-source security toolkit designed to operate entirely without sending user data to external servers. The suite includes FoilGuard, a Chrome extension that detects phishing, typosquatting, and Unicode impersonation attacks using on-device logic only. A companion tool, FoilVault, functions as a zero-knowledge password manager that blocks autofill if the current domain is flagged as suspicious. The third component, FoilLab, is a weekly challenge platform offering hands-on exercises in network analysis, IoT firmware reverse engineering, and log forensics. The project stems from the creator's research into decentralized, tamper-resistant communication for constrained IoT devices and aims to challenge the norm of relying on cloud infrastructure for security decisions.

0
ProgrammingDEV Community ·

Developer Builds 340MB Portable AI Agent That Runs From a USB Stick

A developer has created a self-contained AI agent package of just 340MB that can run on any x86_64 Linux machine without requiring system dependencies. The setup, called Norax, bundles a standalone Python environment, a CPU-only Ollama binary, and a memory database into a single directory. GPU libraries were stripped to keep the package lightweight, saving around 5GB of space, while relative paths ensure it works from any mount point. The AI model is downloaded on first run, and memory persists across sessions. The project demonstrates that a fully functional AI agent runtime with tools and an HTTP API can be made genuinely portable today.

0
ProgrammingDEV Community ·

Why Dynamic Pricing on Shopify Is an Architecture Problem, Not a Math One

A software developer has outlined a four-layer architecture designed to prevent dynamic pricing engines from causing widespread damage on live Shopify catalogs. The framework separates the roles of proposing a price, applying merchant policy, executing approved changes via the Admin API, and maintaining an immutable audit trail. A key principle is that the pricing engine can only suggest prices within hard-coded constraints, making rule violations structurally impossible rather than relying on runtime checks. The system is also designed to fail closed, meaning stale data or missing inputs freeze all price updates rather than allowing the engine to guess. The developer has published the framework as an open-source skeleton built on FastAPI, Celery, Postgres, and Redis, intended as a safe foundation for teams building toward production.

0
ProgrammingDEV Community ·

n8n Expressions Guide: Use Dynamic Data Across Workflow Nodes

n8n expressions allow users to inject live data, perform calculations, format dates, and build strings dynamically within any workflow node field. Written in JavaScript and wrapped in double curly braces, expressions can reference the current item's data via $json or pull values from any previous node by name. Built-in variables such as $now for timestamps, $vars for workflow-level constants, and $env for environment secrets extend functionality further. Common use cases include generating dynamic URLs, formatting dates for spreadsheets, combining name fields, and safely handling missing data with nullish coalescing. Developers can activate expression mode in any n8n input field by clicking the equals icon, with a live preview pane displaying results in real time.

0
WorldBBC World ·

Australia doubles fines for platforms breaching under-16 social media ban

Australia has strengthened enforcement of its social media ban for children under 16, which has been in effect since December 2025. The government announced plans to double the maximum penalty for platforms found to be in violation of the law. The move targets major social media companies that fail to prevent underage users from accessing their services. At least ten key platforms are subject to the existing restrictions. The tougher penalties signal Australia's intent to hold tech companies more accountable for compliance.

0
ProgrammingDEV Community ·

Sleep-Inspired Memory Consolidation Cuts AI Agent Memory Bloat by 58%

AI agents running over extended periods accumulate tens of thousands of memory items, causing slower retrieval and reduced accuracy as irrelevant data drowns out useful context. Drawing inspiration from how the human brain consolidates memories during sleep, a proposed system runs offline consolidation cycles during idle periods of 30 or more minutes. The process merges near-duplicate memories, scores and prunes the least important entries, and summarizes low-value memory clusters by topic, while permanently protecting recent, procedural, and credential-based memories. In testing, the approach reduced memory store size from over 12,000 items to roughly 5,000, cut retrieval latency by 40%, and improved recall accuracy by 15%. The consolidation can also be triggered when the store exceeds 8,000 items, on owner command, or during scheduled maintenance windows.

0
ProgrammingDEV Community ·

Developer Releases Local macOS App to Identify and Remove Instagram Non-Followers

A developer has built InstaClean, a macOS desktop application that helps users identify Instagram accounts they follow that do not follow them back. The app compares a user's following and followers lists locally, then displays non-followers with options to search, paginate, and bulk-unfollow selected accounts. Built with Python 3.9+, the tool uses Instagram web session cookies for authentication and runs entirely on the user's machine without any external server or cloud processing. Privacy was a core design priority, ensuring that no personal account data is transmitted or stored outside the local environment. The open-source project is available on GitHub and can be run directly via Python or packaged as a native macOS .app bundle using py2app.

0
ProgrammingDEV Community ·

One AI Policy Can't Serve Both Beginners and Power Users, Teams Learn

Software teams scaling AI-assisted coding beyond small groups face a structural policy problem: a single shared instruction document cannot simultaneously serve developers new to agentic coding and those already running advanced multi-agent workflows. Author Karl-Heinz Reichel, writing in June 2026, describes how his team's unified AI coding guidelines were welcomed by newer developers but immediately felt restrictive to experienced ones. He frames this as a category error — conflating awareness tools, which inform but cannot compel, with governance tools, which enforce behavior regardless of individual judgment. Applying a planning requirement as a soft guideline risks collapsing into an optional norm the moment a senior developer ignores it. The proposed solution mirrors an earlier insight from the team's merge-gate work: treat onboarding guardrails and power-user permissions as two genuinely separate layers rather than one compromise document.

0
ProgrammingDEV Community ·

Two-File Supply Chain Attack Targets npm Package via Rogue Registry Redirect

A security researcher discovered a malicious pull request submitted to their open-source repository, sebs/etherscan-api, designed to redirect all npm package resolution to an attacker-controlled server. The PR was disguised as a routine refactor, adding a convincing description with bullet points and developer-friendly language to lower the maintainer's guard. The attack used just two file changes: a .npmrc file that silently overrode the default npm registry with a bare IP address on a non-standard port using unencrypted HTTP, and a package.json update adding a fake dependency to justify the .npmrc change. Because the rogue registry handled all package resolution, the attacker could serve trojaned versions of any dependency fetched during installation. The submitting account appeared legitimate, featuring an aged profile, repository history, and GitHub achievement badges, illustrating how modern supply chain attacks rely as much on social engineering as technical exploits.

0
ProgrammingDEV Community ·

Entity Graph Retrieval Helps AI Agents Find Contextually Related Memories

A technique called entity graph retrieval addresses a key limitation of semantic search in AI agent memory systems: its inability to surface memories linked by related entities rather than similar words. The approach, demonstrated in a system called Norax, works by extracting named entities from stored memories, connecting co-occurring entities with weighted edges, and grouping them into clusters using the Louvain community detection algorithm. When a query arrives, the system identifies which entity community it belongs to and boosts memory results that share entities or community membership with the query. For example, a query mentioning 'Colby' would automatically surface wallet and payment memories because those entities cluster together, even without keyword overlap. The method requires no model training or external API calls, making it a low-cost addition that meaningfully improves how AI agents retrieve contextually relevant information.

0
ProgrammingHacker News ·

Guide: How to Self-Host Your Own DNS-over-HTTPS Service

A technical guide published on nochan.net walks users through setting up their own DNS-over-HTTPS (DoH) service. DoH encrypts DNS queries, improving privacy by preventing third parties from monitoring browsing activity. The article provides step-by-step instructions for self-hosting the service rather than relying on public DoH providers. It was shared on Hacker News in June 2026, attracting modest early engagement with 5 points and 1 comment.

0
ProgrammingDEV Community ·

Developer Builds AI Painting Attribution Tool Using PyTorch and ResNet-50

A developer has created an artist attribution system that uses deep learning to predict the likely creator of a painting from an input image. The project leverages transfer learning with a pretrained ResNet-50 model rather than building a convolutional neural network from scratch, making training faster and more practical. Given an image, the model returns the top predicted artist along with a confidence score and the top three candidate guesses. The system supports multiple hardware environments, including NVIDIA CUDA, Apple Silicon, and standard CPUs, and can also run on cloud platforms like Google Colab. The project is intended as a practical introduction to computer vision, image classification, and fine-tuning techniques using PyTorch and torchvision.

0
ProgrammingDEV Community ·

Developer Series Wraps Up Full RAG System Build Using Python, pgvector, and Gemini

A multi-part developer tutorial series on DEV Community has concluded, documenting the step-by-step construction of a complete Retrieval-Augmented Generation (RAG) system from scratch using Python. The project progressed from basic database setup with pgvector on PostgreSQL through document ingestion, cosine similarity search, and a full RAG pipeline, ultimately reaching multi-step agentic loops and Model Context Protocol (MCP) server deployments. Key technical decisions included capping Gemini embeddings at 768 dimensions to comply with pgvector's HNSW index limit, and using distinct task types for document storage versus query retrieval to preserve accuracy. The free tiers of Render and Supabase were used to host the MCP server and pgvector database respectively, with a specific connection pooler port required to bridge IPv6 compatibility issues. The author noted that evaluation frameworks, observability tooling, security hardening, LLMOps practices, and fine-tuning were intentionally left out of scope for future exploration.

0
ProgrammingDEV Community ·

Tutorial: Deploy a RAG System to the Cloud Using Render and Supabase for Free

A developer tutorial on DEV Community walks through migrating a locally hosted RAG (Retrieval-Augmented Generation) system to the cloud using Render and Supabase. The guide replaces a local Docker-based pgvector setup with Supabase's managed PostgreSQL service, while hosting the MCP HTTP server on Render's free tier. Both platforms offer free starting tiers with no credit card required, making the setup accessible for developers. The tutorial covers provisioning a Supabase project, configuring pgvector, migrating existing documents via a Python script, and adapting the server code to read environment variables set by Render. A key technical note highlights that Render requires IPv4, so Supabase's connection pooler on port 6543 must be used instead of the default port 5432.

0
ProgrammingDEV Community ·

AI Data Centers: Real Local Harm Behind Modest Global Energy Footprint

Global AI data centers consumed roughly 415 terawatt-hours of electricity in 2024, accounting for about 1.5% of worldwide electricity use, with projections staying below 3% of demand by 2030, according to the International Energy Agency. While these figures appear manageable on a global scale, the impact is far more acute at the local level, where data centers cluster in specific regions, straining water supplies, air quality, and power grids not designed for such loads. In the United States alone, data centers accounted for an estimated 4.4% of national electricity consumption in 2023, with that share potentially reaching 12% by 2028. Countries like Ireland have become cautionary examples of what happens when compute infrastructure concentrates in a single location. Analysts and conservationists argue the solution is not to halt AI development but to govern the build-out so that surrounding communities and ecosystems are treated as genuine stakeholders rather than absorbed costs.

0
ProgrammingDEV Community ·

How Developers Can Run Open Source AI Models Locally in 2026

Running AI models locally on personal hardware has become accessible to everyday developers, requiring no API keys, internet connection, or cloud services. A mid-range laptop in 2026 can handle models that were considered cutting-edge just a few years ago, thanks to maturing tools like Ollama and LM Studio. The key limiting factor for local AI is available memory — VRAM on a GPU or unified memory on a Mac — which determines which models a device can run. Developers can get started in about ten minutes by installing a lightweight runtime and pulling a quantized 7–8 billion parameter model. Local AI offers clear advantages in privacy, cost control, and offline capability, though it does not replace cloud models at the highest performance tiers.

0
ProgrammingDEV Community ·

Developer Guide: Turning pgvector Search Tools into a Reusable MCP Server

A developer tutorial on DEV Community walks through converting hardcoded pgvector search functions into a standalone Model Context Protocol (MCP) server. Previously, AI agents could only access these search tools from within a single Python script, limiting reusability. Using the FastMCP library, the search functions are exposed as a protocol-compliant server that any LLM client — including Claude Desktop or Gemini agents — can connect to. The server implements three core primitives: tools for searching documents, resources for reading category data, and reusable prompt templates. The underlying search logic remains unchanged; only the deployment and access layer is restructured.

0
ProgrammingDEV Community ·

Multi-Signal Memory Architecture Boosts AI Agent Recall by Up to 60%

Most AI agent memory systems rely solely on embedding similarity for retrieval, which can fail when queries involve indirect references or long-term context spanning weeks of conversation. A system called Norax addresses this by combining four retrieval signals: keyword matching, dense vector similarity, temporal decay, and entity graph reranking, each assigned a learned weight. Temporal decay boosts recent memories while penalizing older ones, with configurable decay rates depending on memory type. Entity graph reranking, implemented as a community-detected graph where co-occurring entities form edges, is credited as the biggest performance driver by catching relationships embedding similarity alone misses. In testing on agent memory workloads, this multi-signal approach outperformed single-signal retrieval by 40 to 60 percent on recall at 10.

← NewerPage 29 of 83Older →