Function-Level Chunking Significantly Improves LLM Code Retrieval Accuracy

·1 views

A developer building a retrieval-augmented generation (RAG) system over a codebase found that splitting source code by fixed token counts produced fragmented, near-useless results. The core issue was that standard line-based chunking broke functions mid-body, leaving embeddings that represented incomplete code with no signature or context. Switching to function-aware chunking — one complete function per chunk, including its signature and doc comment — dramatically improved the quality of answers from the language model. The approach uses syntax tree parsers like ts-morph for TypeScript or AST-based tools for Solidity to extract whole function nodes rather than raw text slices. Running the embedding pipeline locally via Ollama ensured that private codebases never left the developer's machine while still enabling accurate, name-based semantic queries.

Read the full story at DEV Community

This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)

Oracle PeopleSoft Vulnerabilities Exploited in Attack on Nissan and 100+ Firms

A coordinated cyberattack exploiting vulnerabilities in Oracle PeopleSoft has compromised more than 100 organizations, including Nissan, exposing sensitive employee data. Attackers leveraged known flaws in PeopleSoft's Java deserialization handlers and HTTP endpoints to achieve remote code execution on application servers. Once inside, threat actors were able to harvest authentication tokens, LDAP credentials, password hashes, and OAuth secrets stored within the platform. Because PeopleSoft systems typically integrate with enterprise identity infrastructure such as Active Directory and cloud HR platforms, the breach creates pathways for lateral movement across connected networks. The campaign highlights the elevated risk posed by centralized identity management systems that hold privileged access to broader enterprise environments.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

Developer builds AI agent to automate AWS-to-GKE app migration with human oversight

A software developer created an AI-powered tool called a 'skill' for the Antigravity CLI (agy) to automate the refactoring of cloud-dependent codebases from AWS to Google Kubernetes Engine (GKE). The tool addresses common migration pain points such as hardcoded AWS credentials, proprietary SDK usage like boto3, and local disk storage incompatible with ephemeral Kubernetes pods. It works by scanning cloud dependencies, spawning parallel subagents to refactor code and infrastructure, and validating changes on a local Kubernetes cluster before deployment. A mandatory human-in-the-loop (HITL) approval gate is built in to prevent any unsupervised changes from reaching production environments. The approach contrasts with simple scripted find-and-replace methods by using an LLM agent capable of understanding semantic context and adapting to the current state of the codebase.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

Go's Built-In pprof Tool Lets Developers Profile Live Services in Minutes

Go includes a built-in profiling tool called pprof that requires no third-party software or agents to operate. Developers can enable it by importing the net/http/pprof package, which registers HTTP endpoints exposing CPU, memory, goroutine, and mutex data. A 30-second CPU sample can be collected using the go tool pprof command, and results can be visualized as flame graphs through a built-in web UI. Flame graphs help identify bottlenecks such as excessive memory allocations, JSON serialization overhead, or lock contention by showing which functions consume the most CPU time. For security, pprof endpoints should only be bound to localhost or a private interface, never exposed on a public port.

0 comments Read more at DEV Community

ProgrammingDEV Community ·

Telnyx Number Lookup API lets developers identify carrier and line type via Python

A developer tutorial on DEV Community walks through building a Python Flask service that queries the Telnyx Number Lookup API to retrieve carrier name, line type, and portability status for any phone number. The service wraps the API with a 24-hour in-memory cache, so repeated lookups are served locally without additional API calls. Three endpoints handle lookups via JSON body, URL path, and cache inspection respectively. Common use cases include screening inbound calls, enriching sales leads, routing messages based on line type, and detecting number porting for compliance purposes. The full code is available in the Telnyx open-source examples repository on GitHub.

0 comments Read more at DEV Community

Function-Level Chunking Significantly Improves LLM Code Retrieval Accuracy

Discussion (0)

Related stories

Oracle PeopleSoft Vulnerabilities Exploited in Attack on Nissan and 100+ Firms

Developer builds AI agent to automate AWS-to-GKE app migration with human oversight

Go's Built-In pprof Tool Lets Developers Profile Live Services in Minutes

Telnyx Number Lookup API lets developers identify carrier and line type via Python