Tags: context* + llm*


  1. A new ETH Zurich study challenges the common practice of using `AGENTS.md` files with AI coding agents. LLM-generated context files decrease performance (3% lower success rate, +20% steps/costs). Human-written files offer small gains (4% higher success rate) but also increase costs. The researchers recommend omitting context files unless they are manually written with non-inferable details (tooling, build commands). They tested this using a new dataset, AGENTbench, with four agents.
  2. RAG combines language models with external knowledge. This article explores context & retrieval in RAG, covering search methods (keywords, TF-IDF, embeddings/FAISS/Chroma), context length challenges (compression, re-ranking), and contextual retrieval (query & conversation history).
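    The TF-IDF scoring mentioned above can be sketched in a few lines. This is a minimal illustrative version in pure Python; the example documents and the exact scoring formula are assumptions, not taken from the article (production code would use a library such as scikit-learn):

```python
import math
from collections import Counter

def tf_idf_scores(docs, query):
    """Score each document against the query with a simple TF-IDF sum."""
    n = len(docs)
    tokenized = [d.lower().split() for d in docs]
    # document frequency: in how many documents does each term appear?
    df = Counter(t for doc in tokenized for t in set(doc))
    scores = []
    for doc in tokenized:
        tf = Counter(doc)  # term frequency within this document
        scores.append(sum(
            (tf[t] / len(doc)) * math.log(n / df[t])
            for t in query.lower().split() if t in tf
        ))
    return scores

docs = ["cats chase mice", "dogs chase cats", "mice eat cheese"]
scores = tf_idf_scores(docs, "mice")  # only the 1st and 3rd docs mention "mice"
```

    Terms that appear in every document get an IDF of log(1) = 0, so common words contribute nothing to the score.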
  3. This research introduces Doc-to-LoRA (D2L), a method for efficiently processing long documents with Large Language Models (LLMs). D2L creates small, adaptable "LoRA" modules that distill key information from a document, allowing the LLM to answer questions without needing the entire document in memory. This significantly reduces latency and memory usage, enabling LLMs to handle contexts much longer than their original capacity and facilitating faster knowledge updates.
    2026-02-27 by klotz
  4. Here’s the simplest version — key sentence extraction:
    ```
    def extract_relevant_sentences(document, query, top_k=5):
        # embed() and cosine_sim() are assumed embedding helpers
        sentences = document.split('.')
        query_embedding = embed(query)
        scored = []
        for sentence in sentences:
            similarity = cosine_sim(query_embedding, embed(sentence))
            scored.append((sentence, similarity))
        scored.sort(key=lambda x: x[1], reverse=True)
        return '. '.join(s[0] for s in scored[:top_k])
    ```

    For each sentence, compute its similarity to the query. Keep the top 5 and discard the rest.
  5. mcp-cli is a lightweight CLI that enables dynamic discovery of MCP servers, reducing token consumption and making tool interactions more efficient for AI coding agents.
    2026-01-09 by klotz
  6. Python implementation of Recursive Language Models for processing unbounded context lengths. Process 100k+ tokens with any LLM by storing context as variables instead of prompts.
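    A minimal sketch of the variables-instead-of-prompts idea: hold the long context in an ordinary program variable, show the model only small slices, and recurse on the partial answers. The chunking scheme and the `ask_llm` callback are illustrative assumptions, not the repository's actual API:

```python
def chunk_context(text, chunk_size=2000):
    """Store a long context as a list of chunks held in a variable."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def answer_over_chunks(chunks, question, ask_llm):
    """Answer per chunk, then recursively condense until one prompt fits."""
    partial = [ask_llm(f"Context:\n{c}\n\nQ: {question}") for c in chunks]
    if len(chunks) == 1:
        return partial[0]
    # combine the partial answers and recurse on the (much shorter) result
    return answer_over_chunks(chunk_context("\n".join(partial)), question, ask_llm)
```

    Each recursion level shrinks the material the model has to see at once, which is how a fixed-window LLM can cover 100k+ tokens of source text.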
  7. This blog post explains that Large Language Models (LLMs) don't need to understand the Model Context Protocol (MCP) to utilize tools. MCP standardizes tool calling, simplifying agent development for developers while the LLM simply generates tool call suggestions based on provided definitions. The article details tool calling, MCP's function, and how it relates to context engineering.
    2025-08-07 by klotz
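    The tool-calling flow described above can be sketched as follows. The tool name, schema, and dispatcher here are illustrative examples in the JSON-schema style common to tool-calling APIs, not taken from the post:

```python
# Illustrative only: the LLM never executes anything; it emits a
# structured suggestion that the host application dispatches.
tool_definition = {
    "name": "get_weather",  # hypothetical example tool
    "description": "Return current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

# What a model emits after seeing the definition: plain data.
model_suggestion = {"tool": "get_weather", "arguments": {"city": "Zurich"}}

def dispatch(suggestion, registry):
    """The agent runtime, not the LLM, runs the real function."""
    return registry[suggestion["tool"]](**suggestion["arguments"])
```

    MCP's role is to standardize where such definitions come from and how the dispatch happens, so the model only ever deals with the suggestion format.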
  8. >"This document provides a comprehensive overview of the engineering repository, which implements a systematic approach to context engineering for Large Language Models (LLMs). The repository bridges theoretical foundations with practical implementations, using a biological metaphor to organize concepts from simple prompts to complex neural field systems."
    2025-07-01 by klotz
  9. LLM 0.24 introduces fragments and template plugins to better utilize long context models, improving storage efficiency and enabling new features like querying logs by fragment and leveraging documentation. It also details improvements to template handling and model support.
    2025-04-08 by klotz
  10. Qwen2.5-1M models and inference framework support for long-context tasks, with a context length of up to 1M tokens.
    2025-01-27 by klotz


SemanticScuttle - klotz.me: tagged with "context+llm"
