This article explores the critical architectural decision of where to store conversation history when building AI agents. It examines how different storage strategies impact user experience, privacy, cost, and portability. The author compares service-managed versus client-managed storage models and details how modern APIs support both linear threads and forking/branching capabilities.
Key topics include:
* Service-Managed vs. Client-Managed storage tradeoffs
* Linear (single-threaded) vs. Forking-capable conversation models
* Strategies for context window management and compaction, such as truncation, summarization, and sliding windows (a sliding-window trim appears in the sketch after this list)
* How Microsoft Agent Framework abstracts these patterns using AgentSession and ChatHistoryProvider to ensure provider-agnostic code
* Practical implementation examples for the Responses API in different modes
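To make the two storage modes concrete, here is a minimal sketch against the OpenAI Responses API using the official Python SDK; the model name is a placeholder, and the sliding-window trim at the end illustrates just one of the compaction strategies above.

```
# Sketch: service-managed vs. client-managed conversation history with the
# OpenAI Responses API (model name is a placeholder).
from openai import OpenAI

client = OpenAI()

# Service-managed: the API persists each turn; chain turns by response id.
first = client.responses.create(
    model="gpt-4.1-mini",
    input="What is context compaction?",
    store=True,
)
followup = client.responses.create(
    model="gpt-4.1-mini",
    input="Summarize that in one sentence.",
    previous_response_id=first.id,  # reusing an older id instead forks the thread
)

# Client-managed: keep the transcript yourself and resend it every turn.
history = [{"role": "user", "content": "What is context compaction?"}]
resp = client.responses.create(model="gpt-4.1-mini", input=history, store=False)
history.append({"role": "assistant", "content": resp.output_text})

# Naive sliding-window compaction: keep only the most recent turns.
MAX_TURNS = 20
history = history[-MAX_TURNS:]
```

In the service-managed mode the thread is linear by default, but pointing `previous_response_id` at an earlier response is what enables the forking/branching behavior the article describes.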
A new ETH Zurich study challenges the common practice of using `AGENTS.md` files with AI coding agents. LLM-generated context files decrease performance (3% lower success rate, +20% steps/costs). Human-written files offer small gains (4% higher success rate) but also increase costs. The researchers recommend omitting context files unless they are manually written and contain non-inferable details (tooling, build commands). They tested this using a new dataset, AGENTbench, with four agents.
RAG combines language models with external knowledge. This article explores context and retrieval in RAG, covering search methods (keyword matching, TF-IDF, embeddings with FAISS/Chroma), context-length challenges (compression, re-ranking), and contextual retrieval that incorporates the query and conversation history.
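As a rough illustration of the embedding-based search the article covers, here is a minimal sketch using FAISS with a sentence-transformers encoder; the model name is one common choice, not necessarily what the article uses.

```
# Sketch: embedding search over a toy corpus with FAISS.
import faiss
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
docs = [
    "FAISS performs fast similarity search over vectors.",
    "TF-IDF weighs rare terms more heavily.",
    "Paris is the capital of France.",
]

vecs = model.encode(docs).astype("float32")
faiss.normalize_L2(vecs)                  # normalize so inner product == cosine
index = faiss.IndexFlatIP(vecs.shape[1])  # exact inner-product index
index.add(vecs)

query = model.encode(["How do I search by meaning?"]).astype("float32")
faiss.normalize_L2(query)
scores, ids = index.search(query, 2)      # top-2 nearest documents
print([docs[i] for i in ids[0]])
```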
This research introduces Doc-to-LoRA (D2L), a method for efficiently processing long documents with Large Language Models (LLMs). D2L creates small, adaptable "LoRA" modules that distill key information from a document, allowing the LLM to answer questions without needing the entire document in memory. This significantly reduces latency and memory usage, enabling LLMs to handle contexts much longer than their original capacity and facilitating faster knowledge updates.
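The paper's training procedure isn't reproduced here, but a hedged sketch of what the inference side of a Doc-to-LoRA-style setup might look like with the `peft` library follows; the base model and adapter path are placeholders, and the distillation step that produces the adapter (the actual D2L contribution) is assumed to have already run.

```
# Sketch: answering about a document by loading its distilled LoRA adapter
# instead of putting the document text into the prompt.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("gpt2")   # placeholder base model
tok = AutoTokenizer.from_pretrained("gpt2")

# Load the per-document adapter (placeholder path).
model = PeftModel.from_pretrained(base, "adapters/contract_1234")

prompt = "What is the termination clause in this contract?"
inputs = tok(prompt, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
```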
Here's the simplest version, key sentence extraction:
```
def extract_relevant_sentences(document, query, top_k=5):
    """Return the top_k sentences most similar to the query.

    Assumes embed() and cosine_sim() helpers are defined elsewhere.
    """
    sentences = [s.strip() for s in document.split('.') if s.strip()]
    query_embedding = embed(query)
    scored = []
    for sentence in sentences:
        similarity = cosine_sim(query_embedding, embed(sentence))
        scored.append((sentence, similarity))
    scored.sort(key=lambda x: x[1], reverse=True)  # highest similarity first
    return '. '.join(s[0] for s in scored[:top_k])
```
For each sentence, compute its similarity to the query. Keep the top five. Discard the rest.
mcp-cli is a lightweight CLI that enables dynamic discovery of MCP servers, reducing token consumption and making tool interactions more efficient for AI coding agents.
A Python implementation of Recursive Language Models (RLMs) for processing unbounded context lengths: process 100k+ tokens with any LLM by storing the context in variables instead of prompts.
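A hedged sketch of the core idea, with illustrative helper names rather than the repo's actual API:

```
# The long context lives in a Python variable and the model inspects it
# through small tool calls, so no single prompt ever holds the full text.
import re

context = open("huge_document.txt").read()  # 100k+ tokens, kept out of the prompt

def peek(start: int, end: int) -> str:
    """Return a character slice of the stored context."""
    return context[start:end]

def grep(pattern: str, window: int = 200) -> list[str]:
    """Return a snippet of context around each regex match."""
    return [
        context[max(m.start() - window, 0) : m.end() + window]
        for m in re.finditer(pattern, context)
    ]

# An agent loop would expose peek/grep as tools; each LLM call then sees only
# the short snippets it asked for, so any model can work over the full text.
```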
This blog post explains that Large Language Models (LLMs) don't need to understand the Model Context Protocol (MCP) to utilize tools. MCP standardizes tool calling, simplifying agent development for developers while the LLM simply generates tool call suggestions based on provided definitions. The article details tool calling, MCP's function, and how it relates to context engineering.
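A minimal sketch of what the model actually sees and emits, using an OpenAI-style function schema as one common format; the tool and its arguments are made up.

```
# The model only ever sees definitions like `tools` and emits structured
# call suggestions like `suggested_call`; it never needs to know MCP exists.
tools = [{
    "type": "function",
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

# What comes back from the model is just structured text:
suggested_call = {"name": "get_weather", "arguments": '{"city": "Zurich"}'}

# The host application executes the call, not the model. With MCP, the host
# discovers `tools` from an MCP server and routes `suggested_call` to it.
```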
This article discusses the importance of knowledge graphs in providing context for AI agents, highlighting their advantages over traditional retrieval systems in terms of precision, reasoning, and explainability.
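As a toy illustration of the reasoning advantage, a multi-hop question becomes two explicit, citable edge traversals over a graph (all data here is made up):

```
# "Where is the company that Acme acquired headquartered?" as two hops,
# each one an explainable piece of evidence, unlike a fuzzy passage match.
graph = {
    ("Acme", "acquired"): ["BetaCorp"],
    ("BetaCorp", "headquartered_in"): ["Zurich"],
}

def hop(entity: str, relation: str) -> list[str]:
    return graph.get((entity, relation), [])

for company in hop("Acme", "acquired"):
    print(company, "->", hop(company, "headquartered_in"))
```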
>"This document provides a comprehensive overview of the engineering repository, which implements a systematic approach to context engineering for Large Language Models (LLMs). The repository bridges theoretical foundations with practical implementations, using a biological metaphor to organize concepts from simple prompts to complex neural field systems."