SemanticScuttle - klotz.me

Long context support in LLM 0.24 using fragments and template plugins

LLM 0.24 introduces fragments and template plugins to better utilize long context models, improving storage efficiency and enabling new features like querying logs by fragment and leveraging documentation. It also details improvements to template handling and model support.

2025-04-08 Tags: llm, context, simon willison by klotz

Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens

Qwen2.5-1M models and inference framework support for long-context tasks, with a context length of up to 1M tokens.

2025-01-27 Tags: qwen2.5-1m, context, qwen, llm, github by klotz

StreamingLLM (llama.cpp & llamacpp_HF loaders)

This PR implements the StreamingLLM technique for model loaders, focusing on handling context length and optimizing chat generation speed.

2024-11-26 Tags: streamingllm, llama.cpp, context, chat, llm, oobabooga by klotz

Anthropic's “Contextual Retrieval” Technique Enhances RAG Accuracy by 67%

"Contextual Retrieval tackles a fundamental issue in RAG: the loss of context when documents are split into smaller chunks for processing. By adding relevant contextual information to each chunk before it's embedded or indexed, the method preserves critical details that might otherwise be lost. In practical terms, this involves using Anthropic’s Claude model to generate chunk-specific context. For instance, a simple chunk stating, “The company’s revenue grew by 3% over the previous quarter,” becomes contextualized to include additional information such as the specific company and the relevant time period. This enhanced context ensures that retrieval systems can more accurately identify and utilize the correct information."

2024-09-22 Tags: anthropic, context, llm, rag, bm25 ai claude, embeddings tf-idf by klotz

Provide context to GitHub Copilot Chat

This article explains how to provide context to GitHub Copilot Chat for better code suggestions and assistance. It covers techniques like highlighting code, using slash commands, leveraging workspace information, and specifying relevant files.

2024-09-16 Tags: github copilot, context, code generation, llm by klotz

mem0ai/mem0

Mem0: The Memory Layer for Personalized AI. Provides an intelligent, adaptive memory layer for Large Language Models (LLMs), enhancing personalized AI experiences.

2024-07-29 Tags: mem0, python, llm, context, rag, vector-database by klotz

Researchers Recreate Human Episodic Memory to Give LLMs Infinite Context

Researchers from Huawei and University College London have developed a new approach called EM-LLM, which integrates aspects of human episodic memory and event cognition into large language models (LLMs). This allows LLMs to have infinite context lengths while maintaining their regular functioning.

2024-07-16 Tags: em-llm, llm, context by klotz

Introducing Pinboard

Noema Research introduces Pinboard, a developer tool for improved productivity. Pinboard, a command-line tool, efficiently manages files and terminal references, enhancing development workflows. Key features include flexible pinning, contextual updates, clipboard integration, an interactive shell, and undo functionality.

2024-07-10 Tags: cli, shell, linux, pinboard, llm, context, foss by klotz

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

The article proposes a new framework, LongRAG, that aims to improve the performance of Retrieval-Augmented Generation (RAG) by using long retriever and reader components. LongRAG processes Wikipedia into larger 4K-token units, reducing the total units from 22M to 600K, thus decreasing the burden on the retriever. The top-k retrieved units (≈30K tokens) are then fed to a long-context Language Model for zero-shot answer extraction. LongRAG achieves EM of 62.7% on NQ and 64.3% on HotpotQA (full-wiki), which is on par with the state-of-the-art model.

2024-06-24 Tags: retrieval-augmented generation, context, llm, question answering by klotz

ChatGPT Glossary: 44 AI Terms That Everyone Should Know

Stay informed about the latest artificial intelligence (AI) terminology with this comprehensive glossary. From algorithm and AI ethics to generative AI and overfitting, learn the essential AI terms that will help you sound smart over drinks or impress in a job interview.

SemanticScuttle - klotz.me

Tags: context*

Linked Tags

Related Tags