SemanticScuttle - klotz.me » klotz: text+llm+rag

klotz: text* + llm* + rag*

Beyond RAG: Precision Filtering in a Semantic World

This article discusses how traditional machine learning methods, particularly outlier detection, can be used to improve the precision and efficiency of Retrieval-Augmented Generation (RAG) systems by filtering out irrelevant queries before document retrieval.

2024-11-13 Tags: rag, outlier detection, filtering, llm, semantic retrieval, nlp, muzlin, production engineering, logs by klotz

New Technique Makes RAG Systems Much Better at Retrieving the Right Documents

Researchers from Cornell University developed a technique called 'contextual document embeddings' to improve the performance of Retrieval-Augmented Generation (RAG) systems, enhancing the retrieval of relevant documents by making embedding models more context-aware.

Standard methods like bi-encoders often fail to account for context-specific details, leading to poor performance in application-specific datasets. Contextual document embeddings address this by enhancing the sensitivity of the embedding model to subtle differences in documents, particularly in specialized domains.

The researchers proposed two complementary methods to improve bi-encoders:

- Modifying the training process using contrastive learning to distinguish between similar documents.
- Modifying the bi-encoder architecture to incorporate corpus context during the embedding process.

These modifications allow the model to capture both the general context and specific details of documents, leading to better performance, especially in out-of-domain scenarios. The new technique has shown consistent improvements over standard bi-encoders and can be adapted for various applications beyond text-based models.

2024-10-10 Tags: rag, embedding, document retrieval, llm by klotz

Discovering Semantic Search and RAG with Large Language Models (LLMs)

Foundational concepts, practical implementation of semantic search, and the workflow of RAG, highlighting its advantages and versatile applications.

The article provides a step-by-step guide to implementing a basic semantic search using TF-IDF and cosine similarity. This includes preprocessing steps, converting text to embeddings, and searching for relevant documents based on query similarity.

2024-10-04 Tags: llm, semantic search, rag, nlp, embeddings, asymmetric by klotz

A Visual Exploration of Semantic Text Chunking

The article explains semantic text chunking, a technique for automatically grouping similar pieces of text to be used in pre-processing stages for Retrieval Augmented Generation (RAG) or similar applications. It uses visualizations to understand the chunking process and explores extensions involving clustering and LLM-powered labeling.

2024-09-21 Tags: text, chunking, nlp, rag, dimensionality reduction, hierarchical clustering, umap, summarization, llm by klotz

Agentic Chunking for RAGs

This article discusses the importance of chunking, embedding, and indexing in RAGs (Recursive Auto-Segmented Graphs). The author compares recursive character splitting and semantic splitting techniques for text chunking and suggests the use of agentic chunking for superior RAG retrieval.

2024-08-27 Tags: rag, text, chunking, llm by klotz

Advanced RAG Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

2024-08-01 Tags: rag, nlp, machine learning, information retrieval, natural language processing, llm, embeddings, semantic search by klotz

Developer APIs to Accelerate LLM Projects - nlmatics/llmsherpa

The llmsherpa project provides APIs to accelerate Large Language Model (LLM) projects. It includes features like LayoutPDFReader for PDF text parsing, smart chunking for vector search and Retrieval Augmented Generation, and table analysis. It is open-sourced under Apache 2.0 license.

2024-06-27 Tags: llm, pdf, text, parsing, retrieval augmented generation, foss, github, cpdomina by klotz

Getting Started with RAG

This article explains Retrieval Augmented Generation (RAG), a method to reduce the risk of hallucinations in Large Language Models (LLMs) by limiting the context in which they generate answers. RAG is demonstrated using txtai, an open-source embeddings database for semantic search, LLM orchestration, and language model workflows.

2024-06-23 Tags: rag, llm, hallucinations, txtai, embeddings database, semantic search, orchestration, text, github by klotz

Combining the Best of Both Worlds: Retrieval-Augmented Generation for Knowledge-Intensive Natural Language Processing

This article discusses Retrieval-Augmented Generation (RAG) models, a new approach that addresses the limitations of traditional models in knowledge-intensive Natural Language Processing (NLP) tasks. RAG models combine parametric memory from pre-trained seq2seq models with non-parametric memory from a dense vector index of Wikipedia, enabling dynamic knowledge access and integration.

2024-05-28 Tags: retrieval-augmented generation, nlp, llm, parametric memory by klotz

Overcoming the Limits of RAG with ColBERT

ColBERT is a new way of scoring passage relevance using a BERT language model that substantially solves the problems with dense passage retrieval.

2024-03-12 Tags: llm, rag, embedding, bert, colbert, cosine distance, concept expansion by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: text* + llm* + rag*

Linked Tags

Related Tags