Sparse Priming Representations (SPR) is a research project focused on developing and sharing techniques for efficiently representing complex ideas, memories, or concepts using a minimal set of keywords, phrases, or statements, enabling language models or subject matter experts to quickly reconstruct the original idea with minimal context.
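To make the idea concrete, here is a minimal sketch of SPR-style compression as a prompt wrapped around any chat model; the prompt wording and the injected `complete` callable are illustrative assumptions, not code from the project itself:

```python
# Sketch of SPR compression: distill a passage into terse priming statements.
# The prompt text below is an illustrative paraphrase, not the project's own.
SPR_COMPRESS_PROMPT = (
    "You are an SPR writer. Distill the input into a short list of succinct "
    "statements, assertions, and associations that would let a language model "
    "reconstruct the original idea. Output only the list."
)

def compress_to_spr(text: str, complete) -> str:
    """`complete` is any callable that sends a prompt string to an LLM
    and returns the text of its response."""
    return complete(f"{SPR_COMPRESS_PROMPT}\n\nINPUT:\n{text}")
```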
Scaling a simple RAG pipeline from short notes to full books. This post explains how to handle larger files in a RAG pipeline by adding an extra step to the process: chunking.
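As a rough illustration of that chunking step, here is a minimal sketch assuming LangChain's splitter package (`langchain-text-splitters`); the chunk sizes and file name are arbitrary:

```python
from langchain_text_splitters import RecursiveCharacterTextSplitter

splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,    # max characters per chunk
    chunk_overlap=200,  # overlap so context isn't cut off mid-thought
)

with open("book.txt", encoding="utf-8") as f:
    chunks = splitter.split_text(f.read())

# Each chunk is now small enough to embed and index on its own.
```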
Unlock the power of 301 redirects at scale with LLMs to enhance user experience and optimize your website's SEO strategy.
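One scalable way redirect mapping like this is commonly implemented (an assumption here, not necessarily the article's exact method) is to embed old and new URL slugs and pair each old URL with its semantically nearest new one; the model name and URLs below are illustrative:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

old_urls = ["/blog/intro-to-rag", "/blog/vector-db-basics"]
new_urls = ["/guides/retrieval-augmented-generation", "/guides/vector-databases"]

old_emb = model.encode(old_urls, convert_to_tensor=True)
new_emb = model.encode(new_urls, convert_to_tensor=True)

# Pair each old URL with the semantically closest new URL.
scores = util.cos_sim(old_emb, new_emb)
for i, url in enumerate(old_urls):
    best = int(scores[i].argmax())
    print(f"301: {url} -> {new_urls[best]}  (score={scores[i][best]:.2f})")
```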
This article details the often overlooked cost of storing embeddings for RAG systems, and how quantization techniques (int8 and binary) can significantly reduce storage requirements and improve retrieval speed without substantial accuracy loss.
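A minimal NumPy sketch of the two schemes, with illustrative sizes (production code would typically use a library helper such as sentence-transformers' `quantize_embeddings`):

```python
import numpy as np

emb = np.random.randn(10_000, 384).astype(np.float32)  # float32: 4 bytes/dim

# int8: rescale each dimension into [-128, 127] using per-dimension min/max
# (4x smaller than float32).
lo, hi = emb.min(axis=0), emb.max(axis=0)
emb_int8 = np.round((emb - lo) / (hi - lo) * 255 - 128).astype(np.int8)

# binary: keep only the sign bit, packed 8 dimensions per byte (32x smaller).
emb_bin = np.packbits(emb > 0, axis=-1)

print(emb.nbytes, emb_int8.nbytes, emb_bin.nbytes)
# 15360000 3840000 480000
```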
This article details building a Retrieval-Augmented Generation (RAG) system to assist with research paper tasks, specifically question answering over a PDF document. It covers document loading, splitting, embedding with Sentence Transformers, using ChromaDB as a vector database, and implementing a query interface with LangChain.
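A condensed sketch of that pipeline's core loop, using chromadb's default embedder directly rather than the article's Sentence Transformers + LangChain wiring; the file and collection names are illustrative:

```python
import chromadb
from pypdf import PdfReader

# Load and split: one chunk per PDF page, skipping empty pages.
reader = PdfReader("paper.pdf")
pages = [t for t in (p.extract_text() for p in reader.pages) if t.strip()]

client = chromadb.Client()  # in-memory; PersistentClient stores to disk
collection = client.create_collection("paper")
collection.add(documents=pages, ids=[f"page-{i}" for i in range(len(pages))])

# Retrieve the pages most relevant to a question, then hand them to an LLM.
results = collection.query(query_texts=["What dataset does the paper use?"], n_results=3)
context = "\n\n".join(results["documents"][0])
```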
This tutorial demonstrates how to build a powerful document search engine using Hugging Face embeddings, Chroma DB, and LangChain for semantic search capabilities.
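A minimal sketch of the semantic-search core, plugging an explicit Hugging Face sentence-transformer into Chroma; the tutorial's LangChain layer is omitted for brevity, and the model and documents are illustrative:

```python
import chromadb
from chromadb.utils import embedding_functions

ef = embedding_functions.SentenceTransformerEmbeddingFunction(model_name="all-MiniLM-L6-v2")
client = chromadb.Client()
collection = client.create_collection("docs", embedding_function=ef)

collection.add(
    documents=[
        "Chroma is an open-source vector database.",
        "Embeddings map text to points in a vector space.",
        "301 redirects permanently move a URL.",
    ],
    ids=["d1", "d2", "d3"],
)

# Matches by meaning, not keywords: no word overlap with stored texts required.
hits = collection.query(query_texts=["how do vector stores work?"], n_results=2)
print(hits["documents"][0])
```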
This article provides a step-by-step guide to creating an AI-powered English tutor using Retrieval-Augmented Generation (RAG). It integrates a vector database (ChromaDB) for storing and retrieving relevant English language learning materials and Groq API for generating structured and engaging lessons. The tutorial covers installing necessary libraries, setting up the environment, defining a vector database class, implementing AI lesson generation, and combining vector retrieval with AI generation.
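A hedged sketch of the retrieve-then-generate step at the heart of that design; it assumes an already-populated Chroma collection named "lessons", a `GROQ_API_KEY` in the environment, and the `llama-3.1-8b-instant` model, all of which are illustrative:

```python
import os
import chromadb
from groq import Groq

client = chromadb.Client()
lessons = client.get_or_create_collection("lessons")
groq = Groq(api_key=os.environ["GROQ_API_KEY"])

def generate_lesson(topic: str) -> str:
    # Retrieve the most relevant stored learning materials...
    hits = lessons.query(query_texts=[topic], n_results=3)
    context = "\n".join(hits["documents"][0])
    # ...then ask the LLM to build a lesson grounded in them.
    resp = groq.chat.completions.create(
        model="llama-3.1-8b-instant",
        messages=[
            {"role": "system", "content": "You are an English tutor. Base lessons on the provided material."},
            {"role": "user", "content": f"Material:\n{context}\n\nCreate a short lesson on: {topic}"},
        ],
    )
    return resp.choices[0].message.content
```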
The article discusses the evolution of search databases and how vector databases are emerging as a powerful alternative to traditional search engines like Elasticsearch.
This article discusses the importance of real-time access for Retrieval Augmented Generation (RAG) and how Redis can enable this through its real-time vector database, semantic cache, and LLM memory capabilities, leading to faster and more accurate responses in GenAI applications.
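A hedged sketch of the real-time vector side of such a setup with redis-py (requires Redis Stack's search module); the index name, dimensions, and key prefix are illustrative:

```python
import numpy as np
import redis
from redis.commands.search.field import TextField, VectorField
from redis.commands.search.indexDefinition import IndexDefinition, IndexType
from redis.commands.search.query import Query

r = redis.Redis(host="localhost", port=6379)
r.ft("docs").create_index(
    (
        TextField("content"),
        VectorField("embedding", "HNSW",
                    {"TYPE": "FLOAT32", "DIM": 384, "DISTANCE_METRIC": "COSINE"}),
    ),
    definition=IndexDefinition(prefix=["doc:"], index_type=IndexType.HASH),
)

vec = np.random.rand(384).astype(np.float32)  # stand-in for a real embedding
r.hset("doc:1", mapping={"content": "Redis can act as a vector database.",
                         "embedding": vec.tobytes()})

# KNN query: new vectors are searchable immediately, with no batch re-index.
q = (Query("*=>[KNN 3 @embedding $vec AS score]")
     .sort_by("score").return_fields("content", "score").dialect(2))
res = r.ft("docs").search(q, query_params={"vec": vec.tobytes()})
```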
Explore how semantic caching, which understands the meaning behind user queries, can boost performance and relevance in AI applications by storing and retrieving data based on intent.
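A minimal, store-agnostic sketch of the idea: before paying for an LLM call, check whether a semantically similar query was already answered (the 0.85 threshold and model name are illustrative):

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")
cache = []  # list of (query_embedding, answer) pairs

def cached_answer(query: str, llm_call, threshold: float = 0.85) -> str:
    q_emb = model.encode(query, convert_to_tensor=True)
    for emb, answer in cache:
        if util.cos_sim(q_emb, emb).item() >= threshold:
            return answer  # intent matches an earlier query: reuse the answer
    answer = llm_call(query)  # cache miss: run the real completion
    cache.append((q_emb, answer))
    return answer
```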