SemanticScuttle - klotz.me » klotz: semantic search

klotz: semantic search*

Discovering Semantic Search and RAG with Large Language Models (LLMs)

Foundational concepts, practical implementation of semantic search, and the workflow of RAG, highlighting its advantages and versatile applications.

The article provides a step-by-step guide to implementing a basic semantic search using TF-IDF and cosine similarity. This includes preprocessing steps, converting text to embeddings, and searching for relevant documents based on query similarity.

2024-10-04 Tags: llm, semantic search, rag, nlp, embeddings, asymmetric by klotz

Working with Embeddings: Closed versus Open Source

An article discussing the use of embeddings in natural language processing, focusing on comparing open source and closed source embedding models for semantic search, including techniques like clustering and re-ranking.

2024-09-27 Tags: embeddings, natural language processing, semantic search, open source, closed source, retrieval applications, clustering, re-ranking, llm by klotz

Embeddings Are Kind of Shallow

The author explores semantic search using embeddings on U.S. Presidents, comparing four models: BGE, ST, Ada, and Large. The findings show that while embeddings capture interesting data, their limitations and inability to understand subtext and perform certain semantic tasks highlight their shallowness compared to full language models.

2024-09-24 Tags: embeddings, semantic search, llm by klotz

How to Use Hybrid Search for Better LLM RAG Retrieval

Combining dense embeddings with BM25 for advanced local LLM RAG pipeline

2024-08-12 Tags: rag, lm, bm25, hybrid search, semantic search, keyword search by klotz

Advanced RAG Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

2024-08-01 Tags: rag, nlp, machine learning, information retrieval, natural language processing, llm, embeddings, semantic search by klotz

Andrew Ng Launches A New Short Course on Embedding Models

Andrew Ng has launched a new short course on embedding models, covering their history, architecture, and capabilities. The course, taught by Vectara's Ofer Mendelevitch, explores word, sentence, and cross-encoder models, BERT training, and building dual encoder models for semantic search.

2024-08-01 Tags: andrew ng, embedding models, machine learning, deeplearning, vectara, semantic search, llm, course by klotz

All-in-one open-source embeddings database for semantic search, LLM orchestration, and language model workflows

txtai is an open-source embeddings database for various applications such as semantic search, LLM orchestration, language model workflows, and more. It allows users to perform vector search with SQL, create embeddings for text, audio, images, and video, and run pipelines powered by language models for question-answering, transcription, translation, and more.

2024-06-22 Tags: github, txtai, embeddings, semantic search, llm, python, hugging face transformers, fastapi by klotz

Getting Started with RAG

This article explains Retrieval Augmented Generation (RAG), a method to reduce the risk of hallucinations in Large Language Models (LLMs) by limiting the context in which they generate answers. RAG is demonstrated using txtai, an open-source embeddings database for semantic search, LLM orchestration, and language model workflows.

2024-06-23 Tags: rag, llm, hallucinations, txtai, embeddings database, semantic search, orchestration, text, github by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: semantic search*

Linked Tags

Related Tags