SemanticScuttle - klotz.me » Tags: embeddings+llm

Tags: embeddings* + llm*


A simple project demonstrating Retrieval Augmented Generation (RAG) using SQLite, sqlite-vec, and OpenAI. It embeds text files, stores them in a SQLite database, and retrieves relevant documents using vector search. The project features lightweight single-file SQLite databases, vector search capabilities, and OpenAI integration for embeddings and chat responses.

2025-02-20 Tags: sqlite, rag, sqlite-vec, vector search, embeddings, llm, github, edizaguirre by klotz
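
The embed, store, and retrieve flow described in the entry above can be sketched roughly as follows. This is a minimal illustration only, and it is not the bookmarked project's code: `toy_embed` is a hypothetical stand-in for an OpenAI embedding call, and a brute-force dot product in Python is swapped in for sqlite-vec's vector search so the sketch runs on the standard library alone. The table name, file names, and prompt wording are all assumptions.

```python
import json
import math
import sqlite3


def toy_embed(text, dim=64):
    """Hypothetical stand-in for an embedding API: hashed bag of words, L2-normalized."""
    vec = [0.0] * dim
    for word in text.lower().split():
        vec[sum(ord(c) for c in word) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def build_index(conn, files):
    """Store each document and its embedding (as JSON text) in one SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs (name TEXT PRIMARY KEY, body TEXT, embedding TEXT)"
    )
    for name, body in files.items():
        conn.execute(
            "INSERT OR REPLACE INTO docs VALUES (?, ?, ?)",
            (name, body, json.dumps(toy_embed(body))),
        )
    conn.commit()


def retrieve(conn, query, top_k=2):
    """Brute-force search: vectors are unit length, so the dot product is the cosine."""
    q = toy_embed(query)
    scored = []
    for name, body, emb in conn.execute("SELECT name, body, embedding FROM docs"):
        score = sum(a * b for a, b in zip(q, json.loads(emb)))
        scored.append((score, name, body))
    scored.sort(key=lambda r: (-r[0], r[1]))
    return [(name, body, score) for score, name, body in scored[:top_k]]


def build_prompt(query, hits):
    """Assemble the retrieved text into a prompt that a chat model could answer from."""
    context = "\n".join(f"[{name}] {body}" for name, body, _ in hits)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


conn = sqlite3.connect(":memory:")
build_index(conn, {"a.txt": "sqlite database file", "b.txt": "model answers questions"})
hits = retrieve(conn, "sqlite database", top_k=1)
prompt = build_prompt("sqlite database", hits)
```

In a real setup the embedding call and the similarity search would be delegated to the embedding provider and to the vector extension; only the overall shape of the pipeline is meant to carry over.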

PGAI Vectorizer: Automate AI Embeddings With One SQL Command in PostgreSQL

Learn how to automate AI embedding creation using PostgreSQL with pgai Vectorizer. Streamline your AI workflow with simple SQL commands.

Integration: PGAI Vectorizer integrates AI capabilities into PostgreSQL, enabling users to generate AI embeddings directly within the database.
Ease of Use: It simplifies the process of creating embeddings using a single SQL command, eliminating the need for multiple tools and complex pipelines.
Automatic Sync: Embeddings are automatically updated as data changes, so they stay current without manual intervention.
Model Flexibility: Users can quickly switch between different AI models without reprocessing data.
Scalability: It optimizes search performance with vector indexes, making it suitable for large datasets.
Customization: It allows users to define chunking and formatting rules to tailor embeddings to their specific needs.

2024-11-22 Tags: llm, vectorizer, postgresql, embeddings, sql by klotz

Discovering Semantic Search and RAG with Large Language Models (LLMs)

Foundational concepts, practical implementation of semantic search, and the workflow of RAG, highlighting its advantages and versatile applications.

The article provides a step-by-step guide to implementing a basic semantic search using TF-IDF and cosine similarity. This includes preprocessing steps, converting text to embeddings, and searching for relevant documents based on query similarity.

2024-10-04 Tags: llm, semantic search, rag, nlp, embeddings, asymmetric by klotz
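
The TF-IDF and cosine-similarity search mentioned in the entry above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the article's code: the function names, the simple punctuation-stripping tokenizer, the smoothed IDF variant, and the three sample documents are all assumptions chosen to keep the example self-contained.

```python
import math
from collections import Counter

PUNCT = ".,!?;:()\"'"


def tokenize(text):
    """Preprocessing: lowercase, split on whitespace, strip edge punctuation."""
    tokens = (w.strip(PUNCT).lower() for w in text.split())
    return [t for t in tokens if t]


def build_idf(docs_tokens):
    """Smoothed inverse document frequency for every term in the corpus."""
    n = len(docs_tokens)
    df = Counter(term for tokens in docs_tokens for term in set(tokens))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are dropped."""
    counts = Counter(t for t in tokens if t in idf)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {t: (c / total) * idf[t] for t, c in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def search(query, docs, top_k=3):
    """Rank documents by cosine similarity between query and document vectors."""
    docs_tokens = [tokenize(d) for d in docs]
    idf = build_idf(docs_tokens)
    doc_vecs = [tfidf_vector(t, idf) for t in docs_tokens]
    q_vec = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(q_vec, v), i) for i, v in enumerate(doc_vecs)]
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [(docs[i], score) for score, i in scored[:top_k]]


DOCS = [
    "SQLite stores vectors for retrieval augmented generation.",
    "Transformers use self-attention to build contextual embeddings.",
    "Cosine similarity compares a query vector with document vectors.",
]
```

Calling `search("cosine similarity of vectors", DOCS)` ranks the third document first because it shares the most weighted terms with the query. Note that TF-IDF matches on shared words only, so a query phrased with synonyms would score zero against a relevant document.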

Working with Embeddings: Closed versus Open Source

An article discussing the use of embeddings in natural language processing, focusing on comparing open source and closed source embedding models for semantic search, including techniques like clustering and re-ranking.

2024-09-27 Tags: embeddings, natural language processing, semantic search, open source, closed source, retrieval applications, clustering, re-ranking, llm by klotz

Embeddings Are Kind of Shallow

The author explores semantic search using embeddings on U.S. Presidents, comparing four models: BGE, ST, Ada, and Large. The findings show that while embeddings capture interesting data, their limitations and inability to understand subtext and perform certain semantic tasks highlight their shallowness compared to full language models.

2024-09-24 Tags: embeddings, semantic search, llm by klotz

Sage: Chat with any codebase

Sage is a tool that allows developers to chat with any codebase using two commands. It provides a functional chat interface for code, supports running locally or on the cloud, and has a modular design for swapping components.

2024-09-19 Tags: sage, codebase, chat, developers, github, embeddings, llm, vector stores, modular by klotz

Advanced RAG Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

2024-08-01 Tags: rag, nlp, machine learning, information retrieval, natural language processing, llm, embeddings, semantic search by klotz

All-in-one open-source embeddings database for semantic search, LLM orchestration, and language model workflows

txtai is an open-source embeddings database for various applications such as semantic search, LLM orchestration, language model workflows, and more. It allows users to perform vector search with SQL, create embeddings for text, audio, images, and video, and run pipelines powered by language models for question-answering, transcription, translation, and more.

2024-06-22 Tags: github, txtai, embeddings, semantic search, llm, python, hugging face transformers, fastapi by klotz

Multilingual RAG, Algorithmic Thinking, Outlier Detection, and Other Problem-Solving Highlights

The highlighted articles cover a variety of topics, including algorithmic thinking for data scientists, outlier detection in time-series data, route optimization for visiting NFL teams, a solution to the minimum vertex coloring problem, high-cardinality features, multilingual RAG (Retrieval-Augmented Generation) system development, fine-tuning smaller transformer models, long-form visual understanding, multimodal image-text models, the theoretical underpinnings of learning, data science stress management, and reinforcement learning.

2024-06-21 Tags: towardsdatascience, llm, machine learning, embeddings, papers, summaries by klotz

Contextual Transformer Embeddings Using Self-Attention Explained with Diagrams and Python Code

This article is part of a series titled ‘LLMs from Scratch’, a complete guide to understanding and building Large Language Models (LLMs). In this article, we discuss the self-attention mechanism and how it is used by transformers to create rich and context-aware transformer embeddings.

The Self-Attention mechanism is used to add context to learned embeddings, which are vectors representing each word in the input sequence. The process involves the following steps:

Learned Embeddings: These are the initial vector representations of words, learned during the training phase. The weight matrix that holds the learned embeddings sits in the first linear layer of the Transformer architecture.
Positional Encoding: This step adds positional information to the learned embeddings. Because transformers process all words in parallel, they would otherwise lose track of word order; the positional information lets the model recover the order of the words in the input sequence.
Self-Attention: The core of the Self-Attention mechanism is to update the learned embeddings with context from the surrounding words in the input sequence. This mechanism determines which words provide context to other words, and this contextual information is used to produce the final contextualized embeddings.

2024-06-01 Tags: transformer, attention, self-attention, embeddings, nlp, deep learning, llm, machine learning by klotz
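
The three steps in the entry above (learned embeddings, positional encoding, self-attention) can be sketched in plain Python as below. This is a rough sketch under stated assumptions, not the article's code: it uses the sinusoidal positional encoding from the original Transformer design, a single attention head, made-up one-hot "learned" embeddings, and identity matrices in place of trained query, key, and value projections. All names are hypothetical.

```python
import math


def positional_encoding(seq_len, d_model):
    """Sinusoidal encodings: sin on even dimensions, cos on odd ones."""
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(d_model):
            angle = pos / (10000 ** ((2 * (i // 2)) / d_model))
            pe[pos][i] = math.sin(angle) if i % 2 == 0 else math.cos(angle)
    return pe


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product attention; returns (output, weights)."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]
    return matmul(weights, v), weights


# Step 1: toy "learned" embeddings for a 3-token sequence (made-up values).
EMBEDDINGS = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
# Assumption: identity projections stand in for trained weight matrices.
IDENTITY = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# Step 2: add positional information to each embedding.
pe = positional_encoding(len(EMBEDDINGS), 4)
x = [[e + p for e, p in zip(erow, prow)] for erow, prow in zip(EMBEDDINGS, pe)]

# Step 3: mix in context from the other tokens.
contextual, weights = self_attention(x, IDENTITY, IDENTITY, IDENTITY)
```

Each row of `weights` is a probability distribution over the input tokens, and each row of `contextual` is the corresponding weighted average of the value vectors, which is what makes the resulting embedding depend on the surrounding words.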
