Tags: embeddings


  1. RAG combines language models with external knowledge. This article explores context & retrieval in RAG, covering search methods (keywords, TF-IDF, embeddings/FAISS/Chroma), context length challenges (compression, re-ranking), and contextual retrieval (query & conversation history).
  2. This tutorial explores how to use LLM embeddings as features in time series forecasting models. It covers generating embeddings from time series descriptions, preparing data, and evaluating the performance of models with and without LLM embeddings.
  3. This tutorial demonstrates how to combine LLM embeddings, TF-IDF vectors, and metadata features into a single Scikit-learn pipeline for document retrieval and search. It covers generating embeddings with Sentence Transformers, calculating TF-IDF, handling metadata, and building a combined retrieval system.
  4. This article details seven advanced feature engineering techniques using LLM embeddings to improve machine learning model performance. It covers techniques like dimensionality reduction, semantic similarity, clustering, and more.

    The article explores how to leverage LLM embeddings for advanced feature engineering in machine learning, going beyond simple similarity searches. It details seven techniques:

    1. **Embedding Arithmetic:** Performing mathematical operations (addition, subtraction) on embeddings to represent concepts like "positive sentiment - negative sentiment = overall sentiment".
    2. **Embedding Clustering:** Using clustering algorithms (like k-means) on embeddings to create categorical features representing groups of similar text.
    3. **Embedding Dimensionality Reduction:** Reducing the dimensionality of embeddings using techniques like PCA or UMAP to create more compact features while preserving important information.
    4. **Embedding as Input to Tree-Based Models:** Directly using embedding vectors as features in tree-based models like Random Forests or Gradient Boosting. The article highlights the importance of careful handling of high-dimensional data.
    5. **Embedding-Weighted Averaging:** Calculating weighted averages of embeddings based on relevance scores (e.g., TF-IDF) to create a single, representative embedding for a document.
    6. **Embedding Difference:** Calculating the difference between embeddings to capture changes or relationships between texts (e.g., before/after edits, question/answer pairs).
    7. **Embedding Concatenation:** Combining multiple embeddings (e.g., title and body of a document) to create a richer feature representation.
  5. This post discusses the limitations of using cosine similarity for compatibility matching, specifically in the context of a dating app. The author found that high cosine similarity scores didn't always translate to actual compatibility due to the inability of embeddings to capture dealbreaker preferences. They improved results by incorporating structured features and hard filters.
  6. Amazon S3 Vectors is now generally available with increased scale and production-grade performance capabilities. It offers native support to store and query vector data, potentially reducing costs by up to 90% compared to specialized vector databases.
  7. A tutorial on building a private, offline Retrieval Augmented Generation (RAG) system using Ollama for embeddings and language generation, and FAISS for vector storage, ensuring data privacy and control.

    1. **Document Loader:** Extracts text from various file formats (PDF, Markdown, HTML) while preserving metadata like source and page numbers for accurate citations.
    2. **Text Chunker:** Splits documents into smaller text segments (chunks) to manage token limits and improve retrieval accuracy. It uses overlapping and sentence boundary detection to maintain context.
    3. **Embedder:** Converts text chunks into numerical vectors (embeddings) using the `nomic-embed-text` model via Ollama, which runs locally without internet access.
    4. **Vector Database:** Stores the embeddings using FAISS (Facebook AI Similarity Search) for fast similarity search. It uses cosine similarity for accurate retrieval and saves the database to disk for quick loading in future sessions.
    5. **Large Language Model (LLM):** Generates answers using the `llama3.2` model via Ollama, also running locally. It takes the retrieved context and the user's question to produce a response with citations.
    6. **RAG System Orchestrator:** Coordinates the entire workflow, managing the ingestion of documents (loading, chunking, embedding, storing) and the querying process (retrieving relevant chunks, generating answers).
  8. This article details the process of building a fast vector search system for a large legal dataset (Australian High Court decisions). It covers choosing embedding providers, performance benchmarks, using USearch and Isaacus embeddings, and the importance of API terms of service. It focuses on achieving speed and scalability while maintaining reasonable accuracy.
  9. This article explains the internal workings of vector databases, highlighting that they don't perform a brute-force search as commonly described. It details algorithms like HNSW, IVF, and PQ, the tradeoffs between recall, speed, and memory, and how different RAG patterns impact vector database usage. It also discusses production challenges like filtering, updates, and sharding.
  10. Google DeepMind research reveals a fundamental architectural limitation in Retrieval-Augmented Generation (RAG) systems related to fixed-size embeddings. The research demonstrates that retrieval performance degrades as database size increases, with theoretical limits based on embedding dimensionality. They introduce the LIMIT benchmark to empirically test these limitations and suggest alternatives like cross-encoders, multi-vector models, and sparse models.
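The combined retrieval pipeline in item 3 boils down to concatenating a TF-IDF matrix with a dense embedding matrix. A minimal sketch of that step, using random vectors as stand-ins for Sentence Transformers output (the 8-dimensional size is an arbitrary assumption for illustration):

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "retrieval augmented generation with embeddings",
    "time series forecasting with llm features",
    "vector databases use hnsw for fast search",
]

# Sparse TF-IDF block of the combined feature matrix.
tfidf = TfidfVectorizer().fit_transform(docs).toarray()

# Stand-in for dense embeddings (random; a real pipeline would call a model).
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(len(docs), 8))

# Concatenate both views into one feature matrix per document.
features = np.hstack([tfidf, embeddings])
```

In practice the dense block is usually L2-normalized or scaled first, since raw embedding magnitudes and TF-IDF weights live on different scales.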
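Techniques 1 and 6 from item 4 (embedding arithmetic and embedding difference) are plain vector operations. A sketch with random stand-in embeddings; the variable names and the 16-dimensional size are illustrative assumptions, not the article's code:

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 16
positive = rng.normal(size=dim)   # stand-in: embedding of a positive anchor text
negative = rng.normal(size=dim)   # stand-in: embedding of a negative anchor text
review   = rng.normal(size=dim)   # stand-in: embedding of the text to score

# Embedding arithmetic: subtract anchors to form a "sentiment axis",
# then project a new text onto it to get a scalar feature.
sentiment_axis = positive - negative
sentiment_score = float(review @ sentiment_axis)

# Embedding difference: the delta between two texts (e.g. before/after an
# edit) becomes a feature vector for a downstream model.
before = rng.normal(size=dim)
after = rng.normal(size=dim)
edit_feature = after - before
```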
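Technique 2 from item 4 (embedding clustering) turns a high-dimensional vector into a single categorical feature. A minimal sketch with k-means over random stand-in embeddings (cluster count and dimensions are arbitrary choices for illustration):

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(50, 32))   # stand-ins for 50 document embeddings

# Fit k-means and use the cluster id as one categorical feature per document.
km = KMeans(n_clusters=4, n_init=10, random_state=0).fit(embeddings)
cluster_feature = km.labels_
```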
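Technique 3 from item 4 (dimensionality reduction) compresses embeddings before they hit a downstream model. A one-step PCA sketch, again on random stand-in vectors with arbitrary sizes:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(100, 256))   # stand-in high-dimensional embeddings

# Keep the top 16 principal components as compact model features.
reduced = PCA(n_components=16, random_state=0).fit_transform(embeddings)
```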
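Item 5's fix, applying hard filters for dealbreakers before any cosine ranking, can be sketched as follows. The profile fields (`smokes`, `min_age`) and the tiny random embeddings are hypothetical examples, not the author's schema:

```python
import numpy as np

rng = np.random.default_rng(0)
profiles = [
    {"smokes": False, "min_age": 25, "embedding": rng.normal(size=8)},
    {"smokes": True,  "min_age": 30, "embedding": rng.normal(size=8)},
    {"smokes": False, "min_age": 21, "embedding": rng.normal(size=8)},
]
query = {"smokes": False, "age": 24, "embedding": rng.normal(size=8)}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hard filters first: dealbreakers eliminate candidates outright.
candidates = [p for p in profiles
              if p["smokes"] == query["smokes"] and query["age"] >= p["min_age"]]

# Cosine similarity only ranks the survivors.
ranked = sorted(candidates,
                key=lambda p: cosine(p["embedding"], query["embedding"]),
                reverse=True)
```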
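The text chunker in item 7 splits documents into overlapping segments. A simplified character-based sketch (the tutorial's version also respects sentence boundaries; the size and overlap defaults here are assumptions):

```python
def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character chunks so neighboring
    chunks share context at their boundaries."""
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks
```

With a 500-character input and these defaults, this yields four chunks, and each chunk's last 50 characters reappear at the start of the next.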
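The retrieval step shared by items 1, 7, and 9 reduces to cosine similarity between a query embedding and stored document embeddings. A brute-force NumPy sketch with random stand-in vectors; this is exactly the exhaustive scan that item 9 notes real vector databases avoid via indexes like HNSW or IVF, and that FAISS accelerates in item 7:

```python
import numpy as np

rng = np.random.default_rng(0)
docs = rng.normal(size=(100, 64))     # stand-in document embeddings
query = rng.normal(size=64)           # stand-in query embedding

# Normalize so a dot product equals cosine similarity.
docs_n = docs / np.linalg.norm(docs, axis=1, keepdims=True)
query_n = query / np.linalg.norm(query)

scores = docs_n @ query_n             # cosine similarity to every document
top_k = np.argsort(scores)[::-1][:5]  # indices of the 5 nearest documents
```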


SemanticScuttle - klotz.me: tagged with "embeddings"
