SemanticScuttle - klotz.me » Tags: nlp+embedding

Tags: nlp* + embedding*

0 bookmark(s) - Sort by: Date ↓ / Title /

This page details the command-line utility for the Embedding Atlas, a tool for exploring large text datasets with metadata. It covers installation, data loading (local and Hugging Face), visualization of embeddings using SentenceTransformers and UMAP, and usage instructions with available options.

2025-08-13 Tags: embedding, text, data, visualization, umap, sentence transformers, command line, hugging face, parquet, duckdb by klotz

Pairwise Cross-Variance Classification

Multi-class zero-shot embedding classification and error checking. This project improves zero-shot image/text classification using a novel dimensionality reduction technique and pairwise comparison, resulting in increased agreement between text and image classifications.

2025-06-04 Tags: classification, machine learning, multimodal learning, similarity search, embedding, image classification by klotz

Building LLM Workflows - - some observations

A post with pithy observations and clear conclusions from building complex LLM workflows, covering topics like prompt chaining, data structuring, model limitations, and fine-tuning strategies.

2025-05-09 Tags: llm, localllama, prompt engineering, fine-tuning, agentic loops, context window, bert, xml, cot, workflow, reddit by klotz

Why Your RAG Embeddings Are Costing You a Fortune (And How I Fixed It)

This article details the often overlooked cost of storing embeddings for RAG systems, and how quantization techniques (int8 and binary) can significantly reduce storage requirements and improve retrieval speed without substantial accuracy loss.

2025-04-30 Tags: rag, embedding, vector database, transformers, llm, quantization by klotz

The power of the humble embedding

Ryan speaks with Edo Liberty, Founder and CEO of Pinecone, about building vector databases, the power of embeddings, the evolution of RAG, and fine-tuning AI models.

2025-04-02 Tags: pinecone, machine learning, embedding, vector databases, semantic search, rag by klotz

Primer LLM Embedding

This Space demonstrates a simple method for embedding text using a LLM (Large Language Model) via the Hugging Face Inference API. It showcases how to convert text into numerical vector representations, useful for semantic search and similarity comparisons.

2025-03-28 Tags: llm, embedding, hugging face, inference, api, semantic search, vector representation, text embedding by klotz

A Coding Implementation to Build a Document Search Agent (DocSearchAgent) with Hugging Face, ChromaDB, and Langchain

This tutorial demonstrates how to build a powerful document search engine using Hugging Face embeddings, Chroma DB, and Langchain for semantic search capabilities.

2025-03-21 Tags: document, search, hugging face, chromadb, langchain, vector database, embedding, agents, llm by klotz

Qodo-Embed-1-1.5B

Qodo-Embed-1-1.5B is a state-of-the-art code embedding model designed for retrieval tasks in the software development domain. It supports multiple programming languages and is optimized for natural language-to-code and code-to-code retrieval, making it highly effective for applications such as code search and retrieval-augmented generation.

2025-03-04 Tags: qodo-embed-1, code, embedding, llm, software development, huggingface by klotz

Qodo’s open code embedding model sets new enterprise standard, beating OpenAI, Salesforce

Qodo releases Qodo-Embed-1-1.5B, an open-source code embedding model that outperforms competitors from OpenAI and Salesforce, enhancing code search, retrieval, and understanding for enterprise development teams.

2025-03-04 Tags: qodo, code, embedding, llm, search, retrieval, software engineering by klotz

A Complete Introduction to Using BERT Models

This article provides a comprehensive guide on the basics of BERT (Bidirectional Encoder Representations from Transformers) models. It covers the architecture, use cases, and practical implementations, helping readers understand how to leverage BERT for natural language processing tasks.

2025-02-07 Tags: bert, natural language processing, machine learning, transformers, text classification, sentiment analysis, llm by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: nlp* + embedding*

Linked Tags

Related Tags