SemanticScuttle - klotz.me » Tags: text+machine learning

Tags: text* + machine learning*

0 bookmark(s) - Sort by: Date ↓ / Title /

A Beginner’s Reading List for Large Language Models for 2026

A curated reading list for those starting to learn about Large Language Models (LLMs), covering foundational concepts, practical applications, and future trends, updated for 2026.

2026-02-06 Tags: llm, machine learning, deep learning, nlp, reading list, 2026 by klotz

GenAI_Agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.

2026-01-02 Tags: agents, nlp, llm, machine learning, natural language processing by klotz

The Optimal Architecture for Small Language Models

This article details research into finding the optimal architecture for small language models (70M parameters), exploring depth-width tradeoffs, comparing different architectures, and introducing Dhara-70M, a diffusion model offering 3.8x faster throughput with improved factuality.

2025-12-27 Tags: llm, nlp, small language models, architecture, diffusion, llama, gemma, deep learning by klotz

Choosing the Right Chunking Strategy: A Comprehensive Guide to RAG Optimization

This article explores different chunking strategies for Retrieval-Augmented Generation (RAG) systems, comparing nine approaches using the agenticmemory library to improve retrieval accuracy and reduce hallucinations.

2025-12-22 Tags: llm, performance, rag, chunking, embedding, vector database, rag optimization by klotz

Command Line Utility | Embedding Atlas

This page details the command-line utility for the Embedding Atlas, a tool for exploring large text datasets with metadata. It covers installation, data loading (local and Hugging Face), visualization of embeddings using SentenceTransformers and UMAP, and usage instructions with available options.

2025-08-13 Tags: embedding, text, data, visualization, umap, sentence transformers, command line, hugging face, parquet, duckdb by klotz

Topic Model Labelling with LLMs

Python tutorial for reproducible labeling of cutting-edge topic models with GPT4-o-mini. The article details training a FASTopic model and labeling its results using GPT-4.0 mini, emphasizing reproducibility and control over the labeling process.

2025-07-15 Tags: llm, machine learning, nlp, python, topic modeling, fastopic, turftopic, gpt-4, classification by klotz

Pairwise Cross-Variance Classification

Multi-class zero-shot embedding classification and error checking. This project improves zero-shot image/text classification using a novel dimensionality reduction technique and pairwise comparison, resulting in increased agreement between text and image classifications.

2025-06-04 Tags: classification, machine learning, multimodal learning, similarity search, embedding, image classification by klotz

Building LLM Workflows - - some observations

A post with pithy observations and clear conclusions from building complex LLM workflows, covering topics like prompt chaining, data structuring, model limitations, and fine-tuning strategies.

2025-05-09 Tags: llm, localllama, prompt engineering, fine-tuning, agentic loops, context window, bert, xml, cot, workflow, reddit by klotz

Why Your RAG Embeddings Are Costing You a Fortune (And How I Fixed It)

This article details the often overlooked cost of storing embeddings for RAG systems, and how quantization techniques (int8 and binary) can significantly reduce storage requirements and improve retrieval speed without substantial accuracy loss.

2025-04-30 Tags: rag, embedding, vector database, transformers, llm, quantization by klotz

The power of the humble embedding

Ryan speaks with Edo Liberty, Founder and CEO of Pinecone, about building vector databases, the power of embeddings, the evolution of RAG, and fine-tuning AI models.

2025-04-02 Tags: pinecone, machine learning, embedding, vector databases, semantic search, rag by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: text* + machine learning*

Linked Tags

Related Tags