SemanticScuttle - klotz.me » Tags: retrieval augmented generation

Tags: retrieval augmented generation*

0 bookmark(s) - Sort by: Date ↓ / Title /

A 100-line minimalist LLM framework for Agents, Task Decomposition, RAG, etc. It models the LLM workflow as a Graph + Shared Store with nodes handling simple tasks, connected through actions for agents, and orchestrated by flows for task decomposition.

2025-03-04 Tags: llm framework, agents, task decomposition, rag, graph, shared store, nodes, flows, communication, batch, async, utility function, design pattern by klotz

PocketFlow

Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves.

2025-03-04 Tags: open-source, framework, llm, flow, agent, rag by klotz

Prompt Engineering Patterns for Successful RAG Implementations

A guide on implementing prompt engineering patterns to make RAG implementations more effective and efficient, covering patterns like Direct Retrieval, Chain of Thought, Context Enrichment, Instruction-Tuning, and more.

2025-02-27 Tags: prompt engineering, rag, llm by klotz

6 Common LLM Customization Strategies Briefly Explained

The article explains six essential strategies for customizing Large Language Models (LLMs) to better meet specific business needs or domain requirements. These strategies include Prompt Engineering, Decoding and Sampling Strategy, Retrieval Augmented Generation (RAG), Agent, Fine-Tuning, and Reinforcement Learning from Human Feedback (RLHF). Each strategy is described with its benefits, limitations, and implementation approaches to align LLMs with specific objectives.

2025-02-25 Tags: llm, prompt engineering, rag, agent, fine-tuning, rlhf by klotz

SearchResearch Commentary (2/13/25): Using NotebookLM to Help with DeepResearch

This article explores the use of Google's NotebookLM (NLM) as a tool for research, particularly in analyzing the impact of the Aswan High Dam on schistosomiasis in Egypt. The author details how NLM can be used to create a research assistant-like experience, allowing users to 'have a conversation' with uploaded content to gain insights and answers from the material.

2025-02-21 Tags: notebooklm, google, deepresearch, llm, research tools, rag, dan russell by klotz

SQLite RAG Tutorial

A simple project demonstrating Retrieval Augmented Generation (RAG) using SQLite, sqlite-vec, and OpenAI. It embeds text files, stores them in a SQLite database, and retrieves relevant documents using vector search. The project features lightweight single-file SQLite databases, vector search capabilities, and OpenAI integration for embeddings and chat responses.

2025-02-20 Tags: sqlite, rag, sqlite-vec, vector search, embeddings, llm, github, edizaguirre by klotz

Retrieval Augmented Generation in SQLite

The article explores the concept of Retrieval-Augmented Generation (RAG) using SQLite, specifically with the sqlite-vec extension and the OpenAI API. It outlines a simplified approach to RAG, moving away from complex frameworks and cloud vector databases, using SQLite's virtual tables for vector search and semantic understanding.

2025-02-20 Tags: rag, llm sqlite, sqlite-vec, vector search, machine learning, data science by klotz

Building a Secure RAG with Python, LangChain, and OpenFGA

Learn how to use Okta FGA to secure your LangChain RAG agent in Python.

2025-02-14 Tags: rag, llm, langchain, authorization, okta by klotz

Creating an AI-Powered Tutor Using Vector Database and Groq for Retrieval-Augmented Generation (RAG): Step by Step Guide

This article provides a step-by-step guide to creating an AI-powered English tutor using Retrieval-Augmented Generation (RAG). It integrates a vector database (ChromaDB) for storing and retrieving relevant English language learning materials and Groq API for generating structured and engaging lessons. The tutorial covers installing necessary libraries, setting up the environment, defining a vector database class, implementing AI lesson generation, and combining vector retrieval with AI generation.

2025-02-02 Tags: tutor, vector database, chromadb, rag, llm, knowledge retrieval by klotz

Llama Stack v0.1.0 Release

Llama Stack v0.1.0 introduces a stable API release enabling developers to build RAG applications and agents, integrate with various tools, and use telemetry for monitoring and evaluation. This release provides a comprehensive interface, rich provider ecosystem, and multiple developer interfaces, along with sample applications for Python, iOS, and Android.

2025-01-25 Tags: llama stack, rag, agents, api, github, llm by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: retrieval augmented generation*

Linked Tags

Related Tags