SemanticScuttle - klotz.me » klotz: llm+rag

klotz: llm* + rag*


Using Llamafiles for Embeddings in Local RAG Applications

A blog post discussing the use of Llamafiles for embeddings in Retrieval-Augmented Generation (RAG) applications and recommending the best models based on performance on RAG-relevant tasks.

2024-05-17 Tags: llamafiles, embeddings, rag, text embedding models, llm by klotz

Proxy Fine-Tuning LLMs

- Proxy fine-tuning is a method to improve large pre-trained language models without directly accessing their weights.
- It operates on top of black-box LLMs by utilizing only their predictions.
- The approach combines elements of retrieval-based techniques, fine-tuning, and domain-specific adaptations.
- Proxy fine-tuning can be used to achieve the performance of heavily-tuned large models by only tuning smaller models.
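
The bullets above say that the method uses only a black-box model's predictions and tunes only smaller models. One way to picture this is as logit arithmetic at decoding time: the difference between a tuned and an untuned small model is added to the large model's next-token logits. The sketch below assumes that formulation, which may differ from what the bookmarked post describes. The function names and the three-token logit values are hypothetical and chosen only for illustration.

```python
import math


def softmax(logits):
    """Convert logits to probabilities in a numerically stable way."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def proxy_tuned_distribution(large_base, small_tuned, small_base):
    """Shift the large model's logits by the small models' tuned-minus-base offset.

    Assumption: all three models share one vocabulary, so the logit lists align.
    """
    shifted = [l + (t - b) for l, t, b in zip(large_base, small_tuned, small_base)]
    return softmax(shifted)


# Hypothetical next-token logits over a 3-token vocabulary.
large_base = [2.0, 1.0, 0.0]   # large model, weights untouched
small_base = [1.0, 1.0, 1.0]   # small model before tuning
small_tuned = [0.0, 3.0, 1.0]  # small model after tuning

probs = proxy_tuned_distribution(large_base, small_tuned, small_base)
print(probs)
```

In this toy case the large model alone prefers token 0, while the tuned small model's offset moves the combined distribution toward token 1. When the tuned and untuned small models agree, the large model's distribution is unchanged.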

2024-05-11 Tags: proxy, fine-tuning, llm, retrieval-augmented generation, domain-specific adaptations, data delivery, rag, catastrophic forgetting, drift by klotz

Service Development Kit: Terraform AWS ECS Setup with Rust Actix App, Postgres RDS, LLM, RAG, Cloudflare, and more

A service development kit built with Terraform, AWS ECS, a Rust Actix app, Postgres RDS, LLM, RAG, and Cloudflare.
• A step-by-step guide to setting up the service development kit, including creating an SSL certificate, setting up Terraform, and configuring Cloudflare.
• Covers Rust, LLM, and RAG in the service development kit.

2024-05-07 Tags: terraform, aws ecs, rust, actix app, postgress rds, llm, rag, production engineering by klotz

Retrieving and Filtering with Metadata

In this tutorial, we will build a RAG system with a self-querying retriever in the LangChain framework. This will enable us to filter the retrieved movies using metadata, thus providing more meaningful movie recommendations.
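
The idea of filtering retrieved movies by metadata can be sketched without any framework: apply the structured filter first, then rank the remaining items by embedding similarity. The sketch below is plain Python, not the LangChain self-querying retriever used in the tutorial. The `Movie` fields, the `retrieve` function, and the sample data are all hypothetical. In a self-querying setup the filter values would be produced by an LLM from the user's question, whereas here they are passed in by hand.

```python
from dataclasses import dataclass


@dataclass
class Movie:
    title: str
    genre: str
    year: int
    embedding: list  # toy 2-d vector standing in for a real text embedding


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def retrieve(movies, query_embedding, genre=None, min_year=None, k=2):
    """Filter on metadata first, then rank the survivors by similarity."""
    candidates = [
        m for m in movies
        if (genre is None or m.genre == genre)
        and (min_year is None or m.year >= min_year)
    ]
    ranked = sorted(candidates, key=lambda m: dot(m.embedding, query_embedding), reverse=True)
    return ranked[:k]


# Hypothetical catalogue; titles and vectors are made up for illustration.
movies = [
    Movie("A", "sci-fi", 1999, [0.9, 0.1]),
    Movie("B", "sci-fi", 2015, [0.8, 0.2]),
    Movie("C", "drama", 2018, [0.95, 0.05]),
    Movie("D", "sci-fi", 2020, [0.2, 0.8]),
]

filtered = [m.title for m in retrieve(movies, [1.0, 0.0], genre="sci-fi", min_year=2010)]
unfiltered = [m.title for m in retrieve(movies, [1.0, 0.0])]
print(filtered, unfiltered)
```

Without the filter, the two most similar items win regardless of genre or year. With it, only items that satisfy the metadata constraints are ranked at all.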

2024-04-27 Tags: recommender systems, llm, langchain, rag, movie by klotz

Building RAG Application using Gemma 7B LLM & Upstash Vector Database

Retrieval-Augmented Generation (RAG) is the concept of providing large language models (LLMs) with additional information from an external knowledge source. This allows them to generate more accurate and contextual answers while reducing hallucinations. In this article, we will provide a step-by-step guide to building a complete RAG application using Google's latest open-source LLM, Gemma 7B, and the Upstash serverless vector database.

2024-03-12 Tags: llm, rag, gemma, upstash by klotz

Overcoming the Limits of RAG with ColBERT

ColBERT is a new way of scoring passage relevance using a BERT language model that substantially solves the problems with dense passage retrieval.
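
ColBERT's scoring is usually described as late interaction: the query and the passage each keep one vector per token, and the relevance score is the sum, over query tokens, of the best-matching passage token similarity (often called MaxSim). The sketch below shows only that scoring rule, assuming cosine similarity. The toy 2-d vectors stand in for real BERT token embeddings and are hypothetical, as is the function name `maxsim_score`.

```python
import math


def normalize(v):
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs, doc_vecs):
    """Sum over query tokens of the maximum cosine similarity to any passage token."""
    q = [normalize(v) for v in query_vecs]
    d = [normalize(v) for v in doc_vecs]
    return sum(max(dot(qv, dv) for dv in d) for qv in q)


# Hypothetical token embeddings (one vector per token).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]  # has a close match for each query token
doc_b = [[1.0, 1.0]]                          # one "averaged" vector only

score_a = maxsim_score(query, doc_a)
score_b = maxsim_score(query, doc_b)
print(score_a, score_b)
```

Here `doc_b` behaves like a single pooled vector, which is roughly what dense passage retrieval compares against. `doc_a` scores higher because each query token finds its own best match.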

2024-03-12 Tags: llm, rag, embedding, bert, colbert, cosine distance, concept expansion by klotz

Implementing semantic cache to improve a RAG system with FAISS

In this notebook, we will explore a typical RAG solution where we will utilize an open-source model and the vector database Chroma DB. However, we will integrate a semantic cache system that will store various user queries and decide whether to generate the prompt enriched with information from the vector database or the cache.
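
The cache decision described above can be sketched as a similarity lookup with a threshold: if a new query's embedding is close enough to a stored one, the cached result is reused, otherwise the vector database is queried and the result is stored. The sketch below is a minimal illustration using a linear scan in plain Python, not FAISS or Chroma DB as in the notebook. The class name, the 0.9 threshold, the toy embeddings, and `fake_vector_db` are all assumptions.

```python
import math


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


class SemanticCache:
    def __init__(self, threshold=0.9):
        self.threshold = threshold  # assumed cutoff; would need tuning in practice
        self.entries = []           # list of (query_embedding, stored_result)

    def lookup(self, embedding):
        """Return the result of the most similar stored query, or None on a miss."""
        best_result, best_sim = None, -1.0
        for stored, result in self.entries:
            sim = cosine(embedding, stored)
            if sim > best_sim:
                best_result, best_sim = result, sim
        return best_result if best_sim >= self.threshold else None

    def store(self, embedding, result):
        self.entries.append((embedding, result))


def answer_query(embedding, cache, query_vector_db):
    """Serve from the cache when possible, otherwise fall back to the vector database."""
    cached = cache.lookup(embedding)
    if cached is not None:
        return cached, "cache"
    result = query_vector_db(embedding)
    cache.store(embedding, result)
    return result, "vector_db"


# Hypothetical stand-in for a real vector database query.
calls = []

def fake_vector_db(embedding):
    calls.append(embedding)
    return "context for " + str(len(calls))


cache = SemanticCache(threshold=0.9)
r1 = answer_query([1.0, 0.0], cache, fake_vector_db)    # first query: cache miss
r2 = answer_query([0.99, 0.05], cache, fake_vector_db)  # near-duplicate: cache hit
r3 = answer_query([0.0, 1.0], cache, fake_vector_db)    # unrelated: cache miss
print(r1, r2, r3)
```

A real system would replace the linear scan with an approximate nearest-neighbour index and would also need an eviction policy, both of which are left out here.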

2024-03-12 Tags: llm, rag, chromadb, faiss, cache by klotz

12 RAG Pain Points and Their Solutions

2024-01-30 Tags: rag, llm by klotz

Deploying LLM Apps to AWS, the Open-Source Self-Service Way

A step-by-step guide to deploying LlamaIndex RAG applications to AWS ECS Fargate.

2024-01-15 Tags: llm, aws fargate, terraform, llamaindex, rag by klotz

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

A deep dive into model quantization with GGUF and llama.cpp, and model evaluation with LlamaIndex.

2024-01-15 Tags: llm, gguf, georgi gerganov, llama.cpp, llamaindex, huggingface, rag by klotz
