This tutorial provides a step-by-step guide on building an LLM router to balance the use of high-quality closed LLMs like GPT-4 and cost-effective open-source LLMs, achieving high response quality while minimizing costs. The approach includes preparing labeled data, finetuning a causal LLM classifier, and offline evaluation using the RouteLLM framework.
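For a sense of what routing looks like in practice, here is a minimal sketch along the lines of the RouteLLM README: a `Controller` exposes an OpenAI-compatible interface, and the model string selects a router plus a cost threshold. The model identifiers and threshold below are illustrative, not the tutorial's exact settings.

```python
from routellm.controller import Controller

# Route between a strong closed model and a cheap open model.
# Model names and the 0.11593 threshold are illustrative values.
client = Controller(
    routers=["mf"],                      # matrix-factorization router
    strong_model="gpt-4-1106-preview",
    weak_model="groq/llama3-8b-8192",
)

response = client.chat.completions.create(
    model="router-mf-0.11593",           # router name + cost threshold
    messages=[{"role": "user", "content": "Summarize RAG in one sentence."}],
)
print(response.choices[0].message.content)
```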
Llama-agents is an async-first framework for building, iterating on, and productionizing multi-agent systems, including multi-agent communication, distributed tool execution, human-in-the-loop workflows, and more. A rough sketch of the moving parts follows.
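The sketch below is adapted from the project's early README and may not match later releases exactly (class names and parameters have been changing): agents are wrapped as services, communicate over a shared message queue, and are coordinated by a control plane.

```python
from llama_agents import (
    AgentService,
    AgentOrchestrator,
    ControlPlaneServer,
    SimpleMessageQueue,
    LocalLauncher,
)
from llama_index.core.agent import ReActAgent
from llama_index.core.tools import FunctionTool
from llama_index.llms.openai import OpenAI

def get_the_secret_fact() -> str:
    """Returns the secret fact."""
    return "The secret fact is: A baby llama is called a 'Cria'."

tool = FunctionTool.from_defaults(fn=get_the_secret_fact)
agent = ReActAgent.from_tools([tool], llm=OpenAI())

# Agents talk over a shared message queue, coordinated by a control plane
message_queue = SimpleMessageQueue()
control_plane = ControlPlaneServer(
    message_queue=message_queue,
    orchestrator=AgentOrchestrator(llm=OpenAI()),
)
agent_service = AgentService(
    agent=agent,
    message_queue=message_queue,
    description="Useful for getting the secret fact.",
    service_name="secret_fact_agent",
)

# Run everything in-process for local iteration
launcher = LocalLauncher([agent_service], control_plane, message_queue)
print(launcher.launch_single("What is the secret fact?"))
```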
Automates conversion of various file types and GitHub repositories into LLM-ready Markdown documents.
A mini Python-based tool designed to convert various file types and GitHub repositories into LLM-ready Markdown documents with metadata, a table of contents, and consistent heading styles. It supports multiple file types, handles zip files, and integrates with GitHub.
The llmsherpa project provides APIs to accelerate Large Language Model (LLM) projects. It includes features like LayoutPDFReader for PDF text parsing, smart chunking for vector search and Retrieval Augmented Generation, and table analysis. It is open source under the Apache 2.0 license.
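A short sketch of the LayoutPDFReader flow, adapted from the project's README; the hosted API URL and sample PDF are placeholders you would swap for your own (a self-hosted parser endpoint works too).

```python
from llmsherpa.readers import LayoutPDFReader

# Hosted parsing endpoint from the llmsherpa README; swap in your own if self-hosting
llmsherpa_api_url = "https://readers.llmsherpa.com/api/document/developer/parseDocument?renderFormat=all"
pdf_url = "https://arxiv.org/pdf/1910.13461.pdf"  # any URL or local file path

pdf_reader = LayoutPDFReader(llmsherpa_api_url)
doc = pdf_reader.read_pdf(pdf_url)

# Layout-aware "smart chunks" suitable for embedding and RAG
for chunk in doc.chunks():
    print(chunk.to_context_text())
```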
Unblocked can not only ingest your code repositories, but also related material — your website, your product documentation, your conversations in GitHub issues and Slack — in order to provide a service that I call context assembly. I picked up that term from Jack Ozzie, back when he was working with his brother Ray on Groove, a peer-to-peer successor to Ray’s greatest hit, Lotus Notes, which pioneered what became known as knowledge management. Like Notes, Groove brought information work into shared spaces where you could search your mail, calendars, documents, and data all at once.
txtai is an open-source embeddings database for various applications such as semantic search, LLM orchestration, language model workflows, and more. It allows users to perform vector search with SQL, create embeddings for text, audio, images, and video, and run pipelines powered by language models for question-answering, transcription, translation, and more.
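For a flavor of the API, here is a small example in the spirit of the txtai documentation; the indexed strings are made up, and `content=True` is what enables SQL queries over the stored text.

```python
from txtai import Embeddings

# content=True stores the original text so SQL queries can return it
embeddings = Embeddings(content=True)
embeddings.index([
    "US tops 5 million confirmed virus cases",
    "Beijing mobilises invasion craft along coast",
    "Maine man wins $1M from $25 lottery ticket",
])

# Plain vector search
print(embeddings.search("public health story", 1))

# Vector search combined with SQL
print(embeddings.search(
    "SELECT id, text, score FROM txtai WHERE similar('public health story') LIMIT 1"
))
```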
This article explains Retrieval Augmented Generation (RAG), a method to reduce the risk of hallucinations in Large Language Models (LLMs) by limiting the context in which they generate answers. RAG is demonstrated using txtai, an open-source embeddings database for semantic search, LLM orchestration, and language model workflows.
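The core idea reduces to retrieve-then-prompt. Below is a minimal sketch with txtai, assuming a locally runnable chat model; the model name and documents are placeholders, not the article's own example.

```python
from txtai import Embeddings, LLM

# Small illustrative corpus
embeddings = Embeddings(content=True)
embeddings.index([
    "txtai is an open-source embeddings database.",
    "RAG limits an LLM to answering from retrieved context.",
])

# Placeholder model; any chat model supported by the LLM pipeline works
llm = LLM("TinyLlama/TinyLlama-1.1B-Chat-v1.0")

question = "What does RAG do?"

# Retrieve context, then constrain the answer to it
context = "\n".join(row["text"] for row in embeddings.search(question, 3))
prompt = f"""Answer the question using only the context below.
Context: {context}
Question: {question}"""

print(llm(prompt))
```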
This post highlights how the GitHub Copilot Chat VS Code Extension was vulnerable to data exfiltration via prompt injection when analyzing untrusted source code.
Retrochat is a chat application that supports Llama.cpp, Kobold.cpp, and Ollama. The release notes highlight new features, commands for configuration, chat management, and models, and provide a download link.