Free and Open Source LLM projects on GitHub.
0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag
A simple project demonstrating Retrieval Augmented Generation (RAG) using SQLite, sqlite-vec, and OpenAI. It embeds text files, stores them in a SQLite database, and retrieves relevant documents using vector search. The project features lightweight single-file SQLite databases, vector search capabilities, and OpenAI integration for embeddings and chat responses.
Harbor is a containerized LLM toolkit that allows you to run LLMs and additional services with ease, featuring a CLI and a companion App for managing AI services.
Boxy is a Boxer-inspired box editor that provides various functionalities for managing and manipulating boxes. It supports key bindings, modules, and allows running in a browser. The editor includes features such as mouse and keyboard interactions, saving and restoring boxes, markdown visualization, and LLM inference.
The article discusses four open-source AI research agents that serve as cost-effective alternatives to OpenAI’s Deep Research AI Agent. These alternatives offer robust search capabilities, AI-powered extraction, and reasoning features, allowing researchers to automate and optimize their workflows without incurring high costs.
Introducing agent mode for GitHub Copilot in VS Code, announcing the general availability of Copilot Edits, and providing a first look at the SWE agent codenamed Project Padawan.
This repository provides an overview of resources for the paper 's1: Simple test-time scaling', which includes minimal recipes for test-time scaling and strong reasoning performance. It covers artifacts, structure, inference, training, evaluation, data, visuals, and citation details.
A set of tools to help you work with Mistral models, including tokenization, validation, and normalization code.
A quickstart guide to installing, configuring, and using the Goose AI agent for software development tasks.
Qwen2.5-1M models and inference framework support for long-context tasks, with a context length of up to 1M tokens.
This folder contains some example client scripts using our Python SDK for connecting with Llama Stack Distros. Instructions are provided for setting up dependencies and running demo scripts and apps.
First / Previous / Next / Last
/ Page 1 of 0