SemanticScuttle - klotz.me » klotz: llama.cpp+llm

klotz: llama.cpp* + llm*

smol-tools

A collection of lightweight AI-powered tools built with llama.cpp and small language models.

2024-11-07 Tags: smollm, smol_tools, llama.cpp, llm, self-hosted, summarizer, rewriter, agent by klotz

TIL: Quantize and use Llama 3.1 with llama.cpp on a Mac

A guide on how to download, convert, quantize, and use the Llama 3.1 8B model with llama.cpp on a Mac.

2024-09-28 Tags: llama.cpp, quantization, llm, howto by klotz

TIL: Building llamafiles from Llama 3.2 GGUFs

A step-by-step guide on building llamafiles from Llama 3.2 GGUFs, including scripting and Dockerization.

2024-09-28 Tags: llamafile, llama.cpp, llm, llama 3.2, gguf, model quantization, docker, mozilla-ocho by klotz

How to Get JSON Output from LLMs: A Practical Guide

Tutorial on enforcing JSON output with llama.cpp or the Gemini API for structured data generation from LLMs.

2024-08-25 Tags: llm, json, gbnf, llama.cpp, gemini, tools by klotz

large-model-proxy

Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM or other resources.

2024-07-22 Tags: llm, proxy, llama.cpp, github, golang by klotz

LLooM: Leverage raw LLM logits to weave threads

LLooM is a tool that uses raw LLM logits to weave threads in a probabilistic way. The README includes instructions for using LLooM with various backends, such as vLLM, llama.cpp, and OpenAI, and explains its parameters and configuration.

2024-07-04 Tags: lloom, llm, logits, vllm, llama.cpp, openai, greedy decoding, beamsearch, github by klotz

llama.cpp quant names

An explanation of the quant names used in the llama.cpp implementation, as well as information on the different types of quant schemes available.

2024-06-23 Tags: llama.cpp, quantization, llm by klotz

Retrochat v0.0.4 Release

Retrochat is a chat application that supports llama.cpp, Kobold.cpp, and Ollama. The release notes highlight new features and commands for configuration, chat management, and models, and provide a download link for the release.

2024-06-14 Tags: retrochat, llama.cpp, llm, ollama, chat, github, cli, text ui by klotz

ShelbyJenkins/llm_utils - GitHub

Utilities for llama.cpp, OpenAI, Anthropic, Mistral-rs. A collection of tools for interacting with various large language models. The code is written in Rust and includes functions for loading models, tokenization, prompting, text generation, and more.

2024-06-04 Tags: github, llm_utils, llama.cpp, rust, large language model by klotz

localllm/llm-tool at main · GoogleCloudPlatform/localllm

llm-tool provides a command-line utility for running large language models locally. It includes scripts for pulling models from the internet, starting them, and managing them using various commands such as 'run', 'ps', 'kill', 'rm', and 'pull'. Additionally, it offers a Python script named 'querylocal.py' for querying these models. The repository also come

2024-02-08 Tags: llm, localllama, self-hosted, google, gcp, foss, llama.cpp, github by klotz
