llama-cpp-python offers a web server that aims to act as a drop-in replacement for the OpenAI API. This allows you to use llama.cpp-compatible models with any OpenAI-compatible client (language libraries, services, etc.).
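As a sketch of what "drop-in replacement" means in practice: the server exposes OpenAI-style endpoints such as `/v1/chat/completions`, so a request built for the OpenAI API can simply be pointed at the local host instead. The host, port, and model name below are assumptions (the llama-cpp-python server defaults to port 8000 and serves whichever GGUF model it was started with).

```python
import json
import urllib.request

# Assumed local llama-cpp-python server address (default port is 8000).
BASE_URL = "http://localhost:8000/v1"

# The same payload shape the OpenAI chat completions API accepts.
payload = {
    "model": "local-model",  # placeholder; the server answers for the model it loaded
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is GGUF?"},
    ],
    "temperature": 0.7,
}

request = urllib.request.Request(
    BASE_URL + "/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

# Uncomment once a server is running locally, e.g. started with:
#   python -m llama_cpp.server --model <path-to-model>.gguf
#
# with urllib.request.urlopen(request) as response:
#     reply = json.load(response)
#     print(reply["choices"][0]["message"]["content"])
```

Because the endpoints mirror the OpenAI API, the official `openai` client library also works against this server by overriding its base URL.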
A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex
- create a custom base image for a Cloud Workstation environment using a Dockerfile
Uses:
- Quantized models from