SemanticScuttle - klotz.me » klotz: llama.cpp+gguf

klotz: llama.cpp* + gguf*

TIL: Building llamafiles from Llama 3.2 GGUFs

A step-by-step guide on building llamafiles from Llama 3.2 GGUFs, including scripting and Dockerization.

2024-09-28 Tags: llamafile, llama.cpp, llm, llama 3.2, gguf, model quantization, docker, mozilla-ocho by klotz
GoogleCloudPlatform/localllm: Run LLMs locally on Cloud Workstations
- create a custom base image for a Cloud Workstation environment using a Dockerfile . Uses:
Quantized models from
2024-02-08 Tags: llm, google, locallama, github, foss, gguf, huggingface, llama.cpp by klotz
Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

2024-01-15 Tags: llm, gguf, georgi gerganov, llama.cpp, llamaindex, huggingface, rag by klotz

First / Previous / Next / Last / Page 1 of 0