SemanticScuttle - klotz.me » klotz: gguf+llama.cpp

klotz: gguf* + llama.cpp*

Bookmarks on this page are managed by an admin user.

GoogleCloudPlatform/localllm: Run LLMs locally on Cloud Workstations

- create a custom base image for a Cloud Workstation environment using a Dockerfile
. Uses:

Quantized models from

2024-02-08 Tags: llm, google, locallama, github, foss, gguf, huggingface, llama.cpp by klotz
Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

2024-01-15 Tags: llm, gguf, georgi gerganov, llama.cpp, llamaindex, huggingface, rag by klotz

First / Previous / Next / Last / Page 1 of 0