Ollama now supports HuggingFace GGUF models, making it easier to run AI models locally without an internet connection. The GGUF format allows large models to run on modest consumer hardware.
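As a minimal sketch of this workflow (the model file name and parameters below are illustrative, not from the source), a downloaded GGUF file can be registered with Ollama through a Modelfile:

```
# Ollama Modelfile — builds a local model from a GGUF file on disk.
# The file name is illustrative; any GGUF quantization level works.
FROM ./llama-3.2-1b-instruct-q4_k_m.gguf

# Optional generation settings for local testing.
PARAMETER temperature 0.7
```

With Ollama installed, `ollama create mymodel -f Modelfile` registers the model and `ollama run mymodel` starts a local session; recent Ollama versions can also pull a GGUF repository directly from Hugging Face with `ollama run hf.co/<username>/<repository>`.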
- Create a custom base image for a Cloud Workstation environment using a Dockerfile
Uses:
- Quantized models from
A deep dive into model quantization with GGUF and llama.cpp, and model evaluation with LlamaIndex