This document details how to run Qwen models locally using the Text Generation Web UI (oobabooga), covering installation, setup, and launching the web interface.
A guide to downloading, converting, quantizing, and using the Llama 3.1 8B model with llama.cpp on a Mac.
An explanation of the quantization type names used in llama.cpp, along with an overview of the different quantization schemes available.