A step-by-step guide on building llamafiles from Llama 3.2 GGUFs, including scripting and Dockerization.
- create a custom base image for a Cloud Workstation environment using a Dockerfile
. Uses:
Quantized models from
A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex