A Ruby script that calculates VRAM requirements for large language models (LLMs) from the model, the bits per weight (bpw), and the context length. Given any two of the quantities, it can determine the third: the VRAM required, the maximum context length that fits, or the highest bpw that fits in the available VRAM.
A Hugging Face Space hosting the LLM-Model-VRAM-Calculator, a tool that calculates the VRAM required to run a specific machine learning model.
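
The three calculation modes described above can be illustrated with a rough sketch. This is not the script's actual implementation. It assumes a common approximation in which VRAM is the sum of the quantized weights, a 16-bit key/value cache that grows linearly with context length, and a fixed overhead. The `ModelSpec` fields, the function names, the 0.5 GiB overhead, and the example 7B-class values are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass

GIB = 1024 ** 3


@dataclass
class ModelSpec:
    # Hypothetical fields; a real tool would typically read these
    # from the model's published configuration.
    n_params: float   # total parameter count
    n_layers: int     # number of transformer layers
    n_kv_heads: int   # number of key/value attention heads
    head_dim: int     # dimension of each attention head


def weights_bytes(spec: ModelSpec, bpw: float) -> float:
    """Size of the quantized weights at `bpw` bits per weight."""
    return spec.n_params * bpw / 8


def kv_cache_bytes(spec: ModelSpec, context: int, kv_bytes: int = 2) -> int:
    """KV cache size: one K and one V vector per layer, per token.

    Assumes `kv_bytes` bytes per element (2 = 16-bit cache).
    """
    return 2 * spec.n_layers * spec.n_kv_heads * spec.head_dim * kv_bytes * context


def required_vram_gib(spec, bpw, context, overhead_gib=0.5):
    """Mode 1: estimated VRAM for a given bpw and context length."""
    total = weights_bytes(spec, bpw) + kv_cache_bytes(spec, context)
    return total / GIB + overhead_gib


def max_context(spec, bpw, vram_gib, overhead_gib=0.5):
    """Mode 2: longest context that fits in `vram_gib` at a given bpw."""
    free = (vram_gib - overhead_gib) * GIB - weights_bytes(spec, bpw)
    per_token = kv_cache_bytes(spec, 1)
    return max(0, int(free // per_token))


def best_bpw(spec, context, vram_gib, overhead_gib=0.5):
    """Mode 3: highest bpw that fits in `vram_gib` at a given context."""
    free = (vram_gib - overhead_gib) * GIB - kv_cache_bytes(spec, context)
    return max(0.0, free * 8 / spec.n_params)


# Illustrative 7B-class model (assumed values, not taken from any real config).
spec = ModelSpec(n_params=7e9, n_layers=32, n_kv_heads=32, head_dim=128)

print(f"VRAM at 4 bpw, 4096 ctx: {required_vram_gib(spec, 4, 4096):.2f} GiB")
print(f"Max context at 4 bpw in 8 GiB: {max_context(spec, 4, 8)} tokens")
print(f"Best bpw at 4096 ctx in 8 GiB: {best_bpw(spec, 4096, 8):.2f}")
```

Because each term is linear in one unknown, the second and third modes are just the first formula solved for context length or bpw. A real calculator would likely also account for quantized KV caches, activation buffers, and backend-specific overhead, none of which are modeled here.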