klotz: vram + ruby reddit + foss + context length + llm + cli + quantization


  1. A Ruby script that calculates VRAM requirements for large language models (LLMs) from the model, bits per weight (bpw), and context length. It can determine the required VRAM, the maximum context length, or the best bpw that fits in the available VRAM.
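
The three modes described above can be sketched roughly as follows. This is a minimal illustration, not the bookmarked script's actual code: it assumes VRAM is approximately the quantized weights plus a 16-bit KV cache, and it ignores activations and runtime overhead. Every name here (`Model`, `EXAMPLE_MODEL`, `required_vram_gib`, `max_context`, `best_bpw`) and the example model figures are hypothetical.

```ruby
# Hypothetical sketch; the real script's formula and interface may differ.
GIB = 1024.0**3

# Illustrative model description (all field values below are assumptions).
Model = Struct.new(:params, :layers, :kv_heads, :head_dim, keyword_init: true)

EXAMPLE_MODEL = Model.new(
  params: 8_000_000_000, # total weight count
  layers: 32,            # transformer layers
  kv_heads: 8,           # key/value heads per layer
  head_dim: 128          # dimension of each head
)

# Bytes needed for the weights at a given bits-per-weight (bpw).
def weight_bytes(model, bpw)
  model.params * bpw / 8.0
end

# Bytes of KV cache per token: one key and one value vector per layer.
def kv_bytes_per_token(model, cache_bits = 16)
  2 * model.layers * model.kv_heads * model.head_dim * cache_bits / 8.0
end

# Mode 1: estimated VRAM (GiB) for a given bpw and context length.
def required_vram_gib(model, bpw:, context:, cache_bits: 16)
  (weight_bytes(model, bpw) + kv_bytes_per_token(model, cache_bits) * context) / GIB
end

# Mode 2: largest context length that fits in the available VRAM.
def max_context(model, bpw:, vram_gib:, cache_bits: 16)
  free = vram_gib * GIB - weight_bytes(model, bpw)
  return 0 if free <= 0

  (free / kv_bytes_per_token(model, cache_bits)).floor
end

# Mode 3: highest bpw (rounded down to 2 decimals) that fits in the
# available VRAM at the requested context length; nil if nothing fits.
def best_bpw(model, context:, vram_gib:, cache_bits: 16)
  free = vram_gib * GIB - kv_bytes_per_token(model, cache_bits) * context
  return nil if free <= 0

  (free * 8.0 / model.params).floor(2)
end

puts required_vram_gib(EXAMPLE_MODEL, bpw: 4.0, context: 8192).round(2)
puts max_context(EXAMPLE_MODEL, bpw: 4.0, vram_gib: 8)
puts best_bpw(EXAMPLE_MODEL, context: 8192, vram_gib: 8)
```

Each mode solves the same weights-plus-cache estimate for a different unknown, which is why one script can answer all three questions from the same inputs.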


