Tags: quantization* + github*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. HQQ is a fast and accurate model quantizer that skips the need for calibration data. It's super simple to implement (just a few lines of code for the optimizer). It can crunch through quantizing the Llama2-70B model in only 4 minutes!
    2024-02-24 Tags: , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "quantization+github"

About - Propulsed by SemanticScuttle