SemanticScuttle - klotz.me

Tags: cpu* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title /

LocalScore

LocalScore is an open benchmark to evaluate local AI task performance across various hardware configurations, measuring Prompt Processing speed, Token Generation speed, Time-to-First-Token (TTFT), and a combined LocalScore.

2025-04-17 Tags: llm, benchmark, performance, gpu, cpu, inference, localscore by klotz
NVIDIA DGX Spark

NVIDIA DGX Spark is a desktop-friendly AI supercomputer powered by the NVIDIA GB10 Grace Blackwell Superchip, delivering 1000 AI TOPS of performance with 128GB of memory. It is designed for prototyping, fine-tuning, and inference of large AI models.

2025-03-24 Tags: machine learning, nvidia, dgx spark, llm, grace blackwell, ai development, inference, data science, gpu, cpu by klotz
GGUF Quantization with Imatrix and K-Quantization to Run LLMs on Your CPU

This article explains how to accurately quantize a Large Language Model (LLM) and convert it to the GGUF format for efficient CPU inference. It covers using an importance matrix (imatrix) and K-Quantization method with Gemma 2 Instruct as an example, while highlighting its applicability to other models like Qwen2, Llama 3, and Phi-3.

2024-09-14 Tags: gguf, quantization, llm, cpu, inference, imatrix by klotz
PowerInfer - High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

2023-12-24 Tags: llm, serving, cpu, gpu, github by klotz
How to Fine-Tune Llama2 for Python Coding | Towards Data Science

2023-08-28 Tags: llama-2, llm, fine tuning, cpu by klotz
Fine-Tune Your LLM Without Maxing Out Your GPU | by John Adeojo | Jul, 2023 | Towards Data Science

2023-08-03 Tags: llm, gpu, cpu, fine-tune by klotz
llama-2 on cpu inference for document q-and a

2023-07-22 Tags: llama-2, llm, cpu, inference, document, q-and a, langchain by klotz
Hugging face getting started

2023-06-25 Tags: huggingface, llm, python, cpu by klotz
Falcon - A guide to finetune and inference - Lightning AI

2023-06-14 Tags: falcon, llm, fine-tune, cpu by klotz
TheBloke/starchat-beta-GGML · Hugging Face

See https://github.com/ggerganov/ggml/tree/master/examples/starcoder for runtime

2023-06-09 Tags: starchat, starcoder, chat, gptq, llm, cpu by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0

About - Propulsed by SemanticScuttle

SemanticScuttle - klotz.me

Tags: cpu* + llm*

Linked Tags

Related Tags