SemanticScuttle - klotz.me » klotz: llm+cpu

klotz: llm* + cpu*

NVIDIA DGX Spark

NVIDIA DGX Spark is a desktop AI supercomputer powered by the NVIDIA GB10 Grace Blackwell Superchip, delivering up to 1,000 AI TOPS with 128 GB of unified memory. It is designed for prototyping, fine-tuning, and inference of large AI models.

2025-03-24 Tags: machine learning, nvidia, dgx spark, llm, grace blackwell, ai development, inference, data science, gpu, cpu by klotz

GGUF Quantization with Imatrix and K-Quantization to Run LLMs on Your CPU

This article explains how to accurately quantize a Large Language Model (LLM) and convert it to the GGUF format for efficient CPU inference. It covers using an importance matrix (imatrix) with the K-Quantization method, taking Gemma 2 Instruct as the example, and notes that the same approach applies to other models such as Qwen2, Llama 3, and Phi-3.

2024-09-14 Tags: gguf, quantization, llm, cpu, inference, imatrix by klotz

PowerInfer - High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

2023-12-24 Tags: llm, serving, cpu, gpu, github by klotz

How to Fine-Tune Llama2 for Python Coding | Towards Data Science

2023-08-28 Tags: llama-2, llm, fine tuning, cpu by klotz

Fine-Tune Your LLM Without Maxing Out Your GPU | by John Adeojo | Jul, 2023 | Towards Data Science

2023-08-03 Tags: llm, gpu, cpu, fine-tune by klotz

Llama 2 on CPU: inference for document Q&A

2023-07-22 Tags: llama-2, llm, cpu, inference, document, q-and a, langchain by klotz

Hugging Face: Getting Started

2023-06-25 Tags: huggingface, llm, python, cpu by klotz

Falcon - A guide to finetune and inference - Lightning AI

2023-06-14 Tags: falcon, llm, fine-tune, cpu by klotz

TheBloke/starchat-beta-GGML · Hugging Face

See https://github.com/ggerganov/ggml/tree/master/examples/starcoder for runtime

2023-06-09 Tags: starchat, starcoder, chat, gptq, llm, cpu by klotz

LocalAI

LocalAI is an open-source project that provides a local alternative to the OpenAI API, letting users run large language models (LLMs) without relying on cloud services or paying usage fees. It supports various open LLMs, such as GPT-Neo and GPT-J, and also provides a graphical user interface (GUI) for interaction and customization.

2023-10-16 Tags: localai, github, llm, cpu by klotz
