SemanticScuttle - klotz.me

klotz: cpu*

Apple unleashes M5, the next big leap in AI performance for Apple silicon

M5 delivers over 4x the peak GPU compute performance for AI compared to M4, featuring a next-generation GPU with a Neural Accelerator in each core, a more powerful CPU, a faster Neural Engine, and higher unified memory bandwidth.

2025-10-15 Tags: m5, apple, llm, gpu, cpu, neural engine, macbook pro, ipad pro, apple vision pro by klotz

My mind was blown: running a 120B parameter AI model on a budget GPU at home

A 120 billion parameter OpenAI model can now run on consumer hardware thanks to the Mixture of Experts (MoE) technique, which significantly reduces memory requirements and allows processing on CPUs while offloading key parts to modest GPUs.

2025-08-21 Tags: llm, mixture of experts, 120b, gpu, cpu, openai, gpt-oss-120b by klotz

AES-NI CPU Support

The article shows how to check if a Linux CPU supports AES‑NI, Intel’s hardware‑accelerated AES instruction set. It explains what AES‑NI is, why it speeds up encryption, and then lists three easy methods: use cpuid and grep for “aes”, grep the /proc/cpuinfo file, or run lscpu and look for the “aes” flag. If none of these commands report AES‑NI, the CPU relies on slower software encryption, which is still secure. The first CPUs to expose this feature were Intel’s Westmere chips in 2010. In the CPUID specification the flag is simply called AES (bit 25 of ECX). The “NI” (New Instructions) part is just a marketing name for the feature set. There isn’t a distinct “aes_ni” bit in the CPUID leaf. So, when you run <tt>lscpu | grep -i aes or cat /proc/cpuinfo | grep aes</tt>, the presence of aes tells you that the CPU supports AES‑NI. There is no separate aes_ni flag because the kernel already uses the more concise aes.

2025-08-20 Tags: aes, aes-ni, cpu, hardware, intel by klotz

LocalScore

LocalScore is an open benchmark to evaluate local AI task performance across various hardware configurations, measuring Prompt Processing speed, Token Generation speed, Time-to-First-Token (TTFT), and a combined LocalScore.

2025-04-17 Tags: llm, benchmark, performance, gpu, cpu, inference, localscore by klotz

NVIDIA DGX Spark

NVIDIA DGX Spark is a desktop-friendly AI supercomputer powered by the NVIDIA GB10 Grace Blackwell Superchip, delivering 1000 AI TOPS of performance with 128GB of memory. It is designed for prototyping, fine-tuning, and inference of large AI models.

2025-03-24 Tags: machine learning, nvidia, dgx spark, llm, grace blackwell, ai development, inference, data science, gpu, cpu by klotz

6502.sh - A 6502 emulator written in busybox ash

6502.sh is a 6502 emulator and debugger written in busybox ash compliant shell script, featuring 32k RAM, 16k ROM, an interactive debugger, and STDIO directed to an ACIA compatible serial port.

2025-03-17 Tags: 6502, cpu, emulator, busybox, ash, shell, t, codebert, foss by klotz

A 6502, In The Shell

A 6502 system emulated in a busybox ash shell script, featuring RAM, ROM, and an emulated serial port on STDIO, with built-in monitor and debugger.

2025-03-17 Tags: 6502, cpu, shell, emulator, hacks, hackaday, bash by klotz

GGUF Quantization with Imatrix and K-Quantization to Run LLMs on Your CPU

This article explains how to accurately quantize a Large Language Model (LLM) and convert it to the GGUF format for efficient CPU inference. It covers using an importance matrix (imatrix) and K-Quantization method with Gemma 2 Instruct as an example, while highlighting its applicability to other models like Qwen2, Llama 3, and Phi-3.

2024-09-14 Tags: gguf, quantization, llm, cpu, inference, imatrix by klotz

6809 CPU Manual

2024-01-18 Tags: 6809, cpu by klotz

PowerInfer - High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

PowerInfer is a CPU/GPU LLM inference engine leveraging activation locality for your device.

2026-01-13 Tags: llm, serving, cpu, gpu, github by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: cpu*

Linked Tags

Related Tags