klotz: gpt-oss* + openai*

  1. An in-depth look at the architecture of OpenAI's GPT-OSS models, detailing tokenization, embeddings, transformer blocks, Mixture of Experts, attention mechanisms (GQA and RoPE), and quantization techniques.
  2. OpenAI's release of gpt-oss marks its first open-weight LLM since GPT-2, featuring improvements in reasoning, tool usage, and problem-solving capabilities. The article explores its architecture, message formatting, reasoning modes, and tokenizer details.
  3. A user demonstrates how to run a 120B-parameter model on hardware with only 8 GB of VRAM by offloading the MoE expert layers to the CPU and keeping only the attention layers on the GPU, achieving good performance with minimal VRAM usage (a hedged launch sketch follows this list).
  4. OpenAI releases gpt-oss-120b and gpt-oss-20b, two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. They outperform similarly sized open models on reasoning tasks and are optimized for efficient deployment.
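
The expert-offload setup referenced in item 3 is most commonly done with llama.cpp, which can pin individual tensors to a backend by name. The sketch below launches llama-server from Python with all layers assigned to the GPU and the per-expert feed-forward tensors overridden back to the CPU; the binary name, GGUF filename, context size, and the exact tensor-name pattern are assumptions based on typical llama.cpp usage, not details confirmed by the bookmarked post.

```python
# Minimal sketch: run a large MoE GGUF with attention layers on the GPU and the
# per-expert FFN weights kept in system RAM (assumed llama.cpp flags and paths).
import subprocess

cmd = [
    "llama-server",                  # assumes llama.cpp's server binary is on PATH
    "-m", "gpt-oss-120b.gguf",       # hypothetical GGUF file
    "--n-gpu-layers", "999",         # request every layer on the GPU by default...
    "--override-tensor", ".ffn_.*_exps.=CPU",  # ...but keep expert FFN tensors on the CPU
    "--ctx-size", "8192",            # modest context to stay within a small VRAM budget
]
subprocess.run(cmd, check=True)
```

Because only a few experts are active per token, streaming the expert weights from system RAM costs far less than it would for a dense 120B model, which is why the attention-on-GPU / experts-on-CPU split keeps VRAM usage in the single-digit-gigabyte range.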
