An exploration of the new Qwen3.6-27B open-weight model, which claims flagship-level agentic coding performance, surpassing previous, much larger MoE models despite its significantly smaller size. The author tests a quantized version with llama-server and demonstrates its impressive ability to generate complex SVG graphics locally.
Key points:
- Qwen3.6-27B outperforms the older Qwen3.5-397B-A17B on coding benchmarks.
- Dramatic reduction in model size: approximately 55.6GB for the base version, down from 807GB for the previous flagship.
- Successful local execution of a 16.8GB quantized GGUF version via llama.cpp's llama-server.
- High-quality SVG generation capabilities for complex prompts like a pelican riding a bicycle.
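The local setup described above boils down to pointing llama-server at the quantized GGUF file. A rough sketch (the model filename, port, and context size below are placeholders, not the exact values the author used):

```shell
# Launch llama.cpp's llama-server with a quantized GGUF.
# The filename is a placeholder; substitute the ~16.8GB quant you downloaded.
llama-server -m qwen3.6-27b-q4_k_m.gguf --port 8080 -c 8192
```

Once running, the server exposes an OpenAI-compatible HTTP endpoint on the chosen port, so any standard client can send it chat requests.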
LlamaBarn is a macOS menu bar app for running local LLMs. It provides a simple way to install and run models locally, connecting to apps via an OpenAI-compatible API.
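Because the API is OpenAI-compatible, talking to a LlamaBarn-hosted model is just an ordinary chat-completions POST. A minimal sketch of building such a request with only the Python standard library; the endpoint path, port, and model identifier are assumptions, not values confirmed by the project:

```python
import json

# Hypothetical local endpoint exposed by an OpenAI-compatible server
# (LlamaBarn, llama-server, etc.); adjust host/port to your setup.
ENDPOINT = "http://localhost:8080/v1/chat/completions"

# Standard chat-completions request body; the model id is a placeholder —
# use whatever identifier your local server reports.
payload = {
    "model": "qwen3.6-27b",
    "messages": [
        {"role": "user",
         "content": "Generate an SVG of a pelican riding a bicycle"}
    ],
    "temperature": 0.7,
}

# Serialize for sending with any HTTP client (urllib, requests, curl...).
body = json.dumps(payload)
```

The same payload works against any of the servers mentioned here, which is the point of standardizing on the OpenAI wire format.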
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Built on llama-cpp-python, it provides a simple yet robust interface that lets users chat with models, execute structured function calls, and get structured output.