SemanticScuttle - klotz.me » klotz: tailscale

klotz: tailscale*

Using a local LLM in OpenCode with llama.cpp

A comprehensive technical guide on setting up a high-performance local large language model environment for agentic coding tasks. The author demonstrates how to run a quantized Qwen3.5-27B model on a remote RTX 4090 workstation and access it from a MacBook using Tailscale, integrating the setup with OpenCode and Codex.
Key topics include:
* Step-by-step llama.cpp build configuration for CUDA support.
* Using Tailscale to create a secure network between client and GPU machine.
* Optimizing VRAM usage through specific quantization (UD-Q4_K_XL) and context size management.
* Implementing a corrected chat template to prevent tool-calling errors in agentic workflows.
* Performance insights regarding hybrid architectures and KV cache precision.

2026-04-11 Tags: llama.cpp, opencode, qwen3.5, local llm, rtx 4090, tailscale, coding assistant, gguf by klotz

Setting up a reverse proxy was easier than every guide made it seem

This article describes the author's experience setting up reverse proxies for self-hosted services. The process turned out to be surprisingly straightforward despite extensive and often overwhelming documentation. It compares several popular options (Nginx, Traefik, Caddy, Envoy, SWAG, and HAProxy) and ultimately recommends Caddy for its simplicity and features. It also notes that setting up a reverse proxy is relatively easy compared to configuring the services it fronts.

2026-02-16 Tags: reverse proxy, self-hosting, nginx, traefik, caddy, envoy, haproxy, docker, kubernetes, home network, pangolin, tailscale by klotz

Using Codex CLI with gpt-oss:120b on an NVIDIA DGX Spark via Tailscale

This article details how the author ran OpenAI's Codex CLI against a gpt-oss:120b model hosted on an NVIDIA DGX Spark and accessed over a Tailscale network. It covers Tailscale setup, Ollama configuration, and running the Codex CLI against the remote model, which the author uses to build a Space Invaders game.

2025-11-07 Tags: llm, codex, gpt-oss, nvidia dgx spark, tailscale, ollama, ai, large language model, space invaders by klotz
