SemanticScuttle - klotz.me » klotz: router+llm

klotz: router* + llm*

This section details how to load and use multiple models with the llama.cpp server. It covers configuring the server to handle multiple models, the model path format, and considerations for memory usage.

2025-12-07 Tags: llama.cpp, api, control plane, model, llm, router by klotz

LLM Council

LLM Council works together to answer your hardest questions. A local web app that uses OpenRouter to send queries to multiple LLMs, have them review/rank each other's work, and finally a Chairman LLM produces the final response.

2025-11-23 Tags: llm, ai, openai, google, anthropic, router, python, react, fastapi, karpathy, github, foss by klotz

Tutorial for Building an LLM Router for High-Quality and Cost-Effective Responses

This tutorial provides a step-by-step guide on building an LLM router to balance the use of high-quality closed LLMs like GPT-4 and cost-effective open-source LLMs, achieving high response quality while minimizing costs. The approach includes preparing labeled data, finetuning a causal LLM classifier, and offline evaluation using the RouteLLM framework.

2024-07-04 Tags: llm, router, tutorial, gpt-4, mixtral-8x7b, github by klotz

GitHub semantic router

2024-02-11 Tags: llm, semantic, router, github by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: router* + llm*

Linked Tags

Related Tags