SemanticScuttle - klotz.me

klotz: proxy*

How to Host Your Own Website with Docker and Nginx Proxy Manager

This article details how to host a personal website using Docker, Nginx Proxy Manager, and Ghost, offering a self-hosted alternative to paid hosting services.

2025-04-20 Tags: docker, nginx, proxy, self-hosting, website, linux, ddns, cloudflare, reverse proxy, homelab by klotz

katanemo/archgw

Arch is an intelligent gateway for agents, designed to securely handle prompts, integrate with APIs, and provide rich observability, built on Envoy Proxy.

The ArchGW project focuses on simplifying the development of **agentic applications** – applications powered by Large Language Models (LLMs) that can perform actions and interact with tools. Here's a breakdown of the use cases and examples highlighted:

**Core Use Cases:**

* **Routing:** Intelligent routing of prompts to the correct agents or tools.
* **Tools Use:** Simplifying the integration of prompts with tools/APIs for common tasks.
* **Guardrails:** Centralized configuration for safety and preventing harmful outcomes.
* **LLM Access:** Centralized access and management of LLMs with retries for reliability.
* **Observability:** Providing W3C-compatible tracing and metrics for monitoring LLM interactions.

**Specific Examples & Demos:**

* **Weather Forecast Agent:** A sample application demonstrating core function calling capabilities.
* **Network Operator Agent:** An agent that can interact with network devices (retrieve statistics, reboot).
* **Connecting to SaaS APIs:** Demonstrates integrating 3rd party SaaS APIs into agentic chat experiences.
* **LLM Router:** Using Arch as a gateway to route requests to different LLMs (GPT-4o, Mistral) based on configuration or headers. The example shows how to switch between LLMs using the `x-arch-llm-provider-hint` header.
* **Currency Exchange Agent:** A quickstart guide builds an agent that fetches currency exchange rates from an API (Frankfurter.app). This demonstrates setting up configuration files, starting the gateway, and interacting with the agent via curl.

**Overall, ArchGW aims to address common challenges in building agentic apps:**

* Managing complex routing logic.
* Integrating with various LLMs and tools.
* Ensuring safety and reliability.
* Providing observability into LLM interactions.

2025-04-28 Tags: llm, proxy, github, katanemo, archgw, inference engineering by klotz

BerriAI/litellm README.md

LiteLLM is a library to deploy and manage LLM (Large Language Model) APIs using a standardized format. It supports multiple LLM providers, includes proxy server features for load balancing and cost tracking, and offers various integrations for logging and observability.

2024-10-23 Tags: litellm, llm, api, proxy, logging by klotz

large-model-proxy

Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources.

2024-07-22 Tags: llm, proxy, llama.cpp, github, golang by klotz

Tuning Language Models by Proxy

Introduces proxy-tuning, a lightweight decoding-time algorithm that operates on top of black-box LMs to achieve the same end as direct tuning. The method tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the larger untuned model in the direction of tuning, while retaining the benefits of larger-scale pretraining.

2024-05-11 Tags: proxy, fine tuning, llm, llama2-70b by klotz

Improve LLMs with Proxy Tuning

In this tutorial, learn how to improve the performance of large language models (LLMs) by utilizing a proxy tuning approach, which enables more efficient fine-tuning and better integration with the AI model.

2024-05-11 Tags: llm, proxy, tuning, fine-tuning by klotz

Proxy Fine-Tuning LLMs

- Proxy fine-tuning is a method to improve large pre-trained language models without directly accessing their weights.
- It operates on top of black-box LLMs by utilizing only their predictions.
- The approach combines elements of retrieval-based techniques, fine-tuning, and domain-specific adaptations.
- Proxy fine-tuning can be used to achieve the performance of heavily-tuned large models by only tuning smaller models.

2024-05-11 Tags: proxy, fine-tuning, llm, retrieval-augmented generation, domain-specific adaptations, data delivery, rag, catastrophic forgetting, drift by klotz

NodeMaven: The First Proxy Provider Prioritizing IP Quality

NodeMaven is a proxy service that prioritizes IP quality, ensuring high-quality residential proxies with clean records, super sticky sessions, and unmatched customer support. NodeMaven's advanced proxy filtering system screens IPs in real-time, and the provider offers access to 5+ million premium residential IPs, city-based targeting, unlimited concurrent sessions, and industry-expert level support.

2024-04-30 Tags: proxy, saas by klotz

GitHub - aseigneurin/spark-ui-proxy: Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall

2019-08-13 Tags: spark, proxy by klotz

Introduction to modern network load balancing and proxying

2018-10-16 Tags: istio, load balancing, proxy, medium, kubernetes, envoy, production engineering by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: proxy*

Linked Tags

Related Tags