Running GenAI models is easy. Scaling them to thousands of users, not so much. This guide outlines strategies for scaling AI workloads from proof of concept to production-ready deployment, covering API integration, on-prem deployment considerations, hardware requirements, and tools such as vLLM and NVIDIA NIM microservices.
This guide provides an introduction to kubectl, the command-line tool used to communicate with the Kubernetes API. It covers command syntax, useful commands, flags, and tips and tricks. It also discusses the ecosystem of plugins and tools built to extend the functionality of kubectl and Kubernetes.
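As a taste of the command syntax the kubectl guide covers, most invocations follow one general form (the specific resource names below are illustrative):

```
kubectl [command] [TYPE] [NAME] [flags]

# Examples of the pattern:
#   command = get, TYPE = pods, flag scopes to a namespace
kubectl get pods --namespace kube-system

#   command = describe, TYPE = deployment, NAME = my-app (hypothetical)
kubectl describe deployment my-app

#   command = logs, with a flag to follow output
kubectl logs my-pod --follow
```

Here `command` is the operation (get, describe, delete, apply, ...), `TYPE` is the resource kind, `NAME` narrows the operation to a single resource, and flags modify behavior.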