A technical article explaining how a small change in async Python code—using a semaphore to limit concurrency—reduced LLM request volume and costs by 90% without sacrificing performance.
A 100-line minimalist LLM framework for Agents, Task Decomposition, RAG, etc. It models the LLM workflow as a Graph + Shared Store: nodes handle simple tasks, actions (labeled edges) connect nodes to build agents, and flows orchestrate nodes for task decomposition.
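The Graph + Shared Store model it describes can be sketched in plain Python: each node does one small task on a shared dict and returns an action string, and a flow follows action-labeled edges from node to node. The class and method names below are illustrative, not the framework's actual API.

```python
class Node:
    """One small task in the graph; returns an action naming the next edge."""
    def __init__(self):
        self.successors = {}        # action name -> next Node

    def next(self, node, action="default"):
        self.successors[action] = node
        return node

    def run(self, shared):          # override: do work, return an action
        return "default"

class Flow:
    """Walks the graph from a start node, following returned actions."""
    def __init__(self, start):
        self.start = start

    def run(self, shared):
        node = self.start
        while node is not None:
            action = node.run(shared)
            node = node.successors.get(action)  # stop when no edge matches
        return shared

# Example: decompose a summarization task into two nodes via the shared store.
class LoadDoc(Node):
    def run(self, shared):
        shared["text"] = "hello world from a long document"
        return "default"

class Summarize(Node):
    def run(self, shared):
        shared["summary"] = shared["text"][:11]  # stand-in for an LLM call
        return "default"

load, summarize = LoadDoc(), Summarize()
load.next(summarize)
shared = Flow(load).run({})
```

Because nodes only communicate through the shared store and actions, an agent is just a node whose returned action selects among several outgoing edges.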