Tags: performance*

Tools or advice for measuring or improving software and system performance.

  1. Pogocache is new open-source caching software written in C that focuses on low latency and CPU efficiency. It supports multiple protocols (Memcache, Valkey/Redis, HTTP, PostgreSQL) and claims higher throughput and lower latency than alternatives.
  2. timep is a trap-based profiler and flamegraph generator for bash code that aims to be both efficient and accurate. It maps the full call-stack tree of the bash code being profiled and can optionally use that tree to generate a FlameGraph of the profiled commands.
  3. The article details the author's investigation into slow Python tool startup times. They used Python's `-X importtime` option to identify import bottlenecks and visualized the resulting data with Kevin Michel's `python-importtime-graph` tool, which renders the import times as a dense treemap.
  4. According to the article, Pandas 3.0 will significantly boost performance by replacing NumPy with PyArrow as its default engine, enabling faster loading and reading of columnar data.
  5. LocalScore is an open benchmark to evaluate local AI task performance across various hardware configurations, measuring Prompt Processing speed, Token Generation speed, Time-to-First-Token (TTFT), and a combined LocalScore.
  6. This guide shows how to work around poor performance when running Ubuntu 24.04 on a Raspberry Pi 5 by editing the EEPROM config file to override the maximum current value.
  7. A developer recounts how Claude Code helped resolve a critical memory usage issue in an API endpoint, reducing memory usage by 99% and providing detailed solutions and evidence.
  8. The article explores the evolution of large language model (LLM) serving, highlighting significant advancements from pre-2020 frameworks to the introduction of vLLM in 2023. It discusses the challenges of efficient memory management in LLM serving and how vLLM's PagedAttention technique revolutionizes the field by reducing memory wastage and enabling better utilization of GPU resources.
    2025-02-17 by klotz
  9. A tool to estimate the memory requirements and performance of Hugging Face models based on quantization levels.
    2025-01-28 by klotz
  10. Investigation into the effect of DDR5 speed on local LLM inference speed.
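
As a rough illustration of the approach described in item 3, the sketch below runs the interpreter with `-X importtime` and ranks imports by cumulative time. It is not taken from the linked article: the helper names `parse_importtime` and `slowest_imports` are hypothetical, and the parser assumes the `import time: self [us] | cumulative | imported package` line layout that CPython writes to stderr.

```python
import subprocess
import sys


def parse_importtime(stderr_text):
    """Parse `python -X importtime` output into (name, self_us, cumulative_us) tuples."""
    rows = []
    prefix = "import time:"
    for line in stderr_text.splitlines():
        if not line.startswith(prefix):
            continue  # ignore anything else written to stderr
        parts = line[len(prefix):].split("|")
        if len(parts) != 3:
            continue
        try:
            self_us = int(parts[0])
            cumulative_us = int(parts[1])
        except ValueError:
            continue  # the header row has text instead of numbers
        rows.append((parts[2].strip(), self_us, cumulative_us))
    return rows


def slowest_imports(module, top=5):
    """Import `module` in a fresh interpreter and return its slowest imports.

    `module` is interpolated into a `-c` command, so pass only trusted names.
    """
    proc = subprocess.run(
        [sys.executable, "-X", "importtime", "-c", f"import {module}"],
        capture_output=True,
        text=True,
        check=True,
    )
    rows = parse_importtime(proc.stderr)
    # Sort by cumulative time (microseconds), largest first.
    return sorted(rows, key=lambda row: row[2], reverse=True)[:top]


if __name__ == "__main__":
    for name, self_us, cumulative_us in slowest_imports("json"):
        print(f"{cumulative_us:>8} us cumulative  {self_us:>8} us self  {name}")
```

Cumulative time includes everything a module pulls in, so a large cumulative value with a small self value usually points at a dependency further down the tree rather than the module itself.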

SemanticScuttle - klotz.me: tagged with "performance"
