SemanticScuttle - klotz.me » klotz: performance

klotz: performance*

Tools or advice for measuring or improving software and system performance.

This article explores the challenges and possibilities of writing portable and efficient SIMD code in Rust, aiming for a "fearless SIMD" approach with high-level, safe, and composable primitives.

2025-11-01 Tags: simd, rust, performance, vectorization, fearless concurrency, intrinsics, avx, sse, compiler, optimization, raph levien’s blog by klotz

How I Built Lightning-Fast Vector Search for Legal Documents

This article details the process of building a fast vector search system for a large legal dataset (Australian High Court decisions). It covers choosing embedding providers, performance benchmarks, using USearch and Isaacus embeddings, and the importance of API terms of service. It focuses on achieving speed and scalability while maintaining reasonable accuracy.

2025-10-21 Tags: vector search, embeddings, legal documents, usearch, isaacus, performance, scalability, nlp, information retrieval, rag by klotz

Polars vs pandas: What's the Difference?

This tutorial compares Polars and pandas, covering syntax, performance, LazyFrames, conversions, and plotting to help you choose the right library for your data analysis needs.

2025-10-16 Tags: polars, pandas, data analysis, dataframes, performance, lazyframes, python, data science by klotz

guide : running gpt-oss with llama.cpp · Discussion #15396

A detailed guide for running the new gpt-oss models locally with the best performance using `llama.cpp`. The guide covers a wide range of hardware configurations and provides CLI argument explanations and benchmarks for Apple Silicon devices.

2025-10-04 Tags: llama.cpp, gpt-oss, large language model, inference, apple silicon, benchmarks, performance, gguf by klotz

kitty

The fast, feature-rich, GPU based terminal emulator. It's capable, scriptable, composable, cross-platform, and innovative.

2025-09-04 Tags: python, terminal, emulator, kitty, gpu, shell, scriptable, cross-platform, performance by klotz

Pogocache: Open Source Caching Software with Low Latency and Multiple Wire Protocols

Pogocache is a new open-source caching software focusing on low latency and CPU efficiency. It supports multiple protocols (Memcache, Valkey/Redis, HTTP, PostgreSQL) and claims better throughput and lower latency than alternatives. It's written in C and designed for high performance and scalability.

2025-08-31 Tags: caching, pogocache, open source, low latency, performance, redis, memcache, http, postgresql by klotz

timep

timep is an efficient and accurate state-of-the-art trap-based profiler and flamegraph generator for bash code. It maps the full call-stack tree for the bash code being profiled, and (optionally) uses that call-stack tree to generate a FlameGraph of the profiled bash commands!

2025-07-18 Tags: bash, profile, performance, profiler, timing, flamegraph, jkool702, github by klotz

python-importtime-graph

The article details the author's investigation into slow Python tool startup times. They used the `python -X importtime` feature to identify import bottlenecks and visualized the resulting data using Kevin Michel's `python-importtime-graph` tool, revealing a dense treemap of import times.

2025-06-21 Tags: performance, python, visualization, import, simon willison by klotz

Python Pandas Ditches NumPy for Speedier PyArrow

Pandas 3.0 will significantly boost performance by replacing NumPy with PyArrow as its default engine, enabling faster loading and reading of columnar data.

2025-05-27 Tags: python, pandas, numpy, pyarrow, data analysis, performance, machine learning by klotz

LocalScore

LocalScore is an open benchmark to evaluate local AI task performance across various hardware configurations, measuring Prompt Processing speed, Token Generation speed, Time-to-First-Token (TTFT), and a combined LocalScore.

2025-04-17 Tags: llm, benchmark, performance, gpu, cpu, inference, localscore by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: performance*

Linked Tags

Related Tags