This paper introduces KVTC, a lightweight transform coder designed to compress key-value (KV) caches, which are crucial for efficient large language model (LLM) serving. KV caches enable reuse across conversation turns, but can consume significant GPU memory. KVTC addresses this by applying techniques from classical media compression – PCA-based decorrelation, adaptive quantization, and entropy coding – to reduce cache size without requiring changes to the underlying model. The authors demonstrate that KVTC achieves up to 20x compression while maintaining reasoning accuracy and long-context performance, and even higher compression for specific applications.
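The transform-coding recipe the summary names — decorrelate with PCA, quantize, then entropy-code — can be illustrated in a few lines of NumPy. This is a generic sketch of the classical pipeline, not KVTC's actual implementation; the data, quantization step, and function names are all assumptions for illustration.

```python
import numpy as np

def fit_pca(X):
    """Return the mean and principal axes of the rows of X."""
    mu = X.mean(axis=0)
    # Right singular vectors of the centered data form the
    # decorrelating (PCA) basis.
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, Vt

def quantize(coeffs, step):
    """Uniform scalar quantization: round to the nearest multiple of step."""
    return np.round(coeffs / step).astype(np.int32)

def dequantize(q, step):
    return q.astype(np.float64) * step

rng = np.random.default_rng(0)
# Toy stand-in for correlated cache vectors: 256 rows of dimension 8
# that really live near a 2-D subspace, so PCA concentrates the energy.
base = rng.normal(size=(256, 2))
X = base @ rng.normal(size=(2, 8)) + 0.01 * rng.normal(size=(256, 8))

mu, Vt = fit_pca(X)
coeffs = (X - mu) @ Vt.T           # decorrelated coefficients
q = quantize(coeffs, step=0.1)     # small integers; most are near zero
# In a real coder, q would now be entropy-coded (e.g. with a range coder);
# here we only verify the lossy round trip.
X_hat = dequantize(q, 0.1) @ Vt + mu

err = np.max(np.abs(X - X_hat))
print(f"max reconstruction error: {err:.3f}")
```

The point of the decorrelation step is that after projecting onto the PCA basis, most coefficients are small and highly compressible by the entropy coder, while the reconstruction error stays bounded by the quantization step.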
OpenZL is a new open-source data compression framework that offers lossless compression for structured data, achieving performance comparable to specialized compressors by applying configurable transforms that reveal hidden order in the data.
A detailed analysis of the DeepMind/Meta study: how large language models achieve unprecedented compression rates on text, image, and audio data, and what these results imply.
ffmpeg -i demo.mov -c:v libx264 demo.mp4