klotz: performance*

Tools or advice for measuring or improving software and system performance.

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. Explores the performance benefits of running Python without the Global Interpreter Lock (GIL), using the new experimental Python 3.13.0b4 pre-release with the --disable-gil flag. Discusses how this change can lead to faster execution times for CPU-intensive tasks in data science and machine learning.
  2. A discussion post on Reddit's LocalLLaMA subreddit about logging the output of running models and monitoring performance, specifically for debugging errors, warnings, and performance analysis. The post also mentions the need for flags to output logs as flat files, GPU metrics (GPU utilization, RAM usage, TensorCore usage, etc.) for troubleshooting and analytics.
  3. Improving the memory and computational efficiency of Large Language Models (LLMs) for handling long input sequences, including retrieval augmented questions answering, summarization, and chat tasks. It covers various techniques, such as lower precision computing, Flash Attention algorithm, positional embedding methods, and key-value caching strategies. These methods help reduce memory consumption and increase inference speeds while maintaining high accuracy levels in LLM applications. Furthermore, it highlights some advanced approaches like Multi-Query-Attention (MQA) and Grouped-Query-Attention (GQA), which further enhance computational and memory efficiency without compromising performance.
  4. 2023-11-18 Tags: , , , , by klotz
  5. 2023-10-13 Tags: , , , by klotz
  6. 2023-06-09 Tags: , , , , , by klotz
  7. 2023-04-02 Tags: , , , , by klotz
  8. Distributable streaming

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: performance

About - Propulsed by SemanticScuttle