Tags: cuda + machine learning

  1. This blog post details how to implement high-performance matrix multiplication with NVIDIA cuTile, focusing on tile loading, computation, and storage, and on block-level parallel programming. It also covers best practices for tile programming and performance optimization strategies.
  2. PygmalionAI's large-scale inference engine, designed to serve Pygmalion models to a large number of users at high speed. It integrates work from projects such as vLLM, TensorRT-LLM, xFormers, AutoAWQ, AutoGPTQ, SqueezeLLM, Exllamav2, TabbyAPI, AQLM, KoboldAI, Text Generation WebUI, and Megatron-LM.
    2024-06-21 by klotz
  3. Lambda Stack is an all-in-one package that provides a one-line installation and a managed upgrade path for deep learning and AI software, so that you always have up-to-date versions of PyTorch, TensorFlow, CUDA, cuDNN, and NVIDIA drivers.
    2021-04-13 by klotz
