NVIDIA CUDA 13.1 introduces CUDA Tile, a tile-based programming model, along with performance gains across developer tools and libraries. It also exposes green contexts through the runtime API and ships a rewritten CUDA programming guide.
CUDA Tile is a new Python package that simplifies GPU programming by automatically tiling loops, handling data transfers, and optimizing memory access. It lets developers write concise, readable code that exploits the full power of NVIDIA GPUs without manually managing the complexities of parallel programming.
This article details the integration of Docker Model Runner with the NVIDIA DGX Spark, enabling faster and simpler local AI model development. It covers setup, usage, and benefits like data privacy, offline availability, and ease of customization.
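With Docker Model Runner, pulling and prompting a model locally is a short loop. A minimal sketch, assuming the `docker model` CLI plugin is installed; the model name is just an example from Docker's `ai/` namespace:

```
# Pull a model once, then run prompts against it locally
# (model name is an example; substitute any model from the "ai/" namespace)
docker model pull ai/smollm2
docker model run ai/smollm2 "Summarize what a DGX Spark is in one sentence."
```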
Canonical announced today that it will formally support the NVIDIA CUDA toolkit and make it available through the Ubuntu repositories. The goal is to simplify CUDA installation and usage on Ubuntu, particularly given the rise of AI development.
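If the toolkit ships in the Ubuntu archive as announced, installation should reduce to the usual apt flow. A sketch only; the package name is an assumption borrowed from NVIDIA's existing meta-package naming and may differ in Canonical's packaging:

```
# Assumed flow once CUDA lands in the Ubuntu repositories;
# the exact package name may differ from NVIDIA's "cuda-toolkit" meta-package
sudo apt update
sudo apt install cuda-toolkit
```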
A user, nicholasdavidroberts, expresses gratitude to Daniel for providing a PPA and a patched 390 driver that resolved their NVIDIA driver compilation issues on Ubuntu 22.04 with kernel 6.5.0-14.
```
# Install gcc-12; execute_with_retries is the surrounding script's
# wrapper that retries transient apt failures
execute_with_retries apt-get install -y -qq gcc-12
# Register both compilers with the alternatives system (priorities 11 and 12)
update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-11 11
update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-12 12
# Pin gcc-12 as the default gcc regardless of priority
update-alternatives --set gcc /usr/bin/gcc-12
```
Learn how GPU acceleration can significantly speed up JSON processing in Apache Spark, reducing runtime and costs for enterprise data applications.
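The usual route to this is the RAPIDS Accelerator for Apache Spark, which can offload supported operators, JSON parsing among them, to the GPU. A minimal sketch; the jar path, version, and application script are placeholders:

```
# Launch a Spark job with the RAPIDS Accelerator plugin enabled;
# jar path/version and the application script are placeholders
spark-submit \
  --jars /opt/sparkRapidsPlugin/rapids-4-spark.jar \
  --conf spark.plugins=com.nvidia.spark.SQLPlugin \
  --conf spark.rapids.sql.enabled=true \
  json_etl_job.py
```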
The article discusses the competition Nvidia faces from Intel and AMD in the GPU market. Although these competitors have introduced accelerators that match or surpass Nvidia's offerings in memory capacity, performance, and price, Nvidia retains a strong advantage through its CUDA software ecosystem: the effort required to port and optimize existing code has long deterred developers from switching hardware. Both Intel and AMD now offer tools to ease that transition, such as AMD's HIPIFY and Intel's SYCL. More importantly, the article notes that most developers now write higher-level code against frameworks like PyTorch, which can run on different hardware with varying degrees of support and performance. This shift toward higher-level frameworks has eroded the impact of Nvidia's CUDA moat, though ensuring compatibility and performance across hardware platforms remains a challenge.
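To make the porting-tool point concrete, AMD's HIPIFY can translate CUDA sources to HIP in a single pass, after which AMD's toolchain compiles the result. A minimal sketch, with file names as placeholders:

```
# Translate a CUDA source file to HIP (file names are placeholders),
# then build the result with AMD's HIP compiler
hipify-perl vector_add.cu > vector_add.hip.cpp
hipcc vector_add.hip.cpp -o vector_add
```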
PygmalionAI's large-scale inference engine, designed to serve Pygmalion models to many users at blazing-fast speeds. Integrates work from projects like vLLM, TensorRT-LLM, xFormers, AutoAWQ, AutoGPTQ, SqueezeLLM, Exllamav2, TabbyAPI, AQLM, KoboldAI, Text Generation WebUI, and Megatron-LM.
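Engines in this family typically serve models over an OpenAI-compatible HTTP API (vLLM, which it builds on, does the same). A sketch; the port and model id are placeholders for whatever your server is configured with:

```
# Query a locally served model over an OpenAI-compatible endpoint;
# port and model id are placeholders for your deployment
curl http://localhost:2242/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "pygmalion-2-13b", "prompt": "Hello!", "max_tokens": 64}'
```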
Lambda Stack is an all-in-one package that provides a one-line installation and a managed upgrade path for deep learning and AI software, ensuring you always have up-to-date versions of PyTorch, TensorFlow, CUDA, cuDNN, and NVIDIA drivers.
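After any such managed install or upgrade, a quick sanity check confirms the pieces line up; this assumes the stack's CUDA-enabled PyTorch build is present:

```
# Sanity check: driver visible, and PyTorch built against CUDA
nvidia-smi
python3 -c "import torch; print(torch.__version__, torch.version.cuda, torch.cuda.is_available())"
```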
I think this is why cuda-12 doesn't work with podman 3.4.4 on Ubuntu 22.04; the fixes are sketched after the list:
- Rootless configuration for the nvidia container runtime
- Set up the missing hook for the nvidia container runtime
- Increase memlock and stack ulimits
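A sketch of what those three fixes can look like with the nvidia-container-toolkit packaging on Ubuntu 22.04; the config path, hook location, and image tag are assumptions to verify on your system:

```
# 1) Rootless: stop nvidia-container-runtime from managing cgroups
sudo sed -i 's/^#\?no-cgroups = false/no-cgroups = true/' /etc/nvidia-container-runtime/config.toml
# 2) Missing hook: confirm the NVIDIA prestart hook is where podman looks
ls /usr/share/containers/oci/hooks.d/oci-nvidia-hook.json
# 3) Raised limits: pass memlock/stack ulimits on each container run
podman run --rm \
  --hooks-dir=/usr/share/containers/oci/hooks.d \
  --ulimit memlock=-1:-1 --ulimit stack=67108864:67108864 \
  docker.io/nvidia/cuda:12.0.0-base-ubuntu22.04 nvidia-smi
```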