Nvidia's DGX Spark is a relatively affordable AI workstation that prioritizes capacity over raw speed, enabling it to run models that consumer GPUs cannot. It features 128GB of memory and is based on the Blackwell architecture.
Nvidia introduces the Rubin CPX GPU, designed to accelerate AI inference by decoupling the context and generation phases. It utilizes GDDR7 memory for lower cost and power consumption, aiming to redefine AI infrastructure.
Canonical announced today that they will formally support the NVIDIA CUDA toolkit and also make it available via the Ubuntu repositories. This aims to simplify CUDA installation and usage on Ubuntu, particularly with the rise of AI development.
Nvidia has expanded its Jetson lineup with the Jetson AGX Thor Developer Kit, a compact platform that carries the new Jetson T5000 system-on-module. Marketed as a developer system, the dimensions and form factor place it firmly in the realm of a mini PC, although its design and purpose align more with edge AI deployment than home computing.
This blog post details a fine-tuning workflow for the gpt-oss model that recovers post-training accuracy while retaining the performance benefits of FP4. It involves supervised fine-tuning (SFT) on an upcasted BF16 version of the model, followed by quantization-aware training (QAT) using NVIDIA TensorRT Model Optimizer. The article also discusses the benefits of using NVFP4 for even better convergence and accuracy recovery.
Nvidia’s NeMo Retriever models and RAG pipeline make quick work of ingesting PDFs and generating reports based on them. Chalk one up for the plan-reflect-refine architecture.
This comment details a workaround for nvidia-driver-390 on Ubuntu systems with kernel 6.5.0. It links to related bug reports and provides instructions to add a PPA and install updated drivers.
```
sudo add-apt-repository ppa:dtl131/nvidiaexp
sudo apt update
sudo apt install nvidia-drivers-390
```
A user, nicholasdavidroberts, expresses gratitude to Daniel for providing a PPA and patched 390 driver that resolved their NVIDIA driver compilation issues on Ubuntu 22.04 with kernel 6.5.0-14.
```
execute_with_retries apt-get install -y -qq gcc-12
update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-11 11
update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-12 12
update-alternatives --set gcc /usr/bin/gcc-12
```