Tags: gpu* + nvidia*


  1. This article details the journey of deploying an on-premise Large Language Model (LLM) server, focusing on security considerations. It explores the rationale behind on-premise deployment for privacy and data control, outlining the goals of creating an air-gapped, isolated infrastructure. The authors delve into the hardware selection process, choosing components like an Nvidia RTX Pro 6000 Max-Q for its memory capacity. The deployment process starts with a minimal setup using llama.cpp, then progresses to containerization with Podman and the use of CDI for GPU access. Finally, the article discusses hardening techniques, including kernel module management and file permission restrictions, to minimize the attack surface and enhance security.
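A rough illustration of the article's "minimal setup with llama.cpp" stage is sketched below using the llama-cpp-python bindings with full GPU offload; the model path, context size, and prompt are placeholders rather than the configuration used in the article.

```python
# Minimal local-inference sketch with llama-cpp-python (built with CUDA support).
# Model path and settings are illustrative placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="models/llama-3-8b-instruct.Q4_K_M.gguf",  # hypothetical GGUF file
    n_gpu_layers=-1,   # offload every layer to the GPU
    n_ctx=4096,        # context window
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize why on-prem LLM hosting helps privacy."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```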
  2. >The method, called KV Cache Transform Coding (KVTC), applies ideas from media compression formats like JPEG to shrink the key-value cache behind multi-turn AI systems, lowering GPU memory demands and speeding up time-to-first-token by up to 8x.
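The quoted description does not give the algorithm, but the JPEG-style transform-coding idea it alludes to can be shown in toy form: apply a decorrelating transform (here a DCT), quantize the coefficients coarsely, and keep only the integers. This is a sketch of the general concept, not the KVTC method itself; the tensor shape and quantization step are arbitrary.

```python
# Toy transform-coding of a stand-in KV-cache block: DCT -> uniform quantization -> inverse.
# Illustrates the JPEG-like idea only; this is NOT the KVTC algorithm.
import numpy as np
from scipy.fft import dct, idct

kv = np.random.randn(32, 128).astype(np.float32)   # stand-in for one layer's K or V block

coeffs = dct(kv, axis=-1, norm="ortho")             # decorrelate along the feature dimension
step = 0.1                                          # quantization step: compression vs. quality trade-off
q = np.round(coeffs / step).astype(np.int16)        # coarse integer coefficients (what would be stored)

recon = idct(q.astype(np.float32) * step, axis=-1, norm="ortho")
print("max abs reconstruction error:", float(np.abs(recon - kv).max()))
```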
  3. The RTX 3090 offers a compelling combination of performance and 24GB of VRAM, making it a better choice for local LLM and AI workloads than newer Nvidia Blackwell GPUs such as the RTX 5070 and even the RTX 5080, whose smaller VRAM pools and pricing work against them; a rough VRAM sizing estimate follows below.
    2026-02-07 by klotz
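The sizing sketch below shows why the 24 GB figure matters for local models: weight storage alone for mid-sized models at common quantization levels. The bytes-per-parameter values are rules of thumb, not measurements from the article, and real usage adds KV cache and runtime overhead.

```python
# Back-of-the-envelope VRAM needed just for model weights at different quantization levels.
# Bytes-per-parameter figures are approximate rules of thumb.
bytes_per_param = {"fp16": 2.0, "q8_0": 1.07, "q4_k_m": 0.57}

for params_b in (8, 14, 32, 70):                      # model sizes in billions of parameters
    for fmt, bpp in bytes_per_param.items():
        gib = params_b * 1e9 * bpp / 2**30
        print(f"{params_b:>3}B {fmt:>7}: ~{gib:5.1f} GiB")
```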
  4. CUDA Tile is a new Python package that simplifies GPU programming by automatically tiling loops, handling data transfer, and optimizing memory access. It allows developers to write concise and readable code that leverages the full power of NVIDIA GPUs without needing to manually manage the complexities of parallel programming.
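For contrast, the sketch below shows the kind of hand-written shared-memory tiling that such a package is meant to automate. It is written with Numba's CUDA bindings, not the CUDA Tile API; the tile size, matrix sizes, and kernel structure are the standard textbook pattern rather than anything from the article.

```python
# Manual tiled matrix multiply via Numba CUDA -- the boilerplate CUDA Tile aims to remove.
import numpy as np
from numba import cuda, float32

TILE = 16  # threads per block edge; also the tile size staged in shared memory

@cuda.jit
def tiled_matmul(A, B, C):
    # Per-block staging buffers in fast shared memory.
    sA = cuda.shared.array(shape=(TILE, TILE), dtype=float32)
    sB = cuda.shared.array(shape=(TILE, TILE), dtype=float32)

    tx = cuda.threadIdx.x
    ty = cuda.threadIdx.y
    row = cuda.blockIdx.y * TILE + ty
    col = cuda.blockIdx.x * TILE + tx

    acc = float32(0.0)
    for t in range((A.shape[1] + TILE - 1) // TILE):
        # Each thread cooperatively loads one element of the A and B tiles.
        if row < A.shape[0] and t * TILE + tx < A.shape[1]:
            sA[ty, tx] = A[row, t * TILE + tx]
        else:
            sA[ty, tx] = 0.0
        if t * TILE + ty < B.shape[0] and col < B.shape[1]:
            sB[ty, tx] = B[t * TILE + ty, col]
        else:
            sB[ty, tx] = 0.0
        cuda.syncthreads()
        for k in range(TILE):
            acc += sA[ty, k] * sB[k, tx]
        cuda.syncthreads()

    if row < C.shape[0] and col < C.shape[1]:
        C[row, col] = acc

n = 512
A = np.random.rand(n, n).astype(np.float32)
B = np.random.rand(n, n).astype(np.float32)
C = np.zeros((n, n), dtype=np.float32)

threads = (TILE, TILE)
blocks = ((n + TILE - 1) // TILE, (n + TILE - 1) // TILE)
tiled_matmul[blocks, threads](A, B, C)   # Numba handles host/device transfers for NumPy arrays
print(np.allclose(C, A @ B, atol=1e-2))
```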
  5. A new patch enables Nvidia GPU support on Raspberry Pi 5 and Rockchip devices, allowing for GPU-accelerated compute tasks. The article details the setup process, performance testing with llama.cpp, and current limitations with display output.
  6. This article details the integration of Docker Model Runner with the NVIDIA DGX Spark, enabling faster and simpler local AI model development. It covers setup, usage, and benefits like data privacy, offline availability, and ease of customization.
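Once Model Runner is serving a model, it can be queried through its OpenAI-compatible API. The sketch below uses the openai Python client; the base URL and model identifier are assumptions that depend on how the runner is configured, so verify them against your own setup.

```python
# Query a locally served model through an OpenAI-compatible endpoint.
# Base URL and model name are placeholders -- adjust to your Model Runner configuration.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:12434/engines/v1",  # assumed local endpoint; check your setup
    api_key="not-needed-locally",
)

resp = client.chat.completions.create(
    model="ai/llama3.2",                           # placeholder model identifier
    messages=[{"role": "user", "content": "Name one benefit of running models locally."}],
)
print(resp.choices[0].message.content)
```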
  7. Nvidia's DGX Spark is a relatively affordable AI workstation that prioritizes capacity over raw speed, enabling it to run models that consumer GPUs cannot. It features 128GB of memory and is based on the Blackwell architecture.
  8. Nvidia introduces the Rubin CPX GPU, designed to accelerate AI inference by decoupling the context and generation phases. It utilizes GDDR7 memory for lower cost and power consumption, aiming to redefine AI infrastructure.
  9. Nvidia has expanded its Jetson lineup with the Jetson AGX Thor Developer Kit, a compact platform that carries the new Jetson T5000 system-on-module. Although it is marketed as a developer system, its dimensions and form factor place it firmly in mini-PC territory, while its design and purpose align more with edge AI deployment than home computing.
    2025-08-31 by klotz
  10. Running GenAI models is easy. Scaling them to thousands of users, not so much. This guide details avenues for scaling AI workloads from proofs of concept to production-ready deployments, covering API integration, on-prem deployment considerations, hardware requirements, and tools like vLLM and Nvidia NIMs.
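As a concrete starting point for the serving-engine part of that stack, the snippet below runs offline batched generation with vLLM. The model name is a placeholder, and a GPU with enough memory to hold it is assumed.

```python
# Offline batched inference with vLLM; model name is a placeholder.
from vllm import LLM, SamplingParams

prompts = [
    "Explain KV caching in one sentence.",
    "What does tensor parallelism do?",
]
params = SamplingParams(temperature=0.7, max_tokens=64)

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # assumes the GPU can hold this model
for out in llm.generate(prompts, params):
    print(out.prompt, "->", out.outputs[0].text.strip())
```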
