NVIDIA RTX PRO 6000 Blackwell Server Edition on Nebius AI Cloud

Run AI inference, scientific simulation, and physical AI workloads on NVIDIA’s universal data center GPU with RTX technology, available on Nebius AI Cloud in flexible self-service configurations.

What makes RTX PRO 6000 on Nebius different

GPU performance, optimized in-house

Our GPU clusters are optimized across every layer of the stack — from in-house designed servers to a software layer validated against NVIDIA benchmarks — so your AI workloads run at the highest achievable performance.

AI without operational overhead

We take care of the infrastructure, at any scale. Run containerized experiments with Serverless Jobs or deploy multi-GPU inference clusters, without touching drivers, networking, or cluster configuration.

Security and reliability by design

Every Nebius cluster comes with auto-healing that detects and recovers from hardware failures with minimum possible interruption. Our platform is also built on industry security and compliance standards, so your data and workloads always stay secure.

What teams run on NVIDIA RTX PRO 6000

Cost-effective AI inference

Deploy and serve large language models on a single GPU. With 96 GB of memory, RTX PRO 6000 handles 70B-class models in quantized precision or 30B–40B models at full precision. Up to 4 MIG instances per card allow multiple isolated workloads to share one GPU efficiently.

Visual computing and physical AI

Accelerate workloads at the intersection of AI and graphics: vision-language-action models, embodied reasoning, digital twins, synthetic data generation, and NVIDIA Omniverse applications. Fourth-generation RT Cores and fifth-generation Tensor Cores handle both the AI and rendering sides of these workloads in a single GPU.

Scientific simulation

Run scientific and traditionally CPU-based HPC workloads at FP32 precision, including molecular dynamics, molecular docking, drug discovery, and physics-based simulation. RTX PRO 6000’s compute profile and large memory capacity make it a cost-effective choice for these workloads.

NVIDIA RTX PRO 6000 Blackwell Server Edition specifications

Specification

RTX PRO 6000 Blackwell Server Edition

CUDA parallel processing cores

24,064

NVIDIA Tensor Cores

752 (fifth-generation)

NVIDIA RT Cores

188 (fourth-generation)

Single-precision performance (FP32)

120 TFLOPS

Peak FP4 AI performance

4 PFLOPS

RT Core performance

355 TFLOPS

GPU memory

96 GB GDDR7 with ECC

Memory interface

512-bit

Memory bandwidth

1,597 GB/s

Multi-Instance GPU (MIG)

Up to 4× 24 GB

NVENC / NVDEC / JPEG

4× / 4× / 4×

Confidential Computing

Supported

Secure boot with root of trust

Yes

Graphics bus

PCI Express 5.0×16

Form factor

4.4″ (H) × 10.5″ (L), dual-slot

Thermal solution

Passive; air- and liquid-cooled

Power consumption

Up to 600 W (configurable)

Power connector

1× PCIe CEM5 16-pin

Source: NVIDIA RTX PRO 6000 Blackwell Server Edition official datasheet.

Built for inference, simulation, and physical AI

The RTX PRO 6000 Blackwell Server Edition pairs strong single-precision performance with fourth-generation RT Cores for real-time photorealistic rendering and visualization. This combination makes it one of the few GPU platforms that handles AI inference and graphics-intensive workloads with equal efficiency — from agentic AI and LLM serving to digital twins, synthetic data generation, and Omniverse-based physical AI applications.

With 96 GB of memory and MIG support, it sustains these workloads comfortably: large models fit without quantization trade-offs, and multiple isolated workloads can share a single GPU efficiently.

NVIDIA Exemplar Cloud Validation

Nebius is a Reference Platform NVIDIA Cloud Partner and holds NVIDIA Exemplar Cloud validation across multiple GPU generations — from NVIDIA H200 to NVIDIA GB300 NVL72. Exemplar Cloud is awarded to providers that demonstrate real-world training performance against NVIDIA’s benchmarking standards, not just peak specifications.

Frequently Asked Questions

The NVIDIA RTX PRO 6000 Blackwell Server Edition is a professional data center GPU built on NVIDIA Blackwell architecture. It combines 96 GB of GDDR7 memory, fifth-generation Tensor Cores with FP4 support, and fourth-generation RT Cores in a passively cooled, dual-slot form factor designed for 24/7 data center operation.

Get started with NVIDIA RTX PRO 6000 Blackwell Server Edition on Nebius

Launch your first instance in minutes or talk to our team to find the right setup for your workload.