NVIDIA Dynamo logo

NVIDIA Dynamo

by NVIDIA
Orchestration

NVIDIA Dynamo Platform is a Kubernetes-native inference platform for serving generative AI and reasoning models at scale. It installs the Dynamo operator with custom resources for graph and component deployments, plus bundled NATS messaging and etcd for a self-contained starting point. Optional Kai Scheduler and Grove integrations can be enabled when clusters need advanced scheduling or multinode orchestration for large model serving workloads.

Key features

Dynamo operator and CRDs

Manage Dynamo graph, component, checkpoint, model, and scaling resources through Kubernetes-native APIs.

Self-contained defaults

Start with bundled NATS messaging and persistent etcd, then switch to externally managed services when production operations require them.

Distributed workload orchestration

Enable Kai Scheduler and Grove integrations for advanced scheduling and multinode orchestration.

Configurable routing

Expose Dynamo workloads through Kubernetes Ingress or Istio when the cluster has the matching routing layer configured.


Pricing

Additional Nebius infrastructure costs may apply. Use the Nebius Pricing Page to estimate your infrastructure costs.

Self-managed

NVIDIA Dynamo on Kubernetes

Deploy NVIDIA Dynamo Platform on Kubernetes to orchestrate distributed generative AI and reasoning model inference workloads.

Free
Charged for resources
Setup time20+ minutes
ScalingAuto
MaintenanceSelf-managed (cluster)
Deploy
White-glove

Deploy with a solutions architect

Some applications are easier with a hand on the wheel. Talk to an architect who has deployed this in production.

  • Architecture review & sizing
  • Hands-on deploy session
  • 30 days of follow-up support
Talk to an expert

Security & compliance

Run NVIDIA Dynamo on infrastructure built for AI workloads

Reliable AI infrastructure backed by top-tier NVIDIA GPUs, purpose-built for demanding inference workloads. Multiple deployment methods — virtual machines for full hardware control, Kubernetes for scalable cluster deployments, and managed serverless applications for teams that want inference running without infrastructure overhead

Learn about Nebius AI Cloud

Security & compliance, out of the box

Nebius meets a broad set of security and compliance standards. Fine-grained IAM controls, audit logs, and encrypted storage are available out of the box — so teams can meet security requirements without additional tooling.

Explore the Trust center

Support

Application support

Provided by NVIDIA. See the documentation and project links above.

Infrastructure support

Provided by Nebius for the underlying cloud infrastructure.