Nebius AI Cloud “Aether 3.6”: Operating production AI with more control, efficiency, and confidence
Introducing Aether 3.6. Our latest quarterly platform release responds to how AI teams now operate at scale and with maturity, delivering a smoother developer experience, a stronger security and compliance foundation, advances across our in-house storage portfolio, and natural-language control over your AI cloud.
Train the draft model for your workload
Custom Speculator Training is now in Token Factory. Move from production data to a workload-specific draft model, then deploy it alongside the base model, in one workflow.
AI infrastructure that speaks your language
Introducing Nebius Echo, an AI agent built right into the Nebius console. Ask questions, inspect resources, and create infrastructure in plain language. No setup required; find it right inside the console.
MLPerf® Training 6.0: Leading NVIDIA HGX B300 and competitive NVIDIA GB300 NVL72 results on NVIDIA Blackwell Ultra systems
MLPerf® Training 6.0 results are in. Across six configurations on NVIDIA Blackwell Ultra systems, Nebius posted the #1 single-node HGX B300 times for Llama-3.1-8B and GPT-OSS 20B pre-training and came within 3.1% of the fastest GB300 NVL72 result on all three benchmarks at 72 GPUs. This post covers the full results and methodology.
Nebius Cloud Logs are now available in Datadog: Trace AI incidents across every layer
Nebius AI Cloud Logs now stream into Datadog Log Management. If you already run on Datadog, you can investigate your Nebius workloads alongside the rest of your stack and trace an AI incident across every layer without switching tools.
Introducing the Nebius Agents Blueprint: open architecture for production-ready AI agents
Today we’re introducing the Nebius Agents Blueprint, an open reference architecture for building, operating, and continuously improving AI agents in production. This post covers the six-component composable stack — inference, orchestration, retrieval, grounding, observability, and simulation — and the case study behind it: a compliance audit agent that achieved 72% lower cost and 20% higher precision over a GPT-based prototype by improving the system, not the model.
Building a compliance audit agent using Nebius Agents Blueprint
Using Nebius Agents Blueprint, we built a regulatory compliance audit agent and ran the same audit task through four configurations — from a $470 prototype to a production system at 72% lower cost with 1.00 recall. This post covers what changed at each step, what each configuration revealed about the one before it, and the production failures that only became visible once the operational architecture existed to find them.
NVIDIA retail AI blueprints, now running on Nebius
Nebius has collaborated with NVIDIA to bring two retail AI blueprints to production on Nebius infrastructure: the NVIDIA Agentic Commerce Blueprint and the NVIDIA Retail Catalog Enrichment Blueprint for automated product content. Both are open-source reference architectures built on NVIDIA NIMs that retailers and developers can customize and deploy today with a 1-click deployment on Nebius AI Cloud.
Building transaction foundation models on Nebius AI Cloud
Today, we’re exploring how transaction foundation models move from developer examples to production systems. Using NVIDIA’s TFM blueprint and Revolut’s PRAGMA model, we show how Nebius AI Cloud supports the full lifecycle — from GPU-accelerated data preparation and multi-node training to managed inference on Token Factory.
Run physical AI workflows, not glue code
The Nebius Physical AI Workbench turns NVIDIA Cosmos 3, NVIDIA Isaac Sim, NVIDIA Isaac GR00T and other Physical AI tools into composable building blocks that agents can wire together. We are building it in the open, and it is available now on GitHub.
Nebius and Tavily: Bringing agentic search into the production AI stack
Nebius has acquired Tavily to bring agentic search into the Nebius AI Cloud platform. In this post, we explain how Tavily adds real-time web access to the Token Factory stack, helping developers build production AI agents that can reason, verify information and act on live context instead of relying only on static model knowledge.
Data Lab: Your best dataset is already in your logs
Today, we’re launching Data Lab in Nebius Token Factory. It is a new workspace for turning production logs and existing datasets into reusable training data for post-training workflows. Data Lab helps teams explore inference logs, curate datasets and move directly into model iteration without rebuilding pipelines or copying production data across environments.
The AI cloud will be won at the software layer
Yesterday we announced that Clarifai, a core AI engineering and research team, is joining Nebius. This follows the recently-announced agreement to acquire Eigen AI. Both are deliberate moves on the same bet: that the AI infrastructure opportunity will be decided at the software layer.