In February, we at Nebius were busy preparing several new data center regions and working on our large-scale participation in NVIDIA GTC. We also rolled out some improvements to the platform and contributions to open source — as well as new initiatives to support AI research: a brand new biotech award and a research credits program.
We’re thrilled to announce a major upgrade to our US-based compute capacity. To bring it to life, we’ve joined forces with DataOne, an AI hosting infrastructure company, to ensure that the first phase of the New Jersey facility goes live this summer.
We’re also launching a colocation facility in Iceland with Verne, a provider of sustainably powered data centers across the Nordics, and expect it to go live in March. Along with our DC in Finland and colocations in France and Missouri, US, a total number of Nebius regions rises to five.
We are proud to be the Platinum sponsor of NVIDIA GTC 2025 — the premier developer conference at the heart of AI. Don’t miss your chance to meet our leaders, architects and tech experts. Visit our booth #809 and get NVIDIA GPU credits for your projects.
Nebius’ AI, hardware and development leads will also give two tech talks:
From zero to scale: how to build an efficient AI cloud platform from scratch
Enabling the agent-first future: advancing test-time search and compute infrastructure for agentic systems
Managed PostgreSQL is now generally available to all Nebius users. We’ve been improving the service to empower AI innovators with a reliable tool for storing structured data in the cloud. As the service transitions to GA, we are introducing pricing and SLAs.
We’re open-sourcing Kvax, our Flash Attention implementation. Designed for efficient training with long sequences, Kvax supports context parallelism and optimized computation of document masks. It outperforms many other Flash Attention implementations in long-context training with dense packing, achieving state-of-the-art performance.
AI Discovery is our brand new annual award for startups that are using AI in drug discovery, biotech, genomics and healthtech. Apply until April 30, 2025, and compete for $100,000 in AI Cloud credits.
Introducing Nebius research credits program. Eligible specialists can now gain access to AI Cloud or Studio inference tokens, helping in modern studies and tackling scientific challenges.
The final stop in our ‘Nebius AI Cloud Unveiled’ series will be in San Francisco! Join us on March 13 for a deep dive into our AI Cloud Accelerated by NVIDIA. Our AI developers will share insights on how we build the cloud and contribute to the AI field and open source.
For GA, Managed PostgreSQL got a big documentation overhaul. With the new articles, you can learn how to manage and connect to databases, work with users and extensions, and migrate or replicate your data from external clusters to Nebius AI Cloud.
Two new tutorials added to Compute. A quick-start tutorial focuses on hosting your first LLM on Compute virtual machines. And if you need to secure network access to your VMs, deploy a jump server with WireGuard.
You have to see performance to believe it. In Nebius AI Cloud, you can see performance on monitoring dashboards in the console, which we’ve started to cover in the documentation. Learn about dashboards and metrics for VMs and storage volumes in Compute, Managed Kubernetes clusters and nodes and Managed PostgreSQL clusters.
We are now accepting pre-orders for NVIDIA GB200 NVL72 and NVIDIA HGX B200 clusters to be deployed in our data centers in the United States and Finland from early 2025. Based on NVIDIA Blackwell, the architecture to power a new industrial revolution of generative AI, these new clusters deliver a massive leap forward over existing solutions.