Cloud for Scientific AI
and Healthcare

From drug discovery and genomics to medical imaging and digital health, Nebius AI Cloud provides the infrastructure, performance, and compliance needed to accelerate scientific innovation at scale.

Talk to an expert

Scaling AI Mental Health: Sword Health and Nebius

Sword Health uses Nebius AI Cloud to power Dawn, its large-scale AI mental health agent.

How Science teams use Nebius

BioPharma and drug discovery

Train proprietary biology foundation models and run end-to-end drug discovery pipelines from protein design to compound screening on GPU clusters built for computational chemistry and biology.

Capabilities:

Protein structure prediction & design (AlphaFold2, Boltz-2, RFdiffusion)
Molecular docking & virtual screening (DiffDock, GROMACS, Amber)
Foundation model fine-tuning for drug candidates
ADMET property prediction at scale

Why Nebius AI Cloud for HCLS

Automate workflows and optimize AI models

Our platform helps to shorten development cycles in biotech, enabling faster drug discovery, clinical trials and healthcare innovations.

Lower your operational costs

By offering a cloud-native platform, Nebius reduces the need for costly physical infrastructure, while providing top-tier AI and HPC performance. Flexible pricing options allow to test the platform and save money with long-term commitments.

Scale high-performance computing

Nebius delivers scalable HPC infrastructure, allowing you to process vast amounts of data quickly and efficiently, whether in research, diagnostics or bioinformatics.

Seamless cloud integration

Build, tune and deploy AI models effortlessly — we provide you with managed services and cloud-native platform management tools. Focus on innovation while the platform handles computing complexities.

Scalable infrastructure

Nebius scales effortlessly with your projects, from small research initiatives to large, enterprise-level clinical trials or biotech operations, adapting to your growing data and computing needs.

Real-time data processing

Process and analyze large volumes of healthcare and genomic data in real-time, allowing for quicker diagnostics, faster research results and improved decision-making.

Built for scientific breakthroughs

An end-to-end AI platform purpose-built for healthcare and life sciences—from infrastructure and scientific workflows to the industry’s leading AI ecosystem.

AI Infrastructure

Train foundation models, run molecular simulations, and process large-scale genomics workloads on scalable AI infrastructure built for scientific computing.

Serverless Inference

Run containerized GPU inference workloads on demand for real-time endpoints and batch processing. Eliminate cluster management, avoid idle GPU costs, and pay only for the compute you use with per-second billing.

Bioinformatics pipelines

Run Nextflow and nf-core pipelines on Nebius GPU clusters with a jointly engineered Seqera integration and one-click Terraform deployment for scalable, production-ready genomics workflows.

Managed MLflow

Track experiments, manage model versions, and link datasets with a fully managed MLflow environment covered by Nebius' SOC 2 controls.

Workflow Orchestration with Flyte (Union.ai)

Build reproducible scientific workflows on Nebius Managed Kubernetes with versioned pipelines for preprocessing, training, evaluation, and deployment.

NVIDIA Science AI Stack running on Nebius

Access NVIDIA’s healthcare and life sciences ecosystem on Nebius, including AI frameworks and models for drug discovery, genomics, medical imaging, clinical AI, real-time medical systems, and healthcare robotics, powered by BioNeMo, Nemotron, Parabricks, MONAI, Holoscan, and Isaac for Healthcare.

Enterprise-grade security for healthcare and life sciences AI workloads

Nebius meets the security, privacy, and compliance standards that healthcare and life sciences organizations demand. Independent third-party audits have verified our controls, giving your security, compliance, and procurement teams the assurance they need to accelerate AI adoption with confidence.

Audited by Deloitte. Certified across data protection, access management, privacy, and infrastructure security. SOC 2 Type II with HIPAA supports the secure handling of protected health information (PHI). ISO 27799 strengthens healthcare information security, while ISO 22301 helps ensure operational resilience for mission-critical clinical, research, and patient-facing applications.

Learn more in Trust Center and Full compliance blog post.

SOC2 Type II incl. HIPAA)

ISO 27001. ISMS

ISO 27701. Privacy and GDPR

ISO 22301. Business continuity

ISO 27018. Cloud PII

ISO 27799. Health data

ISO 27032. Cybersecurity

NIS2 & DORA aligned

CRISPR-GPT: AI gene-editing expert designed at Stanford

CRISPR-GPT is an LLM-powered agent system developed by scientists from Stanford, Princeton and Google DeepMind to automate gene editing experiments, from CRISPR system selection to sgRNA design and data analysis.

Goal: Transform gene editing from a months-long process into automated workflows accessible to any scientist.

Solution: Enabling rapid model screening and fine-tuning via Nebius.

Result: Junior researchers with no gene editing experience now achieve 80-90% efficiency on first attempt. Undergraduate students are onboarded in a day, and experts work faster by using AI agents to help run analysis, check designs and troubleshoot experiments.

Training
Life sciences
Research

100%

success rates for novice researchers

Training time reduced from weeks-to-months down to

1 day

Agentic automation

of design and analysis that integrates gene-editing expert knowledge

Accelerating bioinformatics

Unum scales IO efficiency in AI-driven life sciences workloads with StringZilla — an open-source, high-speed string processing library that accelerates computationally heavy tasks like hashing, sorting, and fuzzy-matching.

Goal: To fully leverage HPC architectures by optimizing the software layer.

Solution: Enabled by Nebius, Unum ported StringZilla to GPUs using hardware-specific kernels to tap into parallel processing capabilities for more efficient string scoring.

Result: Delivered orders-of-magnitude speedups for critical sequence analysis tasks like DNA, RNA, and protein alignment and fingerprinting.

Scaling
Drug Discovery
Research

>320 M

cell updates per second

300,000x

faster than traditional tools

>390 MB/s

of fingerprinting throughput per GPU

Psychology’s foundation LLM

Slingshot AI is building the first foundation model for psychology, reimagining mental health AI with its app, Ash. Trained on the largest behavioral dataset, Ash personalizes each journey via fine-tuning and RL.

Goal: Develop a responsible AI for scalable and tailored mental health support.

Solution: Based on real health data, Ash’s 32B-235B models were trained on fast Nebius GPU clusters with DeepSpeed and Zero-3. Fine-tuning and RL used expert-guided reward models.

Results: Adapting to users through reinforcement learning, the app opens a new in-silico therapy modality. A recent NYU study found that Ash helps users feel more connected, countering the idea that AI isolates people.

Training
Inference
Chatbot

50,000

beta users helped train and refine Ash over 18 months

32B-235B

model sizes

$93

million in venture funding raised to develop Ash

Scaling Bio Foundation Models

Helical builds Virtual AI Labs for pharma and biotech teams to scale virtual experiments in hours, not months. Their flagship model, Helix-mRNA, captures long-range biological signals and can be quickly adapted to proprietary data.

Goal: Develop personalized, production-grade Bio FMs for drug discovery pipelines.

Solution: Helical relies on Nebius to pretrain and post-train Helix-mRNA — a 500M-parameter hybrid model combining Mamba-2, transformers and codon-aware tokenization.

Results: Model training completed in 10 days, followed by 36-hour specialization runs per dataset. Helix-mRNA outperformed prior baselines, improving correlation scores by up to 146%. With Nebius, Helical scales faster and delivers in-silico tools in weeks — not months.

Training
Scaling
Drug discovery

10 days

base pretraining time for Helix-mRNA

+146%

gain in paired half-life & ribosome load prediction (vs. v0)

16,384

maximum token sequence length per model input

Advancing medical imaging models

xAID develops a foundation model for chest and abdomen CT scans, addressing the need for accurate diagnostic support amid a global radiologist shortage.

Goal: Train a large 3D transformer model on volumetric CT data, to detect 70+ pathologies across body regions.

Solution: Run compute-heavy training on large 256³ voxel CT scans by using FP16 precision and accumulated gradients.

Result: Strong F1 scores, stable multi-day training cycles and scalable model development.

Medical Imaging
Training

0.86

Macro F1 score on the top five common pathologies

5 days

per epoch training on noisy clinical data

256³

CT image resolution used as model input

Molecular dynamics for drug discovery

SieveStack is building the world’s largest dataset of molecular simulations to train a multi-layered stack of foundational models and advance dynamics-driven drug discovery.

Goal: To unlock treatment pathways for hard-to-treat diseases with AI-powered, physics-based modeling and biochemistry.

Solution: To generate high-precision molecular dynamics simulations and optimize model training with a mixed-precision approach — FP32 for accuracy and BF16 for performance.

Result: SieveStack leveraged Nebius and TractoAI support team to prototype, debug, and scale foundational models with >90% GPU utilization, revealing drug-target interactions beyond the reach of lab experiments.

Training
Drug discovery
Datasets

>90%

GPU efficiency

2–4x

faster training

–50%

memory use

Advancing precision medicine

Converge Bio is a biotech company pioneering the use of LLMs to analyze single-cell RNA sequencing data. Their work aims to transform how scientists understand disease mechanisms and therapeutic responses at the individual patient level.

Goal: To develop a scalable, high-performance foundation model capable of analyzing raw gene expression data across 20,000+ genes per cell and delivering explainable, patient-specific insights for drug discovery and precision medicine.

Solution: Converge trained Converge-SC on over 36 million cells and 2TB of data, leveraging H100 80GB GPUs and advanced parallelism techniques. The model retained full numerical fidelity, operated at the patient level, and was built for interpretability and usability.

Result: Converge-SC outperformed baseline models across disease classification and drug perturbation tasks. Now released via Hugging Face, the model is helping biotech and pharma teams unlock insights faster — bringing explainable AI to the forefront of biomedical research.

Training
Biotech
Research

2TB

dataset

30K

context length

7,000

hours of compute time

LLM-powered drug discovery

YerevaNN strives to fast-track generative molecular drug design. By leveraging molecular language models, the research center enables the precise, code-driven engineering of pharmaceutical compounds.

Goal: Optimize drug candidates with three continuously pre-trained models integrated by a genetic algorithm.

Solution: YerevaNN streamlined implementation with Flash Attention, FSDP, and post-backward prefetch for enhanced parallelism and faster preprocessing and training.

Result: With Nebius, YerevaNN fast-tracked text tokenization, expedited docking runs for molecular modeling, and improved performance by 8% over previous methods, processing 180,000 words per second.

Training
Research
Drug discovery

180,000

words per second

110M

molecules with computed properties and relations

8%

performance uplift

Quantum Chemistry for drug and material discovery

Simulacra AI is transforming the quantum chemistry field by automatically generating high-precision datasets for molecular dynamics models at scale.

Goal: Build a scalable foundational wave-function model for molecular systems that can generate high-accuracy datasets for pipelines of drug and material discovery.

Solution: Simulacra AI used Nebius infrastructure to overcome scalability and efficiency challenges.

Result: Simulacra AI delivers next-generation molecular data, enabling any company to refine in silico pipelines without relying on broad internal infrastructure to train models.

Training
Research
Quantum tech

100M+

model parameters

90% faster

Thanks to Nebius infrastructure, our largest models take 10–20 minutes to compile for pre-training, compared to over 2 hours previously

H100 + H200

NVIDIA Tensor core GPU fleet

Advancing molecular generation

Quantori is the end-to-end data, technology and digital services partner of choice for leading biopharma and healthcare organizations worldwide.

Goal: To develop an AI framework that generates molecules with precise 3D shapes, enhancing drug discovery and material design.

Solution: Quantori employs a pipeline based on Equivariant Diffusion Model and Structure Seer model trained on 1.6M molecules from the ChEMBL database. The pipeline generates molecular structures using shape descriptors.

Result: After 1,500 training epochs, the model successfully generated chemically sound molecules that closely resemble real molecules in shape. The approach enables rapid molecular ideation, predicting valid 3D conformations with optimized properties.

Training
Drug discovery

1.6M

molecules from ChEMBL — dataset size

1,500 epochs

Training duration

High similarity

to reference geometries

ML-powered clinical trials

TrialHub is a data intelligence platform designed to make clinical trials more efficient and patient-centric.

Goal: To deliver quantifiable insights from unstructured, text-heavy data — scaling to production in days.

Solution: To launch an MLOps-optimized vector embedding pipeline with NVIDIA L40s and Nebius expert support.

Result: Deployed one of the largest vector databases in clinical research, with 250 million vectors, reducing trial delays and amendments by half.

AI Healthcare
Medtech
Clinical trials
AI Innovation

80,000

analyzed sources

250 million

vectors

20x

faster research

Scaling Graph AI projects

Lynx Analytics delivers AI-powered solutions to enterprises across life sciences, telecom, retail and finance. Its platform, LynxKite 2000:MM, lets experts build predictive Graph AI workloads to enhance reasoning without writing code.

Goal: Power fast and flexible Graph AI workflows.

Solution: Using Nebius’ VMs and managed Kubernetes to scale workloads, LynxKite 2000:MM automates container orchestration, maintaining high utilization and rapid deployment.

Results: With Nebius, Lynx Analytics accelerates delivery across diverse AI projects. Easy provisioning and Slurm support via Soperator enable engineers to focus on modeling, while flexible scaling keeps performance and costs optimized.

Training
Scaling
Drug discovery

>80%

average GPU utilization across dynamic workloads

1 to 1000+

GPUs used depending on scale and demand

Zero code

required to build and deploy Graph AI models

Training Novel Cancer AI Models

Compugen is a clinical-stage biotech company pioneering AI/ML-powered drug discovery for cancer immunotherapy. Its Unigen™ platform integrates multi-omics, spatial, and single-cell data to identify new drug targets and resistance pathways.

Goal: Develop AI models that uncover new immune features in cancer datasets.

Solution: Nebius helps train large models, including single‑cell RNA transformers. Models up to 3B parameters run on BF16/FP32 with minimal setup, thanks to Nebius’ ease of use.

Results: The team trained a unique model that predicts spatial features (e.g., TLS) from non-spatial data, enabling deeper analysis across broader datasets. Nebius provided flexibility, performance, and simplicity, helping Compugen integrate AI into its Unigen discovery pipeline.

Drug Discovery
Training
Biotech

3B+

parameter models fine-tuned using proprietary cancer datasets

20 sec – 20 min

training time per epoch for 100K cell RNA-seq models

0.94 ROC-AUC

Best TLS prediction result without spatial coordinates

AI-powered mental health support

Sword Health used Nebius to help bring Dawn to life, expanding its AI Care model into direct-to-consumer mental wellbeing support. The company combined years of clinical AI development with new inference architecture to make long, sensitive conversations safer, faster and more scalable.

Goal: Build a mental health wellbeing system that could deliver clinically rigorous, high-quality support over long, multi-session conversations while meeting strict safety, governance, latency and cost requirements.

Solution: Sword paired MindGuard and MindEval with Nebius Token Factory and dedicated NVIDIA Blackwell endpoints. Using custom speculative decoding, KV caching and prefix-aware routing, the team moved from a 30B model to a 200B+ parameter model without sacrificing production responsiveness.

Result: Sword reduced tail latency from more than 20 seconds to under 12 seconds, cleared its quality bar for Dawn, and created a production-ready platform for AI wellbeing support at scale. Nebius also powered Sword’s broader AI footprint, including Thrive and high-volume AI-guided care delivery.

AI Healthcare
Inference
Large models

200B+ parameters

dawn scaled from 30B to 200B+ in production

<12s P99

tail latency dropped from 20+ seconds to under 12 seconds

10M sessions

Sword delivered 10 million AI-guided sessions in 2025

Contact us

Featured resources and events

Learn and connect with us.

Nebius AI Discovery Awards

AI Discovery Awards by Nebius is an annual initiative celebrating startups that leverage AI to revolutionize the future of science. Join us July, 1 in London.

Nebius for Life Sciences

Discover how life science teams use Nebius AI Cloud to accelerate drug discovery, genomics, protein engineering, and large-scale AI research with production-ready GPU infrastructure.

Download the solution brief

Scalable protein design with Nextflow, Seqera AI, and Nebius

Watch how to build, scale, and deploy AI-powered protein design pipelines faster using Nextflow, Seqera AI, and Nebius AI Cloud.

Watch on demand

Ready to get started?

Talk to an expert Try console

Learn more