Benchmarking recipes

To support you with running your workloads, we have curated a set of reproducible benchmark recipes that use some of the most common machine learning (ML) frameworks and models. These are stored in repositories. To access these repositories, see AI Hypercomputer organization. These benchmark recipes were tested on clusters created using Cluster Toolkit.

Overview

Before you get started with these recipes, ensure that you have completed the following steps:

  1. Choose an accelerator that best suits your workload. See Choose a deployment strategy.
  2. Select a consumption method based on your accelerator of choice, see Consumption options.
  3. Create your cluster based on the type of accelerator selected. See Cluster deployment guides.

Recipes

The following reproducible benchmark recipes are available for pre-training and inference on GKE clusters.

To search the catalog, you can filter by a combination of your framework, model, and accelerator.

Recipe nameAcceleratorModelFrameworkWorkload type
Llama3.1 70B - A3 UltraA3 UltraLlama3.1 70BMaxTextPre-training on GKE
Llama3.1 70B - A3 UltraA3 UltraLlama3.1 70BNeMoPre-training on GKE
Mixtral-8-7B - A3 UltraA3 UltraMixtral-8-7BNeMoPre-training on GKE
GPT3-175B - A3 MegaA3 MegaGPT3-175BNeMoPre-training on GKE
Mixtral 8x7B - A3 MegaA3 MegaMixtral 8x7BNeMoPre-training on GKE
A3 Mega
  • Llama3 70B
  • Llama3.1 70B
NeMoPre-training on GKE
DeepSeek R1 671BA3 MegaDeepSeek R1 671BSGLangInference on GKE
DeepSeek R1 671BA3 MegaDeepSeek R1 671BvLLMInference on GKE
Llama-3.1-405BA3 UltraLlama-3.1-405BTensorRT-LLMInference on GKE
DeepSeek R1 671BA3 UltraDeepSeek R1 671BSGLangInference on GKE
DeepSeek R1 671BA3 UltraDeepSeek R1 671BvLLMInference on GKE