🤗 Optimum Habana

🤗 Optimum Habana is the interface between the 🤗 Transformers and 🤗 Diffusers libraries and Habana's Gaudi processor (HPU). It provides a set of tools that enable easy model loading, training, and inference in single- and multi-HPU settings for various downstream tasks, as shown in the tables below.
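As an illustration, fine-tuning on Gaudi follows the familiar Transformers `Trainer` workflow through Gaudi-specific drop-in classes. The sketch below is a minimal example rather than a canonical recipe; the BERT checkpoint, the SST-2 slice, and the `Habana/bert-base-uncased` Gaudi configuration are choices made here for illustration.

```python
# Minimal fine-tuning sketch with Optimum Habana's drop-in Trainer classes.
# The checkpoint, dataset, and Gaudi configuration are illustrative choices.
from datasets import load_dataset
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from optimum.habana import GaudiTrainer, GaudiTrainingArguments

model_name = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Tokenize a small slice of SST-2 purely for demonstration purposes.
dataset = load_dataset("glue", "sst2", split="train[:1000]")
dataset = dataset.map(
    lambda batch: tokenizer(batch["sentence"], truncation=True, padding="max_length", max_length=128),
    batched=True,
)

args = GaudiTrainingArguments(
    output_dir="./results",
    use_habana=True,       # run on HPU
    use_lazy_mode=True,    # Habana lazy-execution mode
    gaudi_config_name="Habana/bert-base-uncased",  # mixed-precision settings hosted on the Hub
    num_train_epochs=1,
    per_device_train_batch_size=8,
)

trainer = GaudiTrainer(model=model, args=args, train_dataset=dataset, tokenizer=tokenizer)
trainer.train()
```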

HPUs offer fast model training and inference as well as a great price-performance ratio. For concrete examples, check out this blog post about BERT pre-training and this article benchmarking Habana Gaudi2 against Nvidia A100 GPUs. If you are not familiar with HPUs, we recommend you take a look at our conceptual guide.

The following model architectures, tasks and device distributions have been validated for 🤗 Optimum Habana:

In the tables below, ✅ means single-card, multi-card and DeepSpeed have all been validated.

  • Transformers

| Architecture | Training | Inference | Tasks |
|--------------|----------|-----------|-------|
| BLOOM(Z) | ❌ | DeepSpeed | text generation |
| StarCoder | ❌ | Single card | text generation |
| Llama 2 / CodeLlama | DeepSpeed, LoRA | DeepSpeed, LoRA | text generation |
| StableLM | ❌ | Single card | text generation |
| Falcon | ❌ | Single card | text generation |
| CodeGen | ❌ | Single card | text generation |
| MPT | ❌ | Single card | text generation |
| ESMFold | ❌ | Single card | protein folding |
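For the generation models above that list single-card inference, a typical workflow patches Transformers with Gaudi-optimized code paths and then uses the standard `generate` API. The sketch below is one illustrative way to do this, assuming a SynapseAI PyTorch installation (`habana_frameworks`); the StableLM checkpoint is an example choice, and exact entry points can vary across releases.

```python
# Illustrative single-card text-generation sketch on HPU. Assumes a working
# habana_frameworks PyTorch install; entry points may differ across releases.
import torch
import habana_frameworks.torch.core as htcore  # registers the "hpu" device
from transformers import AutoModelForCausalLM, AutoTokenizer
from optimum.habana.transformers.modeling_utils import adapt_transformers_to_gaudi

adapt_transformers_to_gaudi()  # swap in Gaudi-optimized model implementations

model_name = "stabilityai/stablelm-base-alpha-3b"  # example single-card model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval().to("hpu")

inputs = tokenizer("Gaudi processors are", return_tensors="pt").to("hpu")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```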

  • Diffusers

| Architecture | Training | Inference | Tasks |
|--------------|----------|-----------|-------|
| Stable Diffusion | ❌ | Single card | text-to-image generation |
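For Stable Diffusion, the library exposes a Gaudi version of the Diffusers pipeline. The following sketch mirrors single-card usage; the checkpoint and the `Habana/stable-diffusion` Gaudi configuration follow the official examples, but treat the exact arguments as release-dependent.

```python
# Illustrative single-card text-to-image sketch with the Gaudi Stable
# Diffusion pipeline; arguments may vary across releases.
from optimum.habana.diffusers import GaudiDDIMScheduler, GaudiStableDiffusionPipeline

model_name = "runwayml/stable-diffusion-v1-5"
scheduler = GaudiDDIMScheduler.from_pretrained(model_name, subfolder="scheduler")

pipeline = GaudiStableDiffusionPipeline.from_pretrained(
    model_name,
    scheduler=scheduler,
    use_habana=True,      # run on HPU instead of CPU/GPU
    use_hpu_graphs=True,  # capture HPU graphs to reduce host overhead
    gaudi_config="Habana/stable-diffusion",  # mixed-precision config from the Hub
)

images = pipeline(
    prompt="A photo of an astronaut riding a horse on Mars",
    num_images_per_prompt=1,
).images
images[0].save("astronaut.png")
```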

Other models and tasks supported by the 🤗 Transformers and 🤗 Diffusers libraries may also work. You can refer to this section for using them with 🤗 Optimum Habana. In addition, this page explains how to modify any example from the 🤗 Transformers library to make it work with 🤗 Optimum Habana.
