Optimum
search
⌘Ctrlk
Optimum
  • 🌍OVERVIEW
  • 🌍HABANA
  • 🌍INTEL
  • 🌍AWS TRAINIUM/INFERENTIA
  • 🌍FURIOSA
  • 🌍ONNX RUNTIME
  • 🌍EXPORTERS
  • 🌍TORCH FX
  • 🌍BETTERTRANSFORMER
  • 🌍LLM QUANTIZATION
    • GPTQ quantization
  • 🌍UTILITIES
gitbookPowered by GitBook
block-quoteOn this pagechevron-down

🌍LLM QUANTIZATION

GPTQ quantizationchevron-right
PreviousHow to add support for new architectures?chevron-leftNextGPTQ quantizationchevron-right