Transformers

🌍MULTIMODAL

  • Image captioning
  • Document Question Answering
  • Visual Question Answering
  • Text to speech