Transformers

🌍MULTIMODAL

  • Image captioning
  • Document Question Answering
  • Visual Question Answering
  • Text to speech