Datasets
  • 🌍GET STARTED
    • Datasets
    • Quickstart
    • Installation
  • 🌍TUTORIALS
    • Overview
    • Load a dataset from the Hub
    • Know your dataset
    • Preprocess
    • Evaluate predictions
    • Create a data
    • Share a dataset to the Hub
  • 🌍HOW-TO GUIDES
    • Overview
    • 🌍GENERAL USAGE
      • Load
      • Process
      • Stream
      • Use with TensorFlow
      • Use with PyTorch
      • Use with JAX
      • Use with Spark
      • Cache management
      • Cloud storage
      • Search index
      • Metrics
      • Beam Datasets
    • 🌍AUDIO
      • Load audio data
      • Process audio data
      • Create an audio dataset
    • 🌍VISION
      • Load image data
      • Process image data
      • Create an image dataset
      • Depth estimation
      • Image classification
      • Semantic segmentation
      • Object detection
    • 🌍TEXT
      • Load text data
      • Process text data
    • 🌍TABULAR
      • Load tabular data
    • 🌍DATASET REPOSITORY
      • Share
      • Create a dataset card
      • Structure your repository
      • Create a dataset loading script
  • 🌍CONCEPTUAL GUIDES
    • Datasets with Arrow
    • The cache
    • Dataset or IterableDataset
    • Dataset features
    • Build and load
    • Batch mapping
    • All about metrics
  • 🌍REFERENCE
    • Main classes
    • Builder classes
    • Loading methods
    • Table Classes
    • Logging methods
    • Task templates
Powered by GitBook
On this page
  1. HOW-TO GUIDES

Overview

PreviousHOW-TO GUIDESNextGENERAL USAGE

Last updated 1 year ago

Overview

The how-to guides offer a more comprehensive overview of all the tools 🌍 Datasets offers and how to use them. This will help you tackle messier real-world datasets where you may need to manipulate the dataset structure or content to get it ready for training.

The guides assume you are familiar and comfortable with the 🌍 Datasets basics. We recommend newer users check out our first.

Interested in learning more? Take a look at of the BOINC AI course!

The guides are organized into six sections:

  • General usage: Functions for general dataset loading and processing. The functions shown in this section are applicable across all dataset modalities.

  • Audio: How to load, process, and share audio datasets.

  • Vision: How to load, process, and share image datasets.

  • Text: How to load, process, and share text datasets.

  • Tabular: How to load, process, and share tabular datasets.

  • Dataset repository: How to share and upload a dataset to the .

If you have any questions about 🌍 Datasets, feel free to join and ask the community on our .

🌍
tutorials
Chapter 5
Hub
forum