Load image data
Image datasets are loaded from the image column, which contains a PIL object.
To work with image datasets, you need to have the vision dependency installed. Check out the guide to learn how to install it.
When you load an image dataset and access the image column, the Image feature automatically decodes the stored data into a PIL image.
Index into an image dataset using the row index first and then the image column (dataset[0]["image"]) to avoid decoding and resampling all the image objects in the dataset. Indexing the other way around can be slow if you have a large dataset.
For a guide on how to load any type of dataset, take a look at the general loading guide.
You can load a dataset from image paths. Use the cast_column() function to take a column of image file paths and decode it into PIL images with the Image feature.
You can also load a dataset with the ImageFolder dataset builder, which does not require writing a custom dataloader. This makes ImageFolder ideal for quickly creating and loading image datasets with several thousand images for different vision tasks. Organize your image dataset into one directory per class, optionally nested under split directories such as train and test.
Load your dataset by specifying imagefolder and the directory of your dataset in data_dir.
Load remote datasets from their URLs with the data_files parameter.
If you only want to load the underlying path to each image without decoding the image object, set decode=False in the Image feature.
Some datasets have a metadata file (metadata.csv/metadata.jsonl) associated with them, containing other information about the data like bounding boxes, text captions, and labels. The metadata is automatically loaded when you call load_dataset() and specify imagefolder.
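To ignore the information in the metadata file, set drop_metadata=True in load_dataset(). When a metadata file is present, labels are not inferred from the directory names by default. Set drop_labels=False to let ImageFolder infer the label name from the directory name.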
For more information about creating your own ImageFolder dataset, take a look at the guide.