composer.datasets#

Natively supported datasets.

Classes

ADE20k

PyTorch Dataset for ADE20k.

C4Dataset

Builds a streaming, sharded, sized torch.utils.data.IterableDataset for the C4 (Colossal Cleaned Common Crawl) dataset.

COCODetection

PyTorch Dataset for the COCO dataset.

PytTrain

PytVal

StreamingADE20k

Implementation of the ADE20k dataset using StreamingDataset.

StreamingC4

Implementation of the C4 (Colossal Cleaned Common Crawl) dataset using StreamingDataset.

StreamingCIFAR10

Implementation of the CIFAR10 dataset using StreamingDataset.

StreamingCOCO

Implementation of the COCO dataset using StreamingDataset.

StreamingImageNet1k

Implementation of the ImageNet1k dataset using StreamingDataset.

SyntheticBatchPairDataset

Emulates a dataset of provided size and shape.

SyntheticDataLabelType

Defines the class label type of the synthetic data.

SyntheticDataType

Defines the distribution of the synthetic data.

SyntheticPILDataset

Similar to SyntheticBatchPairDataset, but yields samples of type Image and supports dataset transformations.