The PUG: Animals dataset is for research on out-of-distribution generalization and for studying the representational space of foundation models. It includes:
By allowing for precise control distribution shifts between training and testing, this dataset can give researchers better insight into how a deep neural network generalizes on held-out variation factors.
Download the dataset - 78GBThe PUG: ImageNet dataset provides a novel, useful benchmark for the fine-grained evaluation of the image classifiers’ robustness along several variation factors. It contains:
The PUG: SPAR (Scene, Position, Attribute, Relation) dataset is used for the evaluation of vision-language models. It demonstrates how synthetic data can be used to address current benchmarks limitations. It contains:
We use the Unreal Engine, a powerful game engine well-known in the entertainment industry, to create photorealistic interactive environments from which we can easily sample images with given specifications. The diagram below illustrates how we use the Unreal Engine and sample images to generate PUG datasets.
The PUG family of datasets is released under a CC-BY-NC license with the addenda that they should not be used for Generative AI.
The PUG datasets are intended for model evaluation and discriminative training only and should not be used for training generative models.
The data (3D assets) were acquired through the Unreal Engine Marketplace and Sketchfab. Assets were then incorporated into the Unreal Engine to generate realistic 3D scenes and corresponding images. The 3D assets were manually selected to ensure high quality. Visit github for a complete list of assets used.
Please check the paper to learn more about our research findings and check the datasheet to learn more about the datasets.
Florian Bordes, Shashank Shekhar, Mark Ibrahim, Diane Bouchacourt,
Pascal Vincent, Ari S. Morcos