The iNaturalist Species Classification and Detection Dataset
Abstract
Existing image classification datasets used in computer vision tend to have a uniform distribution of images across object categories. In contrast, the natural world is heavily imbalanced, as some species are more abundant and easier to photograph than others. To encourage further progress in challenging real world conditions we present the iNaturalist species classification and detection dataset, consisting of 859,000 images from over 5,000 different species of plants and animals. It features visually similar species, captured in a wide variety of situations, from all over the world. Images were collected with different camera types, have varying image quality, feature a large class imbalance, and have been verified by multiple citizen scientists. We discuss the collection of the dataset and present extensive baseline experiments using state-of-the-art computer vision classification and detection models. Results show that current non-ensemble based methods achieve only 67% top one classification accuracy, illustrating the difficulty of the dataset. Specifically, we observe poor results for classes with small numbers of training examples suggesting more attention is needed in low-shot learning.
Additional Information
© 2018 IEEE. This work was supported by a Google Focused Research Award. We would like to thank: Scott Loarie and Ken-ichi Ueda from iNaturalist; Steve Branson, David Rolnick, Weijun Wang, and Nathan Frey for their help with the dataset; Wendy Kan and Maggie Demkin from Kaggle; the iNat2017 competitors, and the FGVC2017 workshop organizers. We also thank NVIDIA and Amazon Web Services for their donations.Attached Files
Accepted Version - 1916.pdf
Submitted - 1707.06642.pdf
Files
Name | Size | Download all |
---|---|---|
md5:c66f4b285740af3fa7e6d2c61b98e50f
|
8.4 MB | Preview Download |
md5:c380af4456b7c1c147321b22b6f1f186
|
8.3 MB | Preview Download |
Additional details
- Eprint ID
- 87114
- Resolver ID
- CaltechAUTHORS:20180614-123754546
- NVIDIA
- Amazon Web Services
- Created
-
2018-06-14Created from EPrint's datestamp field
- Updated
-
2021-11-15Created from EPrint's last_modified field