A dataset of geotagged photos on a world-wide scale is presented. The dataset contains a sample of more than 14 million geotagged photos crawled from Flickr with the corresponding metadata. To guarantee the spatial representativeness of the dataset, a crawling approach based on the small-world phenomena and the Flickr friendship’s graph is applied. Furthermore, the noisiness of user-provided tags is reduced through an automatic tag cleaning approach. To enable efficient retrieval, photos in the dataset are indexed based on their location information using quad-tree data structure. The dataset can assists different applications, especially, search-based automatic image annotation and reverse geotagging.
The files are available for download via HTTP. Link: http://traces.cs.umass.edu/index.php/Mmsys/Mmsys The files are available in one archive (79.3GB) for download via HTTP: Link: http://skuld.cs.umass.edu/traces/mmsys/2014/user03.tar
References and Citation
Use of the datasets in published work should be acknowledged by a full citation to the authors' papers [MWH14] at the MMSys conference (Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore).
MWH14: H. Mousselly-Sergieh, D. Watzinger, B. Huber, M. Döller, E. Egyed-Zsigmond, H. Kosch, World-wide scale geotagged image dataset for automatic image annotation and reverse geotagging, Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore.