World-Wide Scale Geotagged Image Dataset for Automatic Image Annotation and Reverse Geotagging

Author: Université Lyon

Partner: No

Contact: Hatem Mousselly-Sergieh (




Total: 14000000


A dataset of geotagged photos on a world-wide scale is presented. The dataset contains a sample of more than 14 million geotagged photos crawled from Flickr, together with the corresponding metadata. To ensure the spatial representativeness of the dataset, a crawling approach based on the small-world phenomenon and the Flickr friendship graph is applied. Furthermore, the noisiness of user-provided tags is reduced through an automatic tag cleaning approach. To enable efficient retrieval, photos in the dataset are indexed by their location information using a quad-tree data structure. The dataset can assist different applications, especially search-based automatic image annotation and reverse geotagging.
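The location-based indexing mentioned above can be illustrated with a minimal point quad-tree sketch. This is not the dataset's actual index: the class and field names (Photo, QuadTree, capacity, max_depth), the node capacity, and the sample photo IDs and coordinates are all assumptions chosen for illustration. Each node covers a latitude/longitude bounding box and splits into four quadrants once it holds more photos than its capacity.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Photo:
    # Hypothetical record; the real dataset carries more metadata per photo.
    photo_id: str
    lat: float
    lon: float


@dataclass
class QuadTree:
    # Bounding box of this node, in degrees.
    south: float
    west: float
    north: float
    east: float
    capacity: int = 4      # assumed leaf capacity before splitting
    max_depth: int = 16    # stops endless splitting on identical coordinates
    depth: int = 0
    photos: List[Photo] = field(default_factory=list)
    children: Optional[List["QuadTree"]] = None

    def contains(self, lat: float, lon: float) -> bool:
        return self.south <= lat <= self.north and self.west <= lon <= self.east

    def insert(self, photo: Photo) -> bool:
        """Insert a photo; returns False if it lies outside this node."""
        if not self.contains(photo.lat, photo.lon):
            return False
        if self.children is None:
            if len(self.photos) < self.capacity or self.depth >= self.max_depth:
                self.photos.append(photo)
                return True
            self._split()
        # A point on a shared edge goes to the first quadrant that accepts it.
        return any(child.insert(photo) for child in self.children)

    def _split(self) -> None:
        mid_lat = (self.south + self.north) / 2
        mid_lon = (self.west + self.east) / 2
        args = (self.capacity, self.max_depth, self.depth + 1)
        self.children = [
            QuadTree(self.south, self.west, mid_lat, mid_lon, *args),  # SW
            QuadTree(self.south, mid_lon, mid_lat, self.east, *args),  # SE
            QuadTree(mid_lat, self.west, self.north, mid_lon, *args),  # NW
            QuadTree(mid_lat, mid_lon, self.north, self.east, *args),  # NE
        ]
        # Move the photos held by this node down into the new quadrants.
        for photo in self.photos:
            any(child.insert(photo) for child in self.children)
        self.photos = []

    def query(self, south: float, west: float,
              north: float, east: float) -> List[Photo]:
        """Return all photos inside the given bounding box."""
        if (east < self.west or west > self.east
                or north < self.south or south > self.north):
            return []  # the query box does not overlap this node
        found = [p for p in self.photos
                 if south <= p.lat <= north and west <= p.lon <= east]
        if self.children is not None:
            for child in self.children:
                found.extend(child.query(south, west, north, east))
        return found


# Illustrative data only; IDs and coordinates are made up for this sketch.
index = QuadTree(-90.0, -180.0, 90.0, 180.0)
for sample in [
    Photo("lyon-1", 45.764, 4.835),
    Photo("lyon-2", 45.750, 4.850),
    Photo("passau-1", 48.567, 13.432),
    Photo("singapore-1", 1.352, 103.820),
]:
    index.insert(sample)

# Retrieve the photos inside a small box around Lyon.
nearby = index.query(45.7, 4.8, 45.8, 4.9)
```

In a search-based annotation or reverse geotagging setting, a bounding-box query like the one above would narrow the candidate photos to a region before any tag or visual comparison is made. The sketch treats latitude and longitude as planar coordinates and does not handle boxes that cross the 180° meridian.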


The files are available in one archive (79.3 GB) for download via HTTP. Link:

References and Citation

Use of the dataset in published work should be acknowledged by a full citation to the authors' paper [MWH14], presented at the MMSys conference (Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore).


  • MWH14: H. Mousselly-Sergieh, D. Watzinger, B. Huber, M. Döller, E. Egyed-Zsigmond, H. Kosch, World-wide scale geotagged image dataset for automatic image annotation and reverse geotagging, Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore.