There is an urgent need for algorithms that can group media by the social events they depict or relate to. To train, test, and evaluate such algorithms and frameworks, we present a dataset of about 430,000 photos from Flickr together with the underlying ground truth of about 21,000 social events. All photos are accompanied by their textual metadata. The ground truth for the event groupings has been derived from event calendars on the Web that were created collaboratively by people. The dataset was used in the Social Event Detection (SED) task of the MediaEval Benchmark for Multimedia Evaluation 2013. This task required participants to discover social events and organize the related media items into event-specific clusters within a collection of Web multimedia documents.
The files are available for download via HTTP as a single archive (121 GB).
Link: http://traces.cs.umass.edu/index.php/Mmsys/Mmsys
Direct link to the archive: http://skuld.cs.umass.edu/traces/mmsys/2014/user04.tar
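Because the archive is large, an interrupted transfer is costly. The following is a minimal sketch of a resumable download using only the Python standard library. The helper names (`range_header`, `download`) and the local file name are illustrative, and it is an assumption that the server honours HTTP Range requests; if it does not, the sketch falls back to a full download.

```python
import os
import urllib.request

# Archive URL as given on this page.
ARCHIVE_URL = "http://skuld.cs.umass.edu/traces/mmsys/2014/user04.tar"


def range_header(local_path):
    """Build a Range header that resumes after the bytes already on disk."""
    offset = os.path.getsize(local_path) if os.path.exists(local_path) else 0
    return {"Range": f"bytes={offset}-"} if offset else {}


def download(url, local_path, chunk_size=1 << 20):
    """Stream url to local_path, appending to a partial file if one exists.

    Assumption: the server may or may not support Range requests.
    A 206 status means it resumed; any other success status means it is
    sending the whole file, so the local file is overwritten.
    If the local file is already complete, the server will typically
    answer 416 and urlopen raises urllib.error.HTTPError.
    """
    request = urllib.request.Request(url, headers=range_header(local_path))
    with urllib.request.urlopen(request) as response:
        mode = "ab" if response.status == 206 else "wb"
        with open(local_path, mode) as out:
            while True:
                chunk = response.read(chunk_size)
                if not chunk:
                    break
                out.write(chunk)


# Example call (not executed here, since it transfers the full archive):
# download(ARCHIVE_URL, "user04.tar")
```

Re-running the same call after an interruption continues from the size of the partial file rather than starting over, provided the server answers with status 206.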
References and Citation
Use of this dataset in published work should be acknowledged by a full citation to the authors' paper [RPM14] at the MMSys conference (Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore).
[RPM14] T. Reuter, S. Papadopoulos, V. Mezaris, and P. Cimiano. ReSEED: Social Event dEtection Dataset. Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore.