EBU MIM-SCAIE Content Set for Automatic Information Extraction on Broadcast Media

Author: EBU

Partner: No

Contact: Mike Matton (mike.matton@vrt.be)




Total: 393

Resolution: 720x576


This content set has been made available by the European Broadcasting Union (EBU). It consists of broadcast media content collected from different broadcasters around the world, and it is offered to the research community for evaluating automatic information extraction tools on broadcast media. The set also contains ground truth data and annotations for several automatic information extraction tasks.


The files are distributed for download via HTTP after registration. The original link, http://ebu-scaie.lab.vrt.be/mammie/, is currently broken.

References and Citation

Use of the dataset in published work should be acknowledged by a full citation to the authors' paper [MMB14], presented at the ACM MMSys conference (Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore).


  • MMB14: M. Matton, A. Messina, W. Bailer, J.-P. Évain, The EBU MIM-SCAIE content set for automatic information extraction on broadcast media, Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore.