The EBU MIM-SCAIE content set is made available by the European Broadcasting Union (EBU). It consists of broadcast media content collected from broadcasters around the world and is offered to the research community for evaluating automatic information extraction tools on broadcast media. The set also includes ground truth data and annotations for several automatic information extraction tasks.
The files are available for download via HTTP after registration. Link: http://ebu-scaie.lab.vrt.be/mammie/
References and Citation
Published work that uses the content set should acknowledge it with a full citation of the authors' paper [MMB14], presented at the ACM MMSys 2014 conference (March 19 - March 21, 2014, Singapore).
[MMB14] M. Matton, A. Messina, W. Bailer, J.-P. Évain, "The EBU MIM-SCAIE content set for automatic information extraction on broadcast media," Proceedings of ACM MMSys 2014, March 19 - March 21, 2014, Singapore, Singapore.