The Storytime Dataset

Clips for simulated videotelephony. Four stories with ten parts each, four different quality levels per clip. German language.

Author: Quality and Usability Lab, TU Berlin

Partner: Yes

Contact: Robert Spang

Tags: , , ,


Subjective scores: true

Total: 160

Ratings: 2-11 per clip

Resolution: 1920x1080

Method: Custom


To study people’s natural behavior during different conditions of audiovisual quality, we usually invite people into a lab and let them talk to each other. In such conversation settings, not only the media quality impacts the quality perception, but, e.g., social aspects of a real conversation are reflected by individual conversational and rating behavior. Hence, to study quality perception in conversational settings, we try to create an environment that isolates the media quality from such outside factors and is consistent for each participant in the lab. Therefore, we created a dataset of simulated videotelephony clips to act as stimuli in quality perception research. The dataset consists of four different stories in the German language that are told through ten consecutive parts, each about 10 seconds long. Each of these parts is available in four different quality levels, ranging from perfect to stalling. All clips (FullHD, H.264 / AAC) are actual recordings from end-user video-conference software to ensure ecological validity and realism of quality degradation. To ensure consistency among different clips of the same quality level, each video has been scored using VMAF and POLQA and selected to match predefined selection criteria. To analyze the perceived quality of the clips, we conducted a user study (N=25) and evaluated perceived quality, interest in the stories, and speaker engagement. Results validate the consistency of the quality levels of the video clips. Apart from a detailed description of the methodological approach, we contribute the entire stimuli dataset containing 160 videos and all rating scores for each file.


Go to Then select “Download as zip” in the “Files” container.


CC-By Attribution 4.0 International

References and Citation

Please, cite the following paper, if you use this database: [STD22]


  • STD22: Spang, R.P., Voigt-Antons, J. N. & Möller, S. (2022, September). The Storytime Dataset: Simulated Videotelephony Clips for Quality Perception Research. In 2022 Fourteenth International Conference on Quality of Multimedia Experience (QoMEX) (pp. 1-6). IEEE.