— TRECVID 2018 guidelines

Social-media video storytelling linking

Task coordinators: Joao Magalhaes, David Semedo, Saverio Blasi

Introduction

The social-media video storytelling linking task seeks to advance the area of visual summarization, with collaborative videos, images and texts available from professional media and social-media users. The challenges in creating such visual timelines are various: alignment of videos and images through sound or timestamps to smooth transitions or remove duplicates, detection of video intervals of high interest, caption generation for a group of pictures, among others.

Task

A news story topic is an actual news narrative and the news segments correspond to particular sentences of the news, that a journalist may wish to illustrate. For each story segment (a sentence query with some a strong visual component), systems should retrieve the video and image that satisfy the two requirements:

Best illustrates the news segment;
Makes the best transition from the previous video/image illustration.

For a more detailed guidelines please refer to this document.

Working example

Data

Edinburgh Festival : Consists of a celebration of the performing arts, gathering dance, opera, music and theatre performers from all over the world. The event takes place in Edinburgh, Scotland and has a duration of 3 weeks in August.

Le Tour de France : Consists of one of the main road cycling race competitions. The event takes place in France (day 1-8, 11-17, 20-23), Spain (day 9), Andorra (day 9-11), Switzerland (day 17-19), and has a duration of 23 days in July.

Twitter images and videos:

Edinburgh Festival: over 32k images and 6.2k videos;
Le Tour de France: over 66k images and 19k videos.

Flickr images:

Edinburgh Festival: over 10k images;
Le Tour de France: over 11k images.

Please see the LNK task website for data download instructions.

Evaluation

Relevance of the segment illustration (blue links in the above figure):

s_i=0: the image or video illustration is not relevant to the story segment.
s_i=1: the image or video illustration is relevant to the story segment.

Consistency of illustration transitions (red links in the above figure):

t_i=0: there is no relation between the segment illustrations.
t_i=1: there is a semantic (and visual) relation between the two segments.

this link

Submissions

There will be a total of 30 story topics, organized into 3-5 visual story segments each.
Each run is composed of a sequence of videos/images intended to illustrate the sequence of story segments.
Teams may submit up to 5 runs. Hence, each team can create up to 5 alternative visual summaries for each story.
Runs are due on August 26 and should be submitted via NIST password protected submission webpage.