File(type)s: - "collection.xml": video id is linked to the video file name and its use for development or test. 2. Jul 2003 8 files were withdrawn from the test collection due to significant encoding problems over a large portion of each video. Such videos are marked "withdrawn.from.test" - "xxx.mp7.xml" : Mpeg-7 output, where "xxx" is the video id instead of the video file name. A few small points that are related to the Mpeg-7 output: - The media time format is based on the Gregorian day time (ISO 8601) norm. Fractions are defined by counting pre-sepcified fractions of a second. In our case, the frame rate is 29.97. One fraction of a second is thus specified as "PT1001N30000F". - Trec video id has the format of "XXX" and shot id "shotXXX_YYY". The "XXX" is the sequence number of video onto which the video file name is mapped, this is based on the "collection.xml" file. The "YYY" is the sequence number of the shot. - There are two types of keyframes : one "representative keyframe" (with "RKF" in the subshot name) corresponding to the main (or the single) subshot within the shot, zero, one or more additional "non representative keyframe" (with "NRKF" in the shot name) corresponding to other subshots (if any) within the shot. Subshots within a shots correspond to extracted shots merged together in order to enforce the 2s minimum duration of a shot. Information about the segmentation: - The segmentation was performed using an improved version of the the system used by CLIPS-IMAG for the TREC 2002 shot boundary determination task. - The segmentation has been done so that no shot has a duration of less than 2 second (or 60 frames) so that assessors can confidently evaluate them. In order to satisfy this constraint, short shots have been merged with their neighbors. Therefore, many shots (roughly 20% I would say) actually contain several subshots. When a shot contains several subshots (and we know that because we know the segmentation before and after the merging od short shots), the corresponding keyframe is always chosen within the longest subshot. - The segmentation is such that there are no gaps between shots and no shots overlap. When a gradual transition is detected, the end and the begining of the previous and next shots are set to the middle point of the transition (i.e. shots begins and ends in the middle of the transitions). Again, although the transition boundary information is lost in the final segmentation, it it used for keyframe extraction and the keyframe is never within a portion of a (detected) gradual transition.