![]() Understanding Conferences |
![]() |
D U C 2 0 0 6: Task, Documents, and MeasuresThe system task for DUC 2006 will essentially be the same as the 2005 task and will model real-world complex question answering, in which a question cannot be answered by simply stating a name, date, quantity, etc. Given a topic and a set of 25 relevant documents, the task is to synthesize a fluent, well-organized 250-word summary of the documents that answers the question(s) in the topic statement. Successful performance on the task will benefit from a combination of IR and NLP capabilities, including passage retrieval, compression, and generation of fluent text. Documents for summarizationNIST assessors will develop topics of interest to them. The assessor will create a topic and choose a set of 25 documents relevant to the topic. These documents will form the document cluster for that topic. The documents will come from the AQUAINT corpus, comprising newswire articles from the Associated Press and New York Times (1998-2000) and Xinhua News Agency (1996-2000). The corpus has the following DTD:
Reference summariesEach topic and its document cluster will be given to 4 different NIST assessors, including the developer of the topic. The assessor will create a ~250-word summary of the document cluster that satisfies the information need expressed in the topic. These multiple references summaries will be used in the evaluation of summary content. System taskSystem task: Given a DUC topic and a set of 25 documents relevant to the topic, create from the documents a brief, well-organized, fluent summary which answers the need for information expressed in the topic. The summary can be no longer than 250 words (whitespace-delimited tokens). Summaries over the size limit will be truncated. No bonus will be given for creating a shorter summary. No specific formatting other than linear is allowed. Each group can submit one set of results, i.e., one summary for each topic/cluster. Participating groups should be able to evaluate additional results themselves using ISI's ROUGE/BE package. EvaluationAll summaries will first be truncated to 250 words. Where sentences need to be identified for automatic evaluation, NIST will then use a simple Perl script for sentence segmentation.
Tools for DUC 2006
DUC Workshop Papers and PresentationsEach participant in the system task may submit a paper describing their system architecture, results, and analysis; these papers will be published in the DUC 2006 Workshop Proceedings. Participants who would like to give oral presentations of their papers at the workshop should submit a presentation proposal in May 2006, and NIST will select the groups who will present at the workshop. |
For
data, past results, mailing list or other general information
contact:
Lori
Buckland ([email protected])
For
other questions contact: Hoa
Dang (hoa.dang AT nist.gov)
Last
updated: Monday, 05-Dec-2005 20:54:56 EST
Date
created: Wednesday, 24-November-05