NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)

Overview of the Second Text REtrieval Conference (TREC-2)

D. K. Harman
National Institute of Standards and Technology

Table 1. Document Statistics

(disk3)    90,257    78,325    161,021    6,711

Median number of terms per record
(disk1)    182    353    181    313     82
(disk2)    218    346    167    315
(disk3)    279    358    119    2896

Average number of terms per record
(disk1)    329    375    412    1017    89
(disk2)    377    370    394    1073
(disk3)    337    379    263    3543

<smry> Summary:
Document will identify a type of natural language processing technology which is being developed or marketed in the U.S.
<narr> Narrative:
A relevant document will identify a company or institution developing or marketing a natural language processing technology, identify the technology, and identify one or more features of the company's product.
<con> Concept(s):
1. natural language processing
2. translation, language, dictionary, font
3. software applications
<fac> Factor(s):
<nat> Nationality: U.S.
</fac>
<def> Definition(s):
</top>

Each topic is formatted in the same standard method to allow easier automatic construction of queries. Besides a beginning and an end marker, each topic has a number, a short title, a one-sentence description, and a summary sentence or two that can be used as a surrogate for the full topic (often very similar to the one-sentence description). There is a narrative section which is aimed at providing a complete description of document relevance for the assessors. Each topic also has a concepts section with a list of assorted concepts related to the topic. This section is designed to provide a mini-knowledge base about a topic such as a real searcher might possess. Additionally each topic can have a definitions section and/or a factors section. The definitions section has one or two of the definitions critical to a human understanding of the topic.
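As a rough illustration of how this tagged layout lends itself to automatic query construction, the following Python sketch splits the topic fragment shown above into its fields and flattens the concepts list into candidate query terms. It is a minimal sketch under stated assumptions, not a method used by any TREC participant: the function names `parse_topic` and `concept_terms` are hypothetical, and the code assumes that tags are lowercase and that each field begins with a label ending in a colon, as in the fragment.

```python
import re

# Text of the topic fragment shown above. The fragment starts at <smry>,
# so the earlier fields (number, title, description) are not available here.
SAMPLE_TOPIC = """<smry> Summary: Document will identify a type of natural language
processing technology which is being developed or marketed in the U.S.
<narr> Narrative: A relevant document will identify a company or institution
developing or marketing a natural language processing technology, identify
the technology, and identify one or more features of the company's product.
<con> Concept(s): 1. natural language processing 2. translation, language,
dictionary, font 3. software applications
<fac> Factor(s): <nat> Nationality: U.S. </fac>
<def> Definition(s):
</top>"""

# Matches an opening tag such as <smry> or a closing tag such as </fac>.
# Assumption: tag names are lowercase letters only.
TAG = re.compile(r"<(/?)([a-z]+)>")


def parse_topic(text):
    """Split a topic into {tag: field text}. Hypothetical helper."""
    fields = {}
    matches = list(TAG.finditer(text))
    for i, m in enumerate(matches):
        if m.group(1):  # closing tag: carries no field text of its own
            continue
        # A field runs from its opening tag to the next tag of any kind.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        body = text[m.end():end]
        # Drop the human-readable label ("Summary:", "Concept(s):", ...).
        body = re.sub(r"^\s*[A-Za-z()]+:", "", body)
        fields[m.group(2)] = " ".join(body.split())
    return fields


def concept_terms(fields):
    """Flatten the numbered concepts list into individual terms."""
    groups = re.split(r"\s*\d+\.\s*", fields.get("con", ""))
    return [t.strip() for g in groups for t in g.split(",") if t.strip()]


if __name__ == "__main__":
    topic = parse_topic(SAMPLE_TOPIC)
    print(topic["nat"])
    print(concept_terms(topic))
```

A parser along these lines treats the concepts section as a ready-made term list, while the summary and narrative fields would still need ordinary text processing before they could be used in a query.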
The factors section is included to allow easier automatic query building by listing specific items from the narrative that constrain the documents that are relevant. Two particular factors were used in the TREC-2 topics: a time factor (current, before a given date, etc.) and a nationality factor (either involving only certain countries or excluding certain countries).

While the TREC topics did not present a problem in scaling, either automatically constructing a query, or manually constructing a query with little foreknowledge of its searching capability, was a major challenge for TREC participants. In addition to filtering the relatively large amount of information provided in the topics into queries, the sometimes narrow definition of relevance as stated in the narrative was difficult for most systems to handle.

3.4 The Relevance Judgments

The relevance judgments are of critical importance to a test collection. For each topic it is necessary to compile a list of relevant documents; hopefully as comprehensive a list as possible. For the TREC task, three possible