SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman
DTES:
Average time to build query (minutes): 11[OCRerr]5 for combined query (S2[OCRerr] for the five individual query formulations, and 60 for translating them into INQ
format). These correspond to 10.5 and 12 minutes, respectively, for each individual query formulation.
Queries had manually chosen synsets added to the input text. Ml processing of the text was automatic.
Entire topic statement read when selecting synsets, Concepts (<con>), Description (<desc>), Factors (<fac>), Narrative (<narr>), Nationality ([OCRerr]
Title (<title>) used in automatic part.
5 - 10 minutes a query to select the synonym sets to add an average of 1.2 seconds to automatically process the topic text (1 minute for 50 queries [thei
other jobs on the machine!]).
Title, Description, Concepts, Factors, Narrative.
Addition and deletion of terms selected from the narrative field.
Three sets of queries were constructed: one pnorm boolean query set and two vector query sets, one longer than the other. They are called pnorm, lon[OCRerr]
and short vector queries below.
Ml query sets: title, description, concepts pnorm and long vector: Narrative; long and short vector: Defmitions.
Domain knowledge of computer system expert, limited use to compensate for omissions in topic descriptions.
0] Boolean operators were assigned equal weights (P-values) for the pnorm queries, P-values of 1.0, 1.5 and 2.0 were used for different evaluations of the C