SP500207 NIST Special Publication 500-207: The First Text REtrieval Conference (TREC-1) OCLC Online Computer Library Center, Inc. chapter R. Thompson National Institute of Standards and Technology Donna K. Harman Num~queries: Total number of Retrieved: Relevant: Rel_ret: Trunc_ret: Recall Precis at 0.00 at 0.10 at 0.20 at 0.30 at 0.40 at 0.50 at 0.60 at 0.70 at 0.80 at 0.90 at 1.00 Average precision 11-pt Avg: Average precision 3-pt Avg: Table III 25 documents over all queries 5000 245 166 4432 ion Averages: 0.3590 0.3031 0.2118 0.1641 0.1444 0.1269 0.0946 0.0819 0.0599 0.0322 0.0292 for all points 0.1461 for 3 intermediate points (0.20, 0.50, 0.80) 0.1329 Precision Results for Eleven Levels of Recall with FASIT Indexing Num_queries: 25 Total number of documents over all queries Retrieved: 5000 Relevant: 245 Rel_ret: 164 Trunc_ret: 4382 Recall - Precision Averages: at 0.00 0.3867 at 0.10 0.3092 at 0.20 0.2369 at 0.30 0.1918 at 0.40 0.1449 at 0.50 0.1034 at 0.60 0.0816 at 0.70 0.0666 at 0.80 0.0501 at 0.90 0.0458 at 1.00 0.0337 Average precision for all points 11-pt Avg: 0.1501 Average precision for 3 intermediate points (0.20, 0.50, 0.80) 3-pt Avg: 0.1301 The higher precision scores at the lowest levels of recall are a replication of the experiments with FASIT reported in Dillon and McDonald (1983) and Burgin and Dillon (1992), and support the hypothesis that phrase indexing serves primarily as a precision enhancer. In the higher levels of recall, the effects of phrase indexing and stem indexing are essentially the same, indicating that retrieval performance is not jeopardized by the reduced document representation. We expect larger gains in precision with phrase indexing as we fine-tune our phrase-extraction algorithms. References Burgin and Dillon, M (1992). Improving Disambiguation in FASIT. 191