SP500215 NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2) Probabilistic Learning Approaches for Indexing and Retrieval with the TREC-2 Collection chapter N. Fuhr U. Pfeifer C. Bremkamp M. Pollmann National Institute of Standards and Technology D. K. Harman 1 0.9- `dortVl" [OCRerr] "dortPl" I 0.8[OCRerr] 0.7- 0.6- Precision 0.5- 0.4- 0.3- 0.2- 0.1- 0 I i I I I I I I I I 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Recall Figure 2: Recall-precision curves of routing runs and the RPI retrieval function yields Q(qk,dm)= [OCRerr] log(cikuim+1). tIEq[OCRerr]T 4.2 Experiments In principle, the RPI formula can be applied with or without query expansion. For our experiments in TRECi, we did not use any query expansion. The final results showed that this was reasonable, mainly with respect to the small amount of relevance feed- back data available. In contrast, for TREC2 there were about 2000 relevance judgements per query, so there was clearly enough training data for applying query expan- sion methods. As basic criterion for selecting the expansion terms, we considered the number of relevant documents in which a term ocurs, which gave us a ranking of candidates; docu- ment indexing weights were considered for tie-breaking. Then we varied the the number of terms which are added to the original query. expansion result 0 0.2909 10 0.3047 30 0.3035 50 0.3002 100 0.2832 Table 8: Effect of number of expansion terms In a first series of experiments, we considered single word only. We used Q2/D1 (lsp document indexing) as training sample and Q2/D2 (ltc indexing) as test sample. As can be seen from table 8, query expansion clearly improves retrieval quality, but only for a limited (7) number of expansion terms. For larger numbers, we get worse results. This effect seems to be due to parameter estimation problems. expansion phraseweight single w. phrases result 0.5 0 0 0.3476 0.5 20 0 0.3713 1.0 20 0 0.3730 0.5 20 10 0.3728 1.0 20 10 0.3605 0.5 30 10 0.3729 1.0 30 10 0.3626 Table 9: Query expansion with phrases In a second series of experiments, we looked at the combination of single words and phrases. These ex- periments were performed as retrospective runs, with Q2/D12 as training sample and Q2/D2 as test sample (both with ltc document indexing). For the number of expansion terms, we treated single words and phrases separately. Furthermore, similar to the adhoc runs, we used an additional factor for downweighting the query term weights of phrases. The different parameter com- binations tested and the corresponding results are given in table 9. Obviously, phrases as expansion terms gave no improvement, so we decided to have only single words as expansion terms (but the phrases from the original query still are used for retrieval). Furthermore, the re- trieval quality reaches its optimum at about 20 terms. 72