SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman
HL SEARCHING (CON'1)
em occurrence frequencies in titles were doubled in some collections.
sriables used were: optimally relativized query and document stem frequencies, global relative frequency of stem in all document texts, 2nd routing run: stem-1.
nk type weight that reflects relative importance of different lexical relations; set to 0.5 for these experiments.
sofar as terms, closely related to selected synonym sets are added to query.
[OCRerr]ne normalization of weights in both documents and queries.
sed subjectively when choosing synsets to add.
[utual Information Measure determines how phrases are evaluated, which indirectly affects the rank.
[OCRerr] use maximum Th though, which seems to be correlated with document length.
[odified per SMART ann ranking given above.
idirectly via use of maximum term frequency.