SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
National Institute of Standards and Technology
D. K. Harman
[...] COMPARISON
...ust need something, because other systems do better! We've not done much with phrases: it seems likely that extensive use of phrases would help. The sy...
...m already. What we need is a good method of phrase discovery. We were attracted by the idea of treating paragraphs as documents this time, but didn't...
Needs more elaborate database model.
... are more directly incorporating term weighting into our system. Better query construction, evaluation, and refinement tools are under development. We ...
incorporate several ideas developed in this exercise into automatic query generation and refinement tools. Incorporation of these ideas should improve the
performance of our system. Some specific improvements include:
Better integration of term weights.
Better tools for initial query construction.
Better stemming and stop-word elimination.
Evaluation of search term independence.
Better document similarity metrics.
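
Three of the improvements listed above (term weights, stop-word elimination, and document similarity metrics) can be illustrated with a small sketch. This is not the system described in these responses. It assumes a textbook tf-idf weighting and cosine similarity, and the stop-word list and all function names are hypothetical, chosen only for illustration. Stemming is left out.

```python
import math
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "of", "and", "to", "in", "is"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop stop words."""
    words = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return [w for w in words if w and w not in STOP_WORDS]


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    return [
        {t: tf * math.log(n / df[t]) for t, tf in Counter(toks).items()}
        for toks in tokenized
    ]


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    docs = ["the cat sat", "the cat ran", "dogs bark loudly"]
    vecs = tfidf_vectors(docs)
    print(cosine(vecs[0], vecs[1]))  # shares the term "cat"
    print(cosine(vecs[0], vecs[2]))  # no shared terms
```

With this weighting, a term that occurs in every document gets weight zero, so it contributes nothing to similarity. That is one simple way term weights and the similarity metric interact.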