SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman
IA. CONSTRUCTION OF INDICES, KNOWLEDGE BASES, AND OTHER DATA STRUCTURES -- METHODS USED
SYSTEM NAME | c[OCRerr] | ERIM | mic | CUIRI | UCLA | SEC | [OCRerr]fflfl
Stopword List Yes No Yes No Yes No Yes
658 stopwords, 438 stop words,
Number of Words 284 semi-stopwords [1] 58 semi-stopwords 370 stopwords [5] 250
Controlled Vocabulary No No No No No No No
Stemming Yes No Yes Not currently Yes
Based on Based on Porter Suffix stripping
Standard Algorithms Porter [2] Lovins [2] (Porter, 1980)
Morphological analysis Not yet No
Term Weighting No No Yes IDF No Yes Yes [7]
No, a few Yes No
Phrase Discovery phrases are
recognized No Yes No Yes [6]
Kind of phrase Adjacent words Noun
Statistical
Statistical Methods tagging
Syntactic Methods Yes
Syntactic parsing No No No No No No
Word Sense Disambiguation No No No No None No
Heuristic Associations No No No No No
Definition
Spelling Checking
(with Manual Correction) No No No No No No
Spelling Correction No No No No No No
Proper Noun Identification No No No No Yes No
Tokenizer Not used [3] No No Not used [3] No No
Patterns? Date ranges Date ranges
Manually Indexed Terms No No No No No No
Other Techniques to Build Data
Structures No [4] Yes [8]