SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman

IA. CONSTRUCTION OF INDICES, KNOWLEDGE BASES, AND OTHER DATA STRUCTURES -- METHODS USED

SYSTEM NAME: ADS; UIC [2]; MEAD; UIF; IDSR

Stopword List: Yes; Yes; No; Yes; Yes
  - Number of Words: 421; 632; None; 166 [6]; 399
Controlled Vocabulary: No; No; No; No; No
Stemming: Yes (adsl); Yes; No; Yes, but not used; Yes
  - Standard Algorithms: Paice, 1990; Extracted from SMART, coded in SPITBOL; None; Lovins, modified; Paice
  - Morphological Analysis: No; No; No; None; No
Term weighting: No; IDF [3]; No; tf * idf; Yes; Yes
Phrase Discovery: No; Yes; No; No; No
  - Kind of phrase: ____; [4]; None; ____; ____; ____
  - Statistical Methods: Statistics on word pairs computed & used; None
  - Syntactic Methods: No; None; ____; ____
Syntactic parsing: No; No; None; No; No
Word Sense Disambiguation: No; No; No; Yes [7]; No; ____
Heuristic Associations: No; No; No; No
  - Short Definition: word co-occurrences; None
Spell Checking (With Manual Correction): No; No; No; No; No
Spelling Correction: No; No; No; No; No
Proper Noun Identification: No; No; No; No; No
Tokenizer: No; No; No; No; No
  - Which Patterns:
Use of Manually Indexed Terms: No; No; No; No; No
Other Techniques to Build Data Structures: Yes [1]; List of offsets [5]; None