SP500215 NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2) Appendix B: System Features Appendix National Institute of Standards and Technology D. K. Harman C. CONSTRUCTION OF INDICES, KNOWLEDGE BASES, AND OTHER DATA STRUCTURES -- DATA BUILT FROM OTHER SOURC [OCRerr]ewhat angled towards American data, but mostly very general. itains synonym Classes (e.g. [child, children]), go phrases (e.g. Des Moines), stopwords and semi[OCRerr]opwords. recorded. Because of disk shortage, System 1 included a number of additional stopwords suggested by high frequencies in the [OCRerr]IREC data. e manually-built file that contains common business acronyms and abbreviations and abbreviations of organizations. The business acronyms were compiled of organizations is based on entries from the files "un.txt" and "organizations.txt" that are available from Project Gutenberg (the [OCRerr].txt files were made availab ice Croft, UMASS).