NIST Interagency Report 4873: Automatic Indexing
Donna Harman, National Institute of Standards and Technology

Harman D. (1992). Relevance Feedback Revisited. In: Proceedings of the 15th International Conference on Research and Development in Information Retrieval; June 1992, 1-10; Copenhagen, Denmark.

Harman D. and Candela G. (1990). Retrieving Records from a Gigabyte of Text on a Minicomputer using Statistical Ranking. Journal of the American Society for Information Science, 41(8), 581-589.

Katzer J., McGill M.J., Tessier J.A., Frakes W., and DasGupta P. (1982). A Study of the Overlap among Document Representations. Information Technology: Research and Development, 1(2), 261-274.

Lewis D.D. (1992). Feature Selection and Feature Extraction for Text Categorization. Paper to appear in the Proceedings of the 5th DARPA Workshop on Speech and Natural Language, Harriman, N.Y.

Lewis D.D. and Croft W.B. (1990). Term Clustering of Syntactic Phrases. In: Proceedings of the 13th International Conference on Research and Development in Information Retrieval; September 1990, 385-404; Brussels, Belgium.

Lennon M., Peirce D., Tarry B., and Willett P. (1981). An Evaluation of Some Conflation Algorithms for Information Retrieval. Journal of Information Science, 3, 177-188.

Lovins J.B. (1968). Development of a Stemming Algorithm. Mechanical Translation and Computational Linguistics, 11, 22-31.

Luhn H.P. (1957). A Statistical Approach to Mechanized Encoding and Searching of Literary Information. IBM Journal of Research and Development, 1(4), 309-317.

Pacak M.G. and Pratt A.W. (1978). Identification and Transformation of Terminal Morphemes in Medical English, Part II. Methods of Information in Medicine, 17, 95-100.

Porter M.F. (1980). An Algorithm for Suffix Stripping. Program, 14(3), 130-137.

Salton G. and Buckley C. (1988). Term-Weighting Approaches in Automatic Text Retrieval. Information Processing and Management, 24(5), 513-523.

Salton G. and Buckley C. (1989). A Comparison between Statistically and Syntactically Generated Term Phrases. Technical Report TR 89-1027, Cornell University: Computing Science Department.

Salton G. and Buckley C. (1990a). Improving Retrieval Performance by Relevance Feedback. Journal of the American Society for Information Science, 41(4), 288-297.

Salton G., Buckley C., and Smith M. (1990b). On the Application of Syntactic Methodologies in Automatic Text Analysis. Information Processing and Management, 26(1), 73-92.

Salton G., Zhao Z., and Buckley C. (1990c). A Simple Syntactic Approach for the Generation of Indexing Phrases. Technical Report TR 90-1137, Cornell University: Computing Science Department.

Salton G. and Buckley C. (1990d). An Evaluation of Text Matching Systems for Text Excerpts of Varying Scope. Technical Report TR 89-1027, Cornell University: Computing Science Department.

Salton G. and Buckley C. (1991). Automatic Text Structuring and Retrieval: Experiments in Automatic Encyclopedia Searching. In: Proceedings of the 14th International Conference on Research and Development in Information Retrieval; October 1991, 21-31; Chicago, Illinois.

Salton G. and McGill M. (1983). Introduction to Modern Information Retrieval. New York, NY: McGraw-Hill.

Sparck Jones K. (1972). A Statistical Interpretation of Term Specificity and Its Application in Retrieval. Journal of Documentation, 28(1), 11-20.