IR4873
NIST Interagency Report 4873: Automatic Indexing
Automatic Indexing
chapter
Donna Harman
National Institute of Standards and Technology
13
Harman D.(199[OCRerr]). Relevance Feedback Revisited. In: Proceedings of the 15th International Conference on
Research and Development in Information Retrieval; June 1991, 1-10; Copenhagen, Denmark.
Harman D. and Candela G. (1990). Retrieving Records from a Gigabyte of Text on a Minicomputer using Sta-
tistical Ranking, Journal of the American Society for I[OCRerr]ormadon Science, 41(8), 581-589.
Katzer J., M[OCRerr]ill MI., Tessier J.A., Frakes W., and DasGupta P. (1982). A Study of the Overlap among Ioocu-
ment Representations. J[OCRerr]ormation Technology: Research and Development, 1(2), 261-274.
Lewis D.D. (1992). Feature Selection and Feature Extraction for Text Categorization. Paper to appear in the
Proceedings of the 5th DARPA Workshop on Speech and Natural Language, Harriman, N.Y.
Lewis D.D. and Croft W.B. (1990). Term Clustering of Syntactic Phrases, In: Procelings of the 13th Interna-
tional Conference on Research and Development in Information Retrieval; September 1990; 385-504; Brussels,
Belgium.
Lennon M., Peirce D., Tarry B., Willett P. (1981). An Evaluation of Some Conflation Algorithms for Informa-
tion Retrieval, Journal of Information Science, 3, 177-188.
Lovins J.B. (1%8). Development of a Stemming Algorithm, Mechanical Translation and Computational
Linguistics, 11, 22-31.
Luhn HY. (1957). A Statistical Approach to Mechanized Encoding and Searching of Literary Information, IBM
Journal of Research and Development, 1(4), 309-317.
Pacak M.G. and Pratt A.W. (1978). Identification and Transformation of Terminal Morphemes in Medical
English, Part II, Methods of I[OCRerr]ormation in Medicine, 17, 95-100.
Porter M.F. (1980). An Algorithm for Suffix Stripping, Program, 14(3),130-137.
Salton G. and Buckley C. (1988). Term-Weighting Approaches in Automatic Text Retrieval. Information Pro-
cessing and Management, 24(5), 513-523.
Salton G., and Buckley C. (1989). A Comparison between Statistically and Syntactically Generated Term
Phrases. Technical Report TR 89-1027, Cornell University: Computing Science DepartmenL
Salton G. and Buckley C. (1990a). Improving Retrieval Performance by Relevance Feedback. Journal of the
American Society for Information Science, 41(4), 288-297.
Salton G., and Buckley C. (199Od). An Evaluation of Text Matching Systems for Text Excerpts of Varying
Scope. Technical Report TR 89-1027, Cornell University: Computing Science Department.
Salton G. and Buckley C. (1991). Automatic Text Structuring and Retrieval: Experiments in Automatic Encyclo-
pedia Searching. In: Proceedings of the 14th International Conference on Research and Development in Infor-
mation Retrieval; October 1991, 21-31; Chicago, illinois.
Salton G., Buckley C., and Smith M. (1990b) On the Application of Syntactic Methodologies in Automatic Text
Analysis, Information Processing and Management, 26(1), 73-92.
Salton G., Zhao Z. and Buckley C. (199Oc). A Simple Syntactic Approach for the Generation of Indexing
Phrases Technical Report TR 90-1137, Cornell University: Computing Science Department.
Salton 0. and McGill M. (1983). Introduction to Modern Information Retrieval. New York, NY.: McGraw-Hill.
Sparck Jones K. (1972). A Statistical Interpretation of Term Specificity and Its Application in Retrieval. Journal
of Documentation, 28(1), 11-20.