NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report

MONO91 NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report Problems of Evaluation chapter Mary Elizabeth Stevens National Bureau of Standards that subject indexes are not the most important source, nor even a major source. Herner found, for example, that only about 16 p ercent of his respondents reported use of indexes and abstracts as primary tools in literature searches. He reports, for the use of tools in becoming aware of current sources of information, 477 of 3832 responses indicating the use of indexing and abstracting publications as against 486 using footnotes or other cited references, 1/ 291 using library acquisition lists, and 212 using separate bibliographies (Herner, 1958 [265]). These data, and similar findings of [OCRerr]ishendon that 17 percent of scientists queried considered the scanning of titles in accession lists and announcement bulletins a principal means to find information of interest, 2/ suggest that KWIC type indexes may be adequate for many purposes. On the other hand, the KWIC index to the U.S. Government Research Reports made available to the public on an experimental basis through the Office ot Technical Services was discontinued after a year of subsidized operation because too few of the users indicated willingness to pay a fee in order to have the service continued on a subscription basis. The evaluational problem here involves the lack of information on indexing costs, the relatively few quantitative and objectively validated studies that have been made of user needs, the question of whether what the user says he does or wants is what he really wants or does, and the matter of defining `1interest1 for different users with differing purposes and requirements. The concept of I1interest1.! is taken to mean the motivations of a particular user or group of users at a particular time, while the equally imprecise notion of `1relevance1 refers to the value judgments made by the user as to the relation of an item to his query or interest. A final core problem, then, is that of the question of relevancy itself, involving recognition that tirelevancy is a comparative rather than a qualitative concept . . . (and) ... that a doc[OCRerr]mment of little relevancy in the eyes of X might well be highly relevant in the eyes of Y. [OCRerr]/ Mooers states, similarly, that: "There is no absolute `Relevance' of a document. It depends upon the person and his background, the work and the date. What is not relevant today may be 4/ relevant tomorrow. - Good discusses various possible measures of `relevance' - logical measures, frequency measures, references to, citations of, interest measures, linguistic measures _ 1/ 2/ 3/ 4/ 5/ Note that Herner's data and those of Glass and Norwood, 1958 [232], reporting 6.9 percent use of cross-citations in another paper as the method of learning of important work as against 1.2 percent using an indexing service, appear to re-enforce the claims of those who advocate citation indexing. Fishenden, 1958[197], p. 163. Bar-Hillel, 1959 [33], p. 4-8.4. Mooers, 1963 [423], p. 2. Good, 1958 [234], pp. 7-9. 147