MONO91
NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report
Problems of Evaluation
chapter
Mary Elizabeth Stevens
National Bureau of Standards
be en to study an existing system (e.g., using Merck, Sharp and Dohme data) with respect
to indexing terms such as "penicillin," "toxicity," and "mode 0£ action." He then
attempts to define various possible machine assignment rules, and then to determine the
probable over-and-under assignments that would result from the application of these
rules.
Typical results pertinent to both questions of word-indexing evaluation and of inter-
indexer consistency showed that for 23 documents indexed under the term "toxicity," 11
did not contain the stem "toxi. . ." at all; that 17 items indexed under "penicillin" contained
the word at least once; that none of 34 randomly selected documents not indexed under
"penicillin" contained the word, but that 7 of 28 items not so indexed but selected as
probable candidates from title and other clues did contain the word. (O'Connor, 1961
[447])
Typical suggestions, comments, and conclusions made by O'Connor include the
following.'
"It might be required that the mechanized indexing permit as good (or no worse)
retrieval as existing human indexing, because it is desired to free the subject-
skilled indexing personnel for other work. Or poorer retrieval (than possible
with human indexing such as is presently done of comparable material) might be
accepted from computer indexing, because poorer retrieval is better than none 11
and there is a shortage of subject-skilled people to do the additional indexing." -
"Such considerations as the following are relevant. Over-assigning can increase
input costs and storage (to an extent dependent on the storage system), but
mechanizing indexing might be worth the cost. Over-assigning might also
increase the number of irrelevant documents retrieved, but the increase might
be insignificant," 2/
Suppose terms A, B, and C each correctly characterize five percent of a
ten thousand document collection, each term is overassigned to another five
percent, and over-assignment of each term occurs independently of the correct
assigning and over[OCRerr]assigning of the others. Then about nine documents will be
extra for the search question A & B & C." 3/
"The question of permitting some under-assigning, that is, the computer failing
to assign E a term] T to some document which should have it, is more delicate.
Human indexers sometimes underassign. If we knew the rate of ounderassigning
by human indexers for a term T, we might consider allowing the computer a
similar rate. However, some cases of underassigning might be more important
than others and if the computer made more important mistakes than the human
indexers, retrieval might not be `good enough'." 4/
1/
2/
3/
4/
O'Connor, 1960E444], p.3.
O'Connor, 1961 [448], p. 199.
O'Connor, 1960 [444], p. 6.
Ibid, pp. 6-7.
l[OCRerr]2