MONO91
NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report
Problems of Evaluation
chapter
Mary Elizabeth Stevens
National Bureau of Standards
At AEC, 96 items were re-indexed to the subject heading scheme used in Nuclear
Science Abstracts. There had been 249 headings assigned to these items originally and
406 were assigned on the second run, for an overall consistency rate of 54 percent, but
with 53 percent of the headings used the second time not having been used the first. The
sample checked at OTS consisted of 32 items to which 346 descriptors had been assigned
the first time and 418 the second. The consistency was 65 percent with respect to the first
run and 54 percent with respect to the second. Finally, at the National Agriculture Library
99 items were checked, with results showing a high consistency rating and a similarity of
indexing between the two runs of 86 percent. Painter concludes:
"The consistency rates are not encouraging. Apparently there is little difference
between preparation for a manual system and that for a machine system. The per-
centages indicate that there is no significant difference between consistency where
two or three headings are assigned and where twelve or sixteen are assigned.
Therefore, we are left with the fact that regardless of these variables, consistency
rates range between 60 and 72 per cent." 1/
Jacoby and Slamecka report even less encouraging data (1962 [293]). "In general,
the inter-indexer reliability was found to be low (in the vicinity of 20 per cent), the intra-
indexer reliability somewhat higher (about 50 per cent)." For a series of tests of indexing
of a group of chemical patents by three experienced and three inexperienced indexers, they
found that the beginners had average matchings among the terms assigned by them to the
same documents of only 12.6 percent and that even for the experienced indexers the
average percent of matching terms was only 16.3 percent. 2/ In other studies, these in-
vestigators have explored the effects of various indexing aids upon the reliability and
consistency of indexing, concluding that the use of prescriptive aids such as authority lists
improves reliability and inter-indexer consistency from 8 or 9 percent to 33 percent, while
those aids such as thesauri and association lists "which enlarge the indexer's semantic
freedom of term choice" are detrimental (Slamecka and Jacoby, 1963 [560]).
Rodgers in a study of intra-indexer consistency reports data for the re[OCRerr]indexing, by
the same person at a later date, of 60 documents dealing with the United Arab Republic
taken from The New York Times. She reports that the average consistency over all 60
documents was 59 percent. 3/ In a further study of inter-indexer consistency, 20 papers
from Area 5, ICSI, were key-word indexed by 16 people all of whom were familiar with
the subject matter, (although only 8 completed all 20 papers). Results are given in terms
of the proportions of the total number of unique words chosen by 100 percent of the subjects
(.008) half of them (.14) and only one of them (.52). 4/ Study of the results in terms of
the proportion of words selected in common by any pair of these indexers to the total
number of different words selected by them both gave a "grand mean agreement for all
two-person combinations for the 8 subjects... [of].. 24 percent against all 20 articles. "5/
The mean percentage of overlap between Luhn's word-frequency selection technique (as
applied to the same papers) and any one or more indexers who agreed was .15.
1/ Painter, 1963 [460], p. 94
2/ Jacoby and Slamecka, 1962 [293], p. 16.
3/ Rodgers, 1961 [504], p. 12.
4/ Rodgers, 1961 [503], p. 50.
5/ Greer, 1963 [239], p. 10.
159