MONO91
NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report
Automatic Indexing
chapter
Mary Elizabeth Stevens
National Bureau of Standards
Application 0£ the indexing principle by use of clerical procedures that today can be
accomplished by machine was suggested a little more than a century ago. A British
librarian, Andreas Crestadoro, advocated the permutation of the words in titles in 1856,
claiming that thus the subject matter index would follow the author's own definition of the
contents of his book. He prepared such "concordances of titles" for several different
library collections. 1/
Within a generation, punched card machines had been invented, but they were not to
be used for library and documentation purposes for some decades yet. 2/ Keppel,
writing in 1937 of his vision of the library 21 years in the future, says:
"When it comes to using the cards, I blush to think for how many years we watched
the so-called business machines juggle with payrolls and bank books before it
occurred to us that they might be adapted to dealing with library cards with equal
dexterity. Indexing has become an entirely new art. The modern index is no
longer bound up in the volume, but remains on cards, and the modern version of
the Hollerith machine will sort out and photograph anything the dial tells it . . "3/
By 1945, Bush had prophesied Memex [93], and in the 1950 Windsor lectures
Ridenour referred to an RCA development, the so-called "electronic pencil", a proposed
reading aid for the blind intended to convert printed characters to a suitable coded form.
He went on to suggest:
We shall have to arrange for cataloguing to be done by machine, without
human interaction except in terms of setting up once for all the system on
which the cataloguing is performed... It is only a step from this device (the
electronic pencil) to the electronic catalogue, which will read text for itself,
recognize key symbols and phrases with which it has been provided, and con-
struct appropriate catalog entries for the text it reads.'L4/
It has only been in the past decade or so, however, that there have been any serious
efforts directed to the use of machines for automatic indexing. In the period 1957-1958,
Luhn first presented and published several provocative papers dealing with such
challenging possibilities as "auto-abstracting", "auto- encoding" and "auto-indexing"
(Luhn, 1957 [385]; 1958 [374]; 1959 [371] ). Luhn's work on the permutation of signifi-
cant words in titles, abstracts, and complete text, the Keyword-in-Context or KWIC
1/
2/
See pp.[OCRerr]9-22 of this report.
3/
4/
See Crestadoro, 1856 [146]; see also Farley, 1963 [192];
Metcalfe, 1957 [416]; and Ohlman, 1960 [451].
See Keppel, 1939 [316], p. 5.
See Ridenour, 1951 [500], p. 26.
4
Linder, 1960 [362];