MONO91
NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report
Indexes Compiled by Machine
chapter
Mary Elizabeth Stevens
National Bureau of Standards
In the use of the JBM 705 for the concordance to the Summa Theologiae, Fr. Busa reports
that oflly 60 hours were required to arrange in alphabetical order 1, 600,000 words. 1/
This advantage of speed, with the concomitant benefits of both economy and timeliness, is
illustrated by Tasman as follows:
* . It has been estimated that it would take 50 scholars 40 years. . to manually
index the 13 million or so words of St. Thomas Aquinas! complete works. IBM
punched card machines would produce the indexes and concordances much more
accurately and would take ten scholars about four years. Large-scale data
processing techniques would reduce the time to about 25 percent... (or)... ten
scholars to do the job in less than a year 2/
Other advantages stem from the facility with which further machine processing can be
introduced. Once the text is in machine-readable form, a number of valuable byproducts
can be derived. Examples are statistics on the number of words that have 2, 3,... n
letters, frequencies of letter usage; printouts[OCRerr]of occurrences of specified words or groups
of words; and lists alphabetized on terminal rather than initial letters. Added advantages
of computer processing are further exemplified in the options available with the California
concordance computer program (1964 [95]), some of which are as follows:
(1) The user may obtain a restricted rather than a full concordance by supplying a
list of words for which nd entries are to be made.
(2) The user may obtain a selective concordance by supplying a list of words for
which, and only for which, entries are to be made.
(3) Each entry word may be centered with its preceding and succeeding context,
up to the limits of one full line of 131 characters, or each entry word may be
listed together with the full sentence or verse in which it occurs.
(4) Text with interlinear information such as grammatical symbols can be used and
selective concordances can be compiled on the basis of such interlinear
information.
(5) The citations of an entry can be listed in order of textual occurrence, in an
order determined by preceding or following words in its context or in an order
determined by accompanying interlinear symbols.
2.2 Card Catalogs, Book Catalogs, Bibliographies and Subject Index Listings
Prepared by Machine
The use of machines such as punched card equipment for the preparation and pro-
cessing of library card catalogs and of index listings was advocated by a few far-sighted
documentalists at least as early as the 1930's (Parker, 1938 [463]; Dewey, 1959 [153]).
1/
2/
See his statement in Scheele, 1961 [522], p.209.
Tasman, 1958,(596] , p.1[OCRerr].
19