ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Design Criteria for Automatic Information Systems
chapter
M. E. Lesk
G. Salton
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
V-'
Design Criteria for Automatic Information Systems
M. E. Lesk+ and G. Salton
1. Introduction
Considerable attention has been paid in recent years to the organiza-
tion of information centers. Various plans have been advanced for the
establishment of partly mechanized information and library centers, and
recommendations have been drawn up specifying the organization of a
national document handling system. Ll,2] In general, such plans stipulate
use of a given equipment complex to store the information to be searched.
Provision is normally made for introducing search requests from a variety
of input stations, some of which may be situated far away from the central
equipment, and users are often allowed to submit their requests asynchronously,
and independently of each other.
Nearly all of those proposals are, moreover, based on a number of under-
lying assumptions, which though unproved and unaccompanied by supporting
evidence, are nevertheless stated with great forcefulness and considered to
be axiomatic. The principal assumptions may be stated as follows:
a) a computer cannot perform the intellectual work required to
analyze the content of a document, and information centers
must therefore rely on a large staff of human subject experts
to assign keywords to all items stored in the system;
* This study was supported in part by the National Science Foundation under
grants GN-360 and GN-[OCRerr]95.
+ Aiken Computation Laboratory, Harvard University, Cambridge, Mass., 02138.
Dept. of Computer Science, Cornell University, Ithaca, New York, 14850.