ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Operating Instructions for the SMART Text Processing and Document Retrieval System
chapter
M. E. Lesk
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
11-37
that new cards on [OCRerr] replace the old file on A6 which is skipped, or
IG[OCRerr]RE, indicating that new cards are on A2 and tape A6 is to be ignored,
or OOPY, indicating that the old phrase dictionary on A6 is to be copied
unchanged to B5.
If REF[OCRerr]AC or IGNORE are specified, the control card must be followed
by the cards for the phrase dictionary. Each phrase is punched on a
separate card. The concept zxun[OCRerr]er of the entire phrase is punched right-
adjusted in columns 1-5. The componen[OCRerr] concept nun[OCRerr]bers are punched right-
adjusted in six-column fields, using the leftmost fields first. Thus, the
first component goes in columns 6-li, the next in 12-17, the third component
(if any) in 18-23, and so on.
Statistical phrases must have two components, and may have up to six.
Non-significant concept nuD[OCRerr]ers may be used; for exsmple, one could find
the phrase "exclusive or" by searching for the components "exclusive't and
[OCRerr]tor?1 even if "or" had a non-significant number (as it usually does).
The statistical phrase dictionary may be in any order, but the last
card must contain NO[OCRerr]R in columns 1-5 and a blank in column 6.
5.1.3. Syntactic Suffix List
The second file on the library tape contains the partial homograph
codes associated with the suffixes. One control card is required by the
library update programs; it contains one of four codes in column 1-5.
The possible codes and their meaning are:
(ORIG A deck of suffix cards follows in the format described
below. Generate a file of these suffixes, ignoring tape A6.
(DUPL The file on A6 is copied unchanged to B5. No other cards