ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
S0CCER - A Concordance Program
chapter
Guy E. Hochgesang
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
111-6
listing. These statistics include a count of the number of tokens processed;
the number of types found; the number of types with only one token, two
tokens, three tokens, etc., up to the number of times each type occurred on
the restriction (or selection) list. Finally S[OCRerr]CCER writes out the time it
took to process the text and exits.
D. The Cutput Format
All of the output produced by S[OCRerr]CCER is written on the [OCRerr]UTPUT tape
in blocked BCD records of varying length. This tape includes a carriage
control character for each line and is meant to be printed off-line with a
printing pro[OCRerr];am capable of handling blocked records and printer carriage
control.
3. Tape Usage
A. Control Cards
The c[OCRerr]ntrol cards for each run of S[OCRerr]CCER must be included as DATA
cards on the system input tape, [OCRerr]. S[OCRerr]CCER always reads these card images
from A2 before beginning to process the text.
B. The INPUT, [OCRerr]UTPUT, and SMRTAP Tapes
As explained in Fart [OCRerr], the INPUT tape contains card L[OCRerr]ges of
the text to be processed. This tape may be mounted on any tape unit except
those used for the [OCRerr]UTPUT and SMRTAP tapes. The most convenient unit to use
for the INPUT tape may be A2. In this case the cards of the text are simply
subiutted as DATA cards ilmiLediately following the control cards on A2. The
logical tape number of the INPUT tape is controlled by the I[OCRerr][OCRerr]PLJT control card.
The [OCRerr]UT?UT tape contains the listing of the text and the concordance,