ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
S0CCER - A Concordance Program
chapter
Guy E. Hochgesang
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
hI-i
S[OCRerr]CCER - A Concordance Program
Guy T. Hochgesang
1. Introduction
SQCCER* was written in response to a need for a program which could
produce concordances of long texts. The program was written with three major
objectives:
1. It should be sufficiently fast to permit concordances of
moderately long texts.(30,OO0 - 300,000 words)
2. It should produce concordances in an easily-read format. 11TokensT'
or words in the concordance shouldbe listed with as much context
as possible to reduce the need for references to the original text.
3. It should be simple to use, requiring a minimum amount of effort
to set up for each text.
Existing programs generally fail to meet one or more of these objectives.
These programs usually do not satisfy the most crucial criterion, that of speed
of processing. In order to make it economically feasible to produce concor-
dances of moderately long texts, a more efficient program was needed. By
using a balanced-merge tape sort with overlapped I-[OCRerr], [OCRerr]CCER achieves a
significantly faster rate of processing than other concordance programs. It
is believed that this speed has been achieved without sacrificing anything in
the way of simplicity for the user or the utility of the concordance produced.
*SQCCER : Smart1s Own Concordance Constructor, Extremely Rapid.