ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Operating Instructions for the SMART Text Processing and Document Retrieval System
chapter
M. E. Lesk
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
h-i
II. Operating Instructions for the SM[OCRerr]T Text Pro[OCRerr]essing
and Document Retrieval System
M. E. Lesk
1. Introduction
SMART is a fully autanatic programming system designed for the analysis
of [OCRerr]glish[OCRerr]text and for the processing of information requests. The S[OCRerr][OCRerr]RT
system can also be used to perform ccmplete studies of information
retrieval systems, vocabulary and dictionary investigations, and mm:ny
other linguistic and classificatory studies. Among the features of the
system which are available for purposes of content analysis are: thesaurus
look[OCRerr]p, phrase searching methods (including complete syntactic analysis),
hierarchical processing, statistical association methods, and title/abstractl
text differentiation. Intermediate and final results are available in
convenient form for detailed study, or alternatively, a complete retrieval
technique may be evaluated automatically. A full description of the methods
and programs used may be found in [1); shorter writeups are available in
[2) [3), and [[OCRerr]).
The system described here supersedes the operating system described
a year and a half ago in [5) and [6). The basic differences between new
and old systexn.[OCRerr] are a widely expanded system capacity and faster processing
speeds. In future years, it is hoped to experiment with time-sharing
environments, and additional processing algorithms are being developed.
Since the programs have reached a state of semi-stability, however, new
operating instructions are being prepared.