ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Operating Instructions for the SMART Text Processing and Document Retrieval System
chapter
M. E. Lesk
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
11-11
value, i.e. they nuist be assigned the proper word stem or the
proper thesaurus category;
2) the components of the phrase miist have the proper syntactic
role (e.g. "aut[OCRerr]natically translating" would not match "aut[OCRerr]natic
translation");
3) the components of the phrase zinist exhibit the proper syntactic
dependency relations with each other (e.g. "blind Venetian" would
not match "Venetian blind").
This phrase searching procedure makes use of the Kuno multiple-path
syntactic analyzer (7,8,9] to determine the syntactic roles and dependency
relations of the words in the sentence. This makes possible very accurate
searching, in that, for exan[OCRerr]le the phrase "solid state" would not be
recognized as appearing in "Some authors [OCRerr][OCRerr]hose other work is solid state
that information retrieval can be performed without accurate syntactic
analysis."
It should be noted that the syntactic phrase searcher is very slow
and also very erratic in its tinzing. It is hoped that the use of a new
version of the Kuno syntactic analyzer will alleviate these problems (10].
The specifications to be used in the phrase searching procedure
are:
which causes a statistical phrase search to be performed;
which causes a syntactic phrase search to be performed;
causes a table of the detailed correspondences between
tree nodes and sentence words to be printed;
SYNA[OCRerr][OCRerr] causes the syntactic analyses of the text searched to be
printed.
STATPR
SYNTAX
NODECO