IRS13
Scientific Report No. IRS-13 Information Storage and Retrieval
Search Matching Functions
chapter
E. M. Keen
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
hI-i
III. S[OCRerr]arc[OCRerr] [OCRerr]atchin[OCRerr] [OCRerr][OCRerr]nctions
Ee M. Keen
1. Introduction
It is the function of a document retrieval system to draw to a
[OCRerr] attention some or all of the documents in the store that have
some probability of being relevant to the requestor1s need. The prGcessing
of a search request is in part a matching operation, and procedures used
in manual and mechanized systems are briefly described before automated
s[OCRerr]stems are discussed. SMART test results are given, separating the results
of the matching functions themselves from the weighting scheme that may be
employed. The analyses show that one particular matching function used [OCRerr]ith
the weighting scheme is superior to the others that have been tested.
A suggested experiment is proposed to determine whether development of still
better functions is possible.
2. Matching [OCRerr]ocedures used in Manual, Mechanized and Automated Systems
A) Manual Systems
The task of matching a search request against a stored file of
document representations is frequently an entirely manual process. After the
search request has been received and "translated'1 into the vocabulary of the
system, either words or code nun[OCRerr]bers, a searcher then seeks documents that
match the coded request by referring to the index file, that is, the cards
or the pages constituting the physical index. In searches made with dic-
tionary or classified catalogs, the searcher normally proceeds by starting