ISR10
Scientific Report No. ISR-10 Information Storage and Retrieval
Search Request Formulation
chapter
Joseph John Rocchio
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
1. Read an initial query vector q_0 (in integer format),
convert it to a unit vector, and store it in the array
Q(I), I = 1,N. (N is the dimension of the index language
vector space.)
2. Read in the set of relevant document vectors r_j,
j = 1,n1, convert them to unit vectors, and store them
in the array R(I,J), I = 1,N; J = 1,N1.
3. Read in the set of nonrelevant document vectors s_j,
j = 1,n2, convert them to unit vectors, and store them
in the array S(I,J), I = 1,N; J = 1,N2.
(Note that since the dimension of the index space was
N = 511 for the thesaurus used, and since a document
vector typically has about [OCRerr]5 nonzero components, the
program actually handled the vectors in a condensed
format.)
4. Form a new query vector represented by the array:
$$Q1(I) = N1 \cdot N2 \cdot Q(I) + N2 \sum_{J=1}^{N1} R(I,J) - N1 \sum_{J=1}^{N2} S(I,J), \qquad I = 1,N$$
5. Normalize to unit length:
$$Q1(I) \leftarrow \frac{Q1(I)}{\left[ \sum_{K=1}^{N} Q1(K)^2 \right]^{1/2}}, \qquad I = 1,N$$
Program Steps for Producing Relevance Feedback Modified Queries
Table 5.1
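The steps of Table 5.1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the program described in the report: the function names `to_unit` and `feedback_query` are hypothetical, and the dictionary representation (term index mapped to weight, zero components omitted) is merely one way to mimic the condensed format mentioned in the note. The sketch also assumes that both the relevant and the nonrelevant sets are nonempty, since otherwise the factor N1·N2 would cancel the original query.

```python
import math


def to_unit(vec):
    """Scale a sparse vector {term_index: weight} to unit Euclidean length."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    if norm == 0.0:
        return {}
    return {i: w / norm for i, w in vec.items()}


def feedback_query(q0, relevant, nonrelevant):
    """Follow steps 1-5 of Table 5.1 on sparse vectors (illustrative only)."""
    n1, n2 = len(relevant), len(nonrelevant)
    # Assumption of this sketch: both sets are nonempty, otherwise
    # the N1*N2 weight on the original query would be zero.
    if n1 == 0 or n2 == 0:
        raise ValueError("need at least one relevant and one nonrelevant vector")

    # Steps 1-3: convert the query and all document vectors to unit vectors.
    q = to_unit(q0)
    r = [to_unit(v) for v in relevant]
    s = [to_unit(v) for v in nonrelevant]

    # Step 4: Q1(I) = N1*N2*Q(I) + N2*sum_J R(I,J) - N1*sum_J S(I,J)
    q1 = {}
    for i, w in q.items():
        q1[i] = q1.get(i, 0.0) + n1 * n2 * w
    for v in r:
        for i, w in v.items():
            q1[i] = q1.get(i, 0.0) + n2 * w
    for v in s:
        for i, w in v.items():
            q1[i] = q1.get(i, 0.0) - n1 * w

    # Step 5: normalize the modified query to unit length.
    return to_unit(q1)


if __name__ == "__main__":
    # Toy data: term indices and weights are made up for illustration.
    modified = feedback_query({1: 3, 2: 4}, [{1: 1}], [{3: 2}])
    print(modified)
```

In the toy call, term 1 occurs in both the query and the relevant document and so gains weight relative to term 2, while term 3, which occurs only in the nonrelevant document, receives a negative weight.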