ISR10 Scientific Report No. ISR-10 Information Storage and Retrieval Search Request Formulation chapter Joseph John Rocchio Harvard University Gerard Salton Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. 2 2 identified in response to the original query, and [OCRerr] and S are the corresponding subsets identified in response to the modified query, [OCRerr] 1 2 [OCRerr] 1 2 then, for the second iteration, the sets [OCRerr] = R U[OCRerr], and .5 = S [OCRerr]S are available for the optimization algorithm. [OCRerr]asically, two alternatives are possible: the optimal vector to differentiate documents in R from documents in S may be used as a perturbation of the user1s original query; or this vector may be used as a perturbation of the query resulting from the first iteration. In practice this could be left for the user to decide, depending on his interpretation of the output from the first iteration. In the general case the expression for the nth modified query in which all modifications are made to the original query can be written: =[OCRerr]1q[OCRerr] + [OCRerr]2 hi $r[OCRerr][OCRerr]> ;-5[OCRerr][OCRerr] 1 1 ([OCRerr].14) where = n iT n n2 = n(s[OCRerr]), and[OCRerr] and[OCRerr] are ½) [OCRerr], S = Q) S , n = n([OCRerr]T), 1 2 i=1 i=1 weighting coefficients. If each modification is made to the preceding resultant query image, the iteration formula becomes: +[OCRerr]2 Fnli I r[OCRerr]i[OCRerr]nl2 I r.[OCRerr]RT s.E 5Th 1 1 ([OCRerr].15)