ISR10 Scientific Report No. ISR-10 Information Storage and Retrieval The Query-Document Matching Function chapter Joseph John Rocchio Harvard University Gerard Salton Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. Comparison Operation Definition Equality A=3 a [OCRerr] A[OCRerr]a 6 B Inclusion ACB a [OCRerr] A[OCRerr]a E 3 Overlap Correlation [OCRerr](A,B) = n(A[OCRerr]3) Metric Distance g(A,B) = 1[OCRerr](A:[OCRerr] B)/n(AU[OCRerr]3)[OCRerr] Comparison Operations on Set Represented Operands Table 4.1 Boolean al[OCRerr]ebra to the partially ordered system formed by the subsets of the keyword set and the set inclusion relation. This allows one to structure the representation of a search request in the form of a Boolean combination of [OCRerr]eywords, i.e. q =w(k[OCRerr]) as opposed to using an unordered keyword set representation. Let column i of the keyword document matrix (Fi[OCRerr]ure 4.1(a)) represent the document subset of the ith keyword. The retrieval,operation, then, consists in generating the retrieved subset R by replacing each keyword in the Boolean query polynomial by its keyword set and substituting set intersection for Boolean `and" and set union for Boolean "or". With this transformation the subset R is specified by: R =L[OCRerr]1(K.) 1