ISR10 Scientific Report No. ISR-10 Information Storage and Retrieval Search Request Formulation chapter Joseph John Rocchio Harvard University Gerard Salton Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. user, there is no certainty that an ideal search req[OCRerr]est (under some fixed index transformation) in fact exists. [OCRerr]elevance, the relation between the query and the subset D is a function of the user1s infor- mation needs and his interpretation of the text of reference documents; while the retrieval ordering produced by the system is a function of the query and document index representations In the case where there is no ideal[OCRerr]request corresponding to a given subset D[OCRerr], one might say that the index transformation is deficient from the point of view of the particular user (who specified i[OCRerr]), since it does not allow distincti[OCRerr]ns equivalent to those he can make In general one [OCRerr]must a[OCRerr]sume that this wilL be the norm[OCRerr]rather than the exception,. since the indexing[OCRerr]process is designed to reduce rather than preserve information. It is therefore useful to define an unambiguous, optimal search request as a function of DR, D, and implicitly, therefore, of the index transformation, such.:thatfor every non-empty unique `subset D of D, this optimal search request both exists and is R unique. An o[OCRerr]timal search request index image corresponding to a given subset D of a collection of'document images produced by the R index transformation T is that request image q which maximizes the difference between the mean of its correlations with the relevant documents (members of DR) and the mean of its correlations with the nonrelevant documents (members of D' not in D ). R The subjective relevance relation which specifies a subset D corresponding to each input query induQes an effective partition R