IRE Information Retrieval Experiment Ineffable concepts in information retrieval chapter Nicholas J. Belkin Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. 48 Ineffable concepts in information retrieval by the user of the appropriateness of text to need based not upon the question put to the system but rather on the user's entire desire and need state at the time of receiving the text for judgement. Each of these concepts of relevance is obviously quite different from the others, and each has special advantages and problems in testing situations. The major point here is that the choice of relevance concept will radically affect evaluation of system performance. The most commonly used measures of performance, recall and precision, depend solely on relevance judgements, and thus are peculiarly sensitive to this concept. Utility, the other major proposal for performance evaluation (see, e.g. Cooper25), can in principle take account of other factors than relevance in the destination's or logical senses, but at the moment practical difficulties in assessing this measure make its use difficult. Karlgren26, with homeosemy, and Robertson27, with synthema, have independently attempted to formalize the topic relationship between text and question. They both suggest that the problem may be considered as the degree to which the aboutness of each coincides with that of the other, and that this relationship, especially in Robertson's suggestion, might provide a scale which would underlie the concept of relevance. Notice the extent to which both of these notions depend upon the concepts of aboutness or meaning. These suggestions provide a potentially valuable way of partitioning the various aspects of the relevance concept, but as yet are still only suggestions, with no concrete methods of implementation. To return to the general notion of satisfaction, one can see that logical and destination's relevance, and more formally synthema or homeosemy, refer only to conceptual aspects of the need, and assume that the question is an adequate representation of the need. Situational relevance extends the notion to include beliefs and other aspects of the user's condition, while accepting the possibility of adequate linguistic representation, but pertinence and utility go even farther in attempting to take into account affective and other aspects of the need, and do not necessarily assume that the question expresses the need in its entirety. Usefulness is another term that has been used in place of relevance in attempts to include more than strictly topic-related aspects of need in the judgement process. Apart from satisfaction, in terms of relevance or utility, effectiveness appears to be the only major candidate as a basis for information retrieval system evaluation. If, for example, information is considered as `data of value in decision making28', then system performance can be evaluated by whether its output was used in making appropriate decisions; that is, by some `objective' or at least behavioural measure of the effect of its output on the user. This approach completely bypasses the question of relevance, but is unfortunately very difficult to operationalize, especially in obtaining and measuring observations of the effect. It also poses significant difficulties in prediction of effect, because of its situation dependence. 4.3 Interactions The concepts discussed in this chapter are difficult to deal with in an experimental or investigatory setting not only because they are conceptually and practically not very tractable, but also because they are highly