IRE Information Retrieval Experiment Ineffable concepts in information retrieval chapter Nicholas J. Belkin Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. 54 Ineffable concepts in information retrieval their own right; the second is as possible sources of error or variation in other aspects of the system with which the tester is primarily concerned. Each of these situations raises different questions and different problems of interpretation and control. The major problem in the first case lies in thc construction of operational definitions of the variables to be studied. This may be difficult, but once these definitions have been established, it may be possible to study the variables in isolation. For instance, if one wants to investigate the concept of `aboutness' experimentally, one could control the experiment to include, say, only documents and readers of documents, without reference to an information retrieval system at all. The problems of the second case, on the other hand, arise because the interactions of variables within information retrieval systems are so complex. In this situation, it is in general not possible to isolate variables completely. For instance, if one wants to evaluate some method of content description within an information retrieval system, then the relationships between the concept of aboutness, the description method and the evaluation measures are significant, even though aboutness is not an explicit variable in the experimental paradigm. Thus, in the second case, which is likely to be the usual situation in information retrieval system testing, one ought to begin by attempting to isolate all of these variables (conceptually), and then to see to what extent they actually might influence the variables in which one is interested, and the evaluation measures. In this case, again, one is not interested in the `ineffable' variable itself, but rather in the effects that not having included it in the design might have on the results of the test. It is useful here to discuss, in rather general terms, the extent to which being able to deal with these concepts might affect an information retrieval test. Text-related concepts It is possible that having a specific, operationally definable and experimentally tractable information concept is a necessity for the development of theory in information retrieval. Nevertheless, there is some question as to whether this particular concept need be made explicit in evaluation tests, at least of existing systems. The reason is that there is usually only a minimal relationship between whatever passed for an information concept in the system as originally formed and the measures used to evaluate the system's performance. On the other hand, if one wishes not only to compare, but also to explain differences between systems, then both information and aboutness become quite important. This is especially true if the focus of attention is the description mechanism, rather than, say, the retrieval strategy. If one wishes seriously to explain the difference between two description mechanisms, then one must be able to discuss the relationship of each mechanism to an underlying information or aboutness concept. For instance, one could compare two systems of automatic indexing in terms of retrieval performance (recall and precision, say) and discover that one of the systems gave consistently better results. But to be able to say why this was the case, one would need to consider the underlying assumptions of each method, in terms of an underlying information or aboutness concept. One could then make some decisions about whether the difference in performance lay in a different technique deriving from the same assumptions, or in different assumptions. I I I I I I I i I i I I I I I I- h I I I I 5 I I