Information Retrieval Experiment

IRE Information Retrieval Experiment Ineffable concepts in information retrieval chapter Nicholas J. Belkin Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. Inference chains and operational definitions 51 uncertain ways, to the original ASK idea. The problem now is: how can one test the validity or accuracy of that original cdnstruct? For instance, if one wishes to know if the end representation is an `accurate' reflection of the user's ASK, and tries to determine this by direct question, the response may he dependent upon any one of the assumptions in the entire chain, with no easy way to tell which of them, or which combination, is at issue in the response. In principle, one ought to examine and test the validity of each of the hypotheses made along the way before performing the test as a whole, but in practice this is very unlikely to happen. The more generally appropriate strategy is to attempt to develop a design in which, as in this example, responses can be directed to each confounding factor. With a chain of assumptions the length of this example, which is probably not unusual, the design may become very complex, and the test instrument clumsy. A possible approach is to run a series of tests, each concentrating upon the end product from the point of view of one of the assumptions, using the data derived from each of the series for design of the subsequent members. But in such a case, there will be some assumption or hypothesis that one cannot, or will not be willing to test (in the example, perhaps the idea of knowledge as a structure of concepts and relations), which must then be considered as an integral part of the original construct itself. An example of this type of problem from the text-related concept group is the question of relative informativeness of text representations. This is an obviously important issue in comparative evaluation of techniques for text representation, especially if the text representations are to be used as the basis for relevance judgements or for matching for retrieval. In the formal case, the situation is that the user, or the user's representative, is presented with some description of a text, on the basis of which a probabilistic relevance decision must be made. In such a case, the obvious strategy is to compare judgements of the representations with relevance judgements of the entire documents, within each system, and then to compare the overall results of the competing systems. Belzer30, for instance, has done an experiment of this sort. In this case, informativeness is operationally defined as the capability of the representation to induce a `correct' relevance judgement in the user. There will be problems in such a design with possible interactive effects of the documents and document representations upon relevance judgements made by any individual, but in general the dependent variable can be fairly well isolated from problems of inference chains, as long as the test is only evaluative or comparative. But, if the purpose of the test is explanatory; that is, if one wishes to explain the differences in informativeness between the representations, then concepts of aboutness, meaning and information become important, and the inference chains from the underlying theory to the eventual representation can cause problems. One now must begin to investigate the assumptions, to see how they have influenced the representations, in order to decide whether the intermediate assumptions, or the underlying concept, are the reasons for the performance. In the case of informativeness as applied to matching for retrieval, the situation is much more complicated immediately, for it must require an aboutness or meaning concept for its implementation. That is, informative- ness here means the ability of the representation system to represent both