IRE
Information Retrieval Experiment
Ineffable concepts in information retrieval
chapter
Nicholas J. Belkin
Butterworth & Company
Karen Sparck Jones
All rights reserved. No part of this publication may be reproduced
or transmitted in any form or by any means, including photocopying
and recording, without the written permission of the copyright holder,
application for which should be addressed to the Publishers. Such
written permission must also be obtained before any part of this
publication is stored in a retrieval system of any nature.
Definitions or interpretations of the concepts 45
(1) User-dependent concepts: information need; desire.
(2) Text-dependent concepts: information; aboutness; meaning.
(3) User and text-confounded concepts: satisfaction; effectiveness; synthema
(homeosemy).
This categorization of issues derives from the general structure of
information retrieval systems2, in which documents and needs are each
separately represented, then matched against one another in order to retrieve
documents which are judged by the user according to their appropriateness
to her/his need. This situation requires concepts basic to need representation
and need understanding, concepts basic to text representation and under-
standing, and concepts concerned with the relationships between text and
need. Various concepts basic to each of the three areas outlined above have
been widely discussed in the literature, although not always to great effect as
far as testing of information retrieval systems is concerned. Usually, the test
of any system has been concerned with secondary entities or processes in one
of the sub-areas (such as comparative experiments on indexing systems),
stopping short of investigating the relationship of underlying concepts (such
is information or aboutness) to the results of the tests, or even of determining
whether there were any such underlying concepts to the systems being tested.
The suggestion here is that it may now be the right time to begin such
investigations, to make these concepts at least explicit in testing, and perhaps
even to make them the basic variables in the testing of information retrieval
systems. Before continuing this argument, some general discussion of these
concepts themselves is in order.
4.2 Definitions or interpretations of the concepts
Jser-related concepts
Although this group of concepts seems to be the obviously central core to the
information retrieval situation, since evaluation of system performance
should be solely in its terms3'4, it seems to be the most neglected in the
literature of information retrieval system testing. This may be because
concepts such as relevance, which depend upon this group but are confounded
with the text-related concepts, have been initially more important to systems
testers in that they provide the means for direct comment on system
l)erformance. There has been somewhat more treatment of user-related
concepts in such areas as reference work5 and in theoretical discussions of
4
information science
The basic situation, as outlined by Taylor5, is that of a person coming to an
information system with some already (at least vaguely) recognized need,
md going through various stages of representation of that need which
culminate in a formal request put to the information retrieval mechanism in
terms which it can use for matching against its store of texts. In this situation,
one can recognize a number of elements which are likely to affect the
iiiechanism in significant ways, yet which are difficult to describe or quantify.
The first of these is the desire of the user. This concept seems not to have
l'een discussed explicitly as a separate issue, but one should note that, apart
t#()m a need for information, the user comes to the mechanism with some set