Information Retrieval Experiment

IRE Information Retrieval Experiment Laboratory tests: automatic systems chapter Robert N. Oddy Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. Model building 171 this is a less common activity, the terms `theory' and `model' are used almost interchangeably in the information retrieval field. A proper discussion of the differences between the usages of these words in science would be extensive and out of place here54' 55. I should like to use the term `model' to refer to a particular type of theory: one that attempts explanation of a phenomenon by describing aprocess and exploring its effects. This gives us an approach to the problem of relating the outputs to the inputs of our systems which is distinct from mathematical argument. Recently, mathematical theories have been used to suggest retrieval algorithms16:I would not call such an algorithm a emodel[OCRerr] while it is still entirely derivative. One might ask why we should wish to build models without a sound mathematical theory. In information retrieval, we are trying to reproduce automatically a cognitive act: the decision as to whether a document, or `item' of information, is relevant. It is notoriously difficult to formulate mathematical theories which account for, and enable us to handle the `)ariability in human behaviour56' [OCRerr] According to Farrell50, many psychologists believe that outputs cannot be related to inputs in any simple (mathematical) way, and that internal states must be taken into account. This is best done by modelling. If we are looking for retrieval systems which are more responsive and adaptable to the individual user we could do worse than model the behaviour of good human information providers subject experts, librarians and information scientists. Following a `systems' approach, one can formulate a high-level structural model of their behaviour, illustrated by a block diagram showing the relationships between components, for instance, and experi- mentally determine whether the human behaviour fits the model (see, for example, Olney58, Ingwersen and Kaae59, Bivins60 and, for methodology, the chapter by Keen (8)). Alternatively, one can attempt to build a computer program with the hypothetical structure, run it, and observe its behaviour. In the latter approach, the modeller must elaborate the meaning of all his high- level components in a very precise way. Before making more general points, I should like to illustrate the process, briefly, from my own experience, by explaining how certain essential parts of a particular program, called Thomas61' 62, were written. The program is an interactive system which provides a browsing facility for the user. He is not required to formulate a query, and as the dialogue progresses, his reactions to the indexed references are used by the program to build a picture of his area of concern. At the highest level, a `cognitive' model of dialogue, essentially like Hollnagel's63, is assumed. Each participant has his own image of the world, which includes an image of the other's world-image: let me call this included image a `meta-image'. Communication becomes more effective as these meta-images more accurately portray the current concerns of the participants. To implement this model in a man-machine dialogue, the computer program must have a fund of knowledge about the world, and a means of representing its image of the man. It must be able to improve that meta-image as the dialogue progresses, that is, in response to the man's utterances, and in displaying information to him it must aim not only to give him relevant references, but also to help him form his image of the program's world-image. (We are beginning to see some guidelines for a program design.) Thomas' world-image is a graph in which the nodes represent documents, subject terms and authors, and the arcs associations between them, derived