IRE Information Retrieval Experiment Laboratory tests: automatic systems chapter Robert N. Oddy Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. 9 Laboratory tests: automatic systems Robert N. Oddy 9.1 Laboratory experiment in information retrieval A laboratory is a sheltered place where one constrains the enthusiastic abandon of the real world in order to attempt to lay the blame for one apparently capricious phenomenon upon another. The extraordinary diffi- culties experienced in all areas of science when designing experiments testify to the complexity of the interactions between natural phenomena. Experi- ments are concerned with relationships between events or states as reflected by the measurements that we choose, and are able, to make. It is invariably necessary to control, or at least to keep tabs on, factors which are not the immediate concern of the experiment, but are suspected to have some influence on those that are. These comments apply whether an experiment is conducted in laboratory conditions or not. What, then, are the characteristics of a laboratory experiment which distinguish it from a real life experiment? I do not think that there is a clear distinction. However, one might point to a tendency for the experimenter to initiate events at his convenience in the laboratory, whe?eas he will observe natural occurrences in real life. In a laboratory experiment it will often be possible to control subsidiary variables: in a real life experiment one is more often faced with the problem of eliminating them in the analysis. It is easy to find exceptions to these tendencies. In experiments on aspects of human behaviour, for example, it is not always possible to control variables as the researcher would wish, even within the laboratory. Human cognitive activity is a substantial component of information retrieval, and in another chapter of this volume, Keen has discussed the problems of laboratory experimentation on that aspect of the field. In contrast, this chapter will consider laboratory work on the mechanical components of information retrieval. The events that the experimenter observes, in this case, are all executed by a machine whose characteristics can, to a large extent, be determined by the experimenter himself. Thus, in principle, he can exercise a considerable degree of control over variables. Indeed, it is hard to thinkof a `purer' laboratory environment in science. The most substantial examples of the type of work to which I refer are the long series of tests with the Smart system (see Chapter 15) and those conducted over a period of several years at Cambridge University by Karen Sparck Jones, C. J. van Rijsbergen and others. 156 I I