IRE Information Retrieval Experiment Simulation, and simulation experiments chapter Michael D. Heine Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. Some previous work in simulation applied to information retrieval 195 operational Systems (where Cooper was concerned with more hypothetical data) this objective does not appear to have met with complete success, since (a) operational data for a full validation of the simulation model was not obtainable, and (b) the data that was obtainable from existing small test collections was either inadequate or inapplicable (p. 11). The main goal of preparing an information retrieval system simulation appropriate to operational retrieval systems appears therefore to be far from complete, since even if a model incorporating valid real data in all significant components could be found, there would still remain the problem of `designing this in' to a larger model taking into account the motivations of users and supporting agencies, as discussed by Reilly, and Baker and Nance. If valid data cannot be obtained (which seems unlikely), then there is a limitation here in princiole to the usefulness of the simulation approach. Further discussion The general matter of the appropriateness of incorporating test-collection based data into a simulation model deserves careful examination. A prior question is whether research on such collections should be regarded as `simulation work'[OCRerr]a point touched on in the introduction. Given the immense amount of valuable work that has been done using (say) the Cranfield experimental data32, by researchers not involved in the actual acquisition of that data: for example Salton3, Sparck Jones33 or van Rusbergen and Sparck Jones34, it seems reasonable to regard such work as `simulation'. It is oriented towards optimization of a process (information retrieval) through intervention of some kind in it, and is concerned with data already found. It is a substitute for further data acquisition achieved through the manipulation of experimental variables, i.e. manipulation of the `apparatus'. It has become clear from such work that the experimental data incorporated into the test collections are in fact an indispensable input to analyses of a very wide range, as discussed in detail elsewhere in this volume, and that such analyses were not initially objectives of the experimental work. They have clearly `spun off' it. In the sense that such theoretical work uses both the experimental data as input, and uses some of the representations of system components recognized in the experimental work, it is reasonable if somewhat arbitrary to regard it as simulation work. This does not perhaps take us very far, but it points to the conceptual barrier between theoretical work (simulation work in the broader sense) and experimental work as a hazy and rather unsatisfactory one: theory involves constructs; experiment involves apparatus; both manipulate and control data. Of much greater concern, since it is not just a semantic matter, is the question of whether it is legitimate to use experimental data (an archive of which may be stored as a `test collection') in a simulation study when the data was obtained using questionable methodology. A clear example here is the employment in the experiment of a requirement that `relevance' be judged in relation to a `question' where the latter is defined to be a solely verbal construct having no explicit relationship to a context of information need. (The question might be a sentence or paragraph in English, for example.) Here the relevance judgements that appear as experimental data are made with ambiguous terms of reference. Is the arbiter of relevance to hypothesize