IRE
Information Retrieval Experiment
Simulation, and simulation experiments
chapter
Michael D. Heine
Butterworth & Company
Karen Sparck Jones
All rights reserved. No part of this publication may be reproduced
or transmitted in any form or by any means, including photocopying
and recording, without the written permission of the copyright holder,
application for which should be addressed to the Publishers. Such
written permission must also be obtained before any part of this
publication is stored in a retrieval system of any nature.
180 Simulation, and simulation experiments
features of just one concept, both justifying our provisional use of the one
term and encouraging us to relate it to information systems. First, the notion
of `optimization' provides one general feature. One does not vary either the
representatives of the components of a system or the sequence of operations
in a system, capriciously. One does so in order to identify a system that is
`best' according to some criterion. In that a notion of social utility is so
implied, simulation could be seen as differing from the classical sciences-
which are simply predictive systems of thought-and positioned rather more
closely to technology. Yet it differs from the latter too in that [OCRerr]udgement' is
always a component of simulation in practice a second general feature.
This consists, at the least, in the specification of a criterion or of criteria by
which optimization is to be judged; and will also consist in the delimitation
of the `system' to be studied and the identification of the system's essential
components. If simulation could both delimit an area for analysis as well as
be the analysis, this would involve attributing an `intelligence' to it. Obviously
it is the presence of human judgement together with some rationale for
analysing phenomena that embodies this intelligence. (It would be `an error
of judgement' to omit to identify a variable that sensitively affected another
variable by which optimality was to be judged.) This raises intriguing
questions where the subject of the simulation is human decision-making, for
example `relevance-judging' or `query-forming' in information retrieval.
Here one is (in simulating same) making judgements about judgements. And
since a formulation of a query (say) is in part a judgement about how an
indexer will have chosen index terms, i.e. again a judgement about a
judgement, we have a third tier of judgement. It seems that a recursive
definition of simulation is needed to clarify things here, but to the writer's
knowledge this has not yet been attempted. A third general feature of
simulation is the admission, into the system description, of the notions of
`input' and `output'. These entities, usually seen as separated in time, may be
real artefacts (e.g. documents, money) or information (a message on a VDU)
or decisions (`Get file XYZ'), to choose information retrieval instances.
Lastly, and really in continuation of the first feature above, we note that one's
interest in simulating systems is usually prompted by the possibility of
intervention in the system: or even total control of it. Without an
interventionist ethos we would be back in the realm of classical theoretical
science.
In the face of this complexity, partly of the making of those who have
written on simulation, the temptation is to wield an Occam's razor in the
form of just one clear prescriptive definition of `simulation'. In the writer's
view however this would be a mistake in the particular context in which
information retrieval experimentation finds itself: a context which is
commonly denoted `information science' but which might better be called
`the philosophy of information'. For the ambiguities of simulation in this
context are indicative of the ambiguities that we need to overcome if
information science is to develop. The characterization of `information',
`variable', `representation', `system' as part of the human cognitive apparatus,
and the attaching of abstract meanings to concepts such as `observable' or
`objective viewpoint' or `explaining power' or `language' are just the kinds of
things we need to do in order to develop information science. So that taking
the broadest definition of information science we could say, more rhetorically,
I