<DOC> 
<DOCNO> IRE </DOCNO>         
<TITLE> Information Retrieval Experiment </TITLE>         
<SUBTITLE> The Smart environment for retrieval system evaluation-advantages and problem areas </SUBTITLE>         
<TYPE> chapter </TYPE>         
<PAGE CHAPTER="15" NUMBER="327">                   
<AUTHOR1> Gerard Salton </AUTHOR1>  
<PUBLISHER> Butterworth & Company </PUBLISHER> 
<EDITOR1> Karen Sparck Jones </EDITOR1> 
<COPYRIGHT MTH="" DAY="" YEAR="1981" BY="Butterworth & Company">   
All rights reserved.  No part of this publication may be reproduced 
or transmitted in any form or by any means, including photocopying 
and recording, without the written permission of the copyright holder, 
application for which should be addressed to the Publishers.  Such 
written permission must also be obtained before any part of this 
publication is stored in a retrieval system of any nature. 
</COPYRIGHT> 
<BODY> 
                                      Concluding remarks  327

in a morass of one's own creation. If the system is misdesigned and does not
adequately reflect any part of the real world, the evaluation results themselves
will likely prove to be useless. This point of view has been espoused most
cogently by L. B. Doyle in an early book review44:
 `A comment is needed about the Smart system... The word "system" is
 misleading. It is really a chemistry laboratory for retrieval principles and
 procedures... it is a tour deforce in experimentation in the documentation
 area, the like of which is seldom seen... My only reservation about Smart
 is that it may not be doing the right kind of chemistry-but then hardly
 anyone is... The aspect adjudged most negative is that so much research
 should have been done by one party under a suboptimized set of
 assumptions....
 It is now unfortunately too late to ask the author of the foregoing quote to
explain these statements-the only comment actually made by Doyle raises
the question of `what good is a retrieval system when nine-tenths of the
possible users use the telephone instead ?`-a statement that is surely less
appropriate in 1979 than it was when written in 1969. But obviously the
reviewer's principal contention is certainly correct: if the assumptions in an
evaluation system are suboptimized, the results may not be worth a great
deal.
 How then do the Smart assumptions relate to reality? In principle, many
questions can be raised about the appropriateness of the basic model, quite
apart from the problem specifically due to the restricted experimental
environment. Thus the vector space model may be questioned based on the
fact that the scope (as opposed to the subject area) of an item cannot be
represented by a simple vector length and direction. In particular, two items
might cover the same subjects and hence be represented by identical vectors,
yet the topic areas could be treated narrowly in one case and broadly in the
other. The suggestion is therefore made that bibliographic items should be
represented by vectors supplemented by scope or extension measures, instead
of by vectors alone as in Smart45
 Other problems, already mentioned in part, concern the implicit
assumptions of term independence in the vector processing model, that is,
the partially false notion that content identifiers occur independently of each
other in document and query vectors46. Independence among evaluation
parameters is also assumed by certain statistical tests used in Smart to assess
the significance of the evaluation results. Additional methodological
objections are easy to find in a computer environment comprising many tens
of thousands of processing steps.
 Some of these comments are formally correct; a laboratory model by its
very nature can never fully reflect the real life conditions. The question is
whether the deviations are sufficiently serious to affect the usefulness of the
model. So far as Smart is concerned, the first-order characteristics of the real
world are believed to be properly represented-the subject content of a
document represented by a vector in multi-dimensional space is more
important than its extension, and to a first-order approximation, the terms
used to characterize the documents are indeed independent. Furthermore,
while the formal proofs of effectiveness of some Smart procedures are
applicable only in restricted environments (binary vectors, term independ-

</BODY>                  
</PAGE>                  
</DOC>