IRE
Information Retrieval Experiment
An experiment: search strategy variations in SDI profiles
chapter
Lynn Evans
Butterworth & Company
Karen Sparck Jones
All rights reserved. No part of this publication may be reproduced
or transmitted in any form or by any means, including photocopying
and recording, without the written permission of the copyright holder,
application for which should be addressed to the Publishers. Such
written permission must also be obtained before any part of this
publication is stored in a retrieval system of any nature.
306 An experiment; search strategy variations in SDI profiles
Our boolean statements were meant to be optimum ones but to have optimuili
boolean profiles producing ranked outputs suggests a conceptual contradi[OCRerr]
tion. To have used less-than-optimum boolean statements would be no reil
test of boolean logic.
Costs
In a computerized search system the costs of the system may be allocated t(I
a number of factors-original database production or purchase; compute
operations such as searching, output preparation and printing, profile
maintenance, etc.; and non-computer operations such as the time ol
information scientists, keyboarding, general clerical, etc. Two of these
factors were considered in this investigation, viz. information scientist time
and computer search costs. Now, nearly five years later, the values used foi
salaries and computer processing costs are dated. Rather than simply update
them this section considers in more general terms what was learned in the
experiment about the various cost factors and how they related to condition[OCRerr]
in an operational system.
Intellectual (information scientist) effort
Although an accurate record of events in the experiment, the times obtaine[OCRerr]l
for profile compilation and modification (pp. 293 and 294 above) arc
somewhat artificial in the context of a real system in that: (1) only one
analysis/modification stage was undertaken; (2) the times for trivial (but
necessary) profile modifications were not recorded; and (3) only the actu[OCRerr]tl
compilation/modification times were included, i.e. there is no allowance for
`dead-time' (tea breaks, etc.) which in real life might involve an additional 20
per cent or so.
To translate the figures on pp.293 and 294 to a more real situation we
could: (1) assume an average of 3 significant profile modifications per year;
and (2) add 20 per cent dead-time to the above compilation/modification
times. If that is done comparative estimates are obtained for the information
scientist effort required per query per annum for the different search strategies
(Table 14.7).
TABLE 14.7. Estimated annual information
scientist effort per query
Search strategy' Information scientist
time (mm)
CT 90
CTW 107
CG 107
TWC 110
CGW 125
GTWC 128
GWC 136
B 151
BW 168
Assuming a 35-hour week and 47 weeks worked per year, the number of
profiles handled per year by an information scientist working full time would