IRE
Information Retrieval Experiment
An experiment: search strategy variations in SDI profiles
chapter
Lynn Evans
Butterworth & Company
Karen Sparck Jones
All rights reserved. No part of this publication may be reproduced
or transmitted in any form or by any means, including photocopying
and recording, without the written permission of the copyright holder,
application for which should be addressed to the Publishers. Such
written permission must also be obtained before any part of this
publication is stored in a retrieval system of any nature.
296 An experiment: search strategy variations in SDI profiles
(3) T4 Assign weights to terms
In assigning weights subjectively to individual terms, values in the rangc
1-10 were used but where a single concept seemed paramount to the
query the scale was extended to cover weights of 1-20. The facility for
utilizing a wider range of weights was available (-999 to + 999) and in
an operational system much higher positive and negative weights could
be used for ranking highly or excluding altogether items in certain
journals or languages, etc.
An interesting characteristic of assigning weights subjectively to
individual terms is that almost invariably one finds oneself separating the
terms into groups (task T2), mentally ordering the groups into their
relative importance (task T5), and setting imaginary threshold weights in
a manner very similar to constructing a logical search. It is not suggested
that one necessarily has to formally perform tasks T2 and T5 in order to
assign individual weights to a single list of terms; only that it is difficult
to proceed directly from task Ti to task T4 without thinking in terms of
tasks T2 and T5, and even simulating T3 (assigning boolean equations).
A reservation concerning subjective weighting of terms is its possible
lack of appeal to individual users interested in compiling their own
profiles. No doubt some individuals would delight in the facility; others
might be quickly frustrated by the problems of unanticipated homonyms,
etc.
(4) T5[OCRerr]Arrange groups in order of importance
Normally it is a fairly quick task to arrange subjectively the groups of
terms in order of their importance to the query. When `powers of 2'
weighting is being used this task becomes quite crucial to strategy
GTWC. The feeling persisted that `powers of 2' weighting would be
better suited to controlled- rather than free-language searching, i.e. it
would operate better with greater pre-coordination of terms. If this is so
then strategy GTWC may have suffered somewhat from the approach
advocated for task T2 which was to divide into as many concepts as
possible.
(5) T6[OCRerr]Assign weights to groups
The same general procedure was followed in assigning weights subjec-
tively to the groups of terms as was described above in (3) for the
individual terms.
Relevance assessments
An important factor when considering the mechanics of the experiment was
the role of the user group. It seemed desirable that they should operate in a
`near-real' situation but at the same time it was necessary to have as many
documents assessed as possible to ensure the validity of the recall figures.
The users were not, and did not need to be, aware that a number of profile
versions (representing the different search strategies) had been compiled
from each statement of interests. On a particular search run the separate
outputs from all the profile versions prepared from the user's original
statement were merged to produce a single set of notifications without
duplicates in random (document number) order. This was not only the most
convenient procedure for the user but was also methodologically necessary in