Information Retrieval Experiment

IRE Information Retrieval Experiment An experiment: search strategy variations in SDI profiles chapter Lynn Evans Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. Results 301 (averages of numbers) were also calculated. Perhaps it should be pointed out that the precision values were rather superfluous since, with the ranked- output cutoff procedure, all the available information is contained in the recall figure-at a particular cutoff point if the recall figure for one strategy is better than that for another then the precision figure is automatically similarly so. The single performance figure usually associated with the ranked-output cutoff procedure is normalized recall1 5. Strictly this requires knowledge of the positions of all relevant items in the ranked output. The data used here were the recall figures at 9 cutoff points down to the 55th ranked document. Taking the average of the recall figures at the nine cutoff points gave an `average' recall value which approximated to the normalized recall in its effect of ranking the search strategies in an order of merit. This order together with the approximated normalized recall values for the eight non-boolean search strategies for runs 1, 5 and 6 are shown in Table 14.3. No great weight can be given to the actual values of the approximated normalized recall since they depend on the number and positions of the cutoff points used. However with different cutoff points the relative positions of the search strategies would not be expected to change. TABLE 14.3. Ranking of search strategies by normalized recall [OCRerr]ased on 9 cutoff poInts, averages of Dumbers) Order Relevance RI documents Relevance RJ/2 documents of merit Search strategy (normalized recall) Search strategy (normalized recall) Runi RunS Run6 Runi RunS Run6 1 TWC (51.5) TWC (58.5) GWC (56.4) TWC (44.5) GWC (49.7) TWC (47.8) 2 GWC (51.4) GWC (57.3) TWC (54.7) GWC (44.2) TWC (49.2) GWC (47.5) 3 GTWC (49.0) GTWC (55.7) GTWC (53.4) COW (42.5) COW (46.9) CTW (45.9) 4 CRTW (47.1) CRTW (53.0) COW (52.0) CTW (42.4) CTW (46.3) COW (45.8) 5 COW (47.0) CTW (52.8) CO (50.3) CRTW (41.9) CRTW (45.6) CO (44.3) 6 CTW (46.1) COW (52.6) CTW (50.0) CO (40.1) OTWC (45.2) CRTW (43.9) 7 CO (43.1) CO (48.6) CRTW (49.4) CT (39.4) CO (43.8) CT (43.7) 8 CT (41.8) CT (47.3) CT (48.5) OTWC (39.4) CT (42.6) OTWC (43.2) I I Some points to emerge from Table 14.3 are: (1) The two strategies TWC (term-weight cumulation) and GWC [OCRerr]roup- weight cumulation) are always the best, occupying first and second positions on all three runs for both relevance Rl and Rl/2 documents. (2) Strategy CT (simple co[OCRerr]ordination of terms) performs the worst, always being in one of the last two positions. (3) Perhaps surprisingly, strategy CRTW (co[OCRerr]rdinate matching of restricted list of terms) holds its own with the others, mostly taking the middle positions. (4) Strategy GTWC k%roup/term `powers of 2' weight cumulation) is unusual * in being always third best for relevance Ri documents but well down the lists when considering R1/2 documents. This suggests that the harsher consequences of `powers of 2' weighting are not particularly suited to retrieval of `in-between' partly relevant documents.