Information Retrieval Experiment

IRE Information Retrieval Experiment The Cranfield tests chapter Karen Sparck Jones Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. 282 The Cranfield tests alternative relevance assessments25, and Lesk and Salton's study26, showed that comparative performance was not materially affected by different relevance assessments. Second, the test was not concerned with specific, established indexing languages like the UDC, or language types like facetted classifications, so the results could not be regarded as threatening by language proprietors and advocates. At the same time, insofar as the devices studied might be regarded as associated with types of language, it is likely that Cranfield 1 had softened up the potential opposition. Third, other work, notably by the Smart Project, suggested that the results were not eccentric, but could be paralleled for other data, with alternative test designs. Fourth, an increasing interest in the use of natural rather than artificial, controlled indexing languages, both for intellectual reasons among research workers and practical ones among the new computer-based system operators, meshed in with the Cranfield 2 results. An additional reason was perhaps the realization that anyone wishing to subvert the Cranfield findings, either in terms of a laboratory test or operational system investigation, would have to do a great deal of work. For Cleverdon personally, Cranfield 2 was followed on the one hand by controversy about the recall/precision relationship27, and on the other by involvment in tests like the DOAE28 and Precision Engineering29 studies of 1970, and the NASA experiment30 of 1977, described in Chapter 12. These tests clearly show their descent from Cranfield 1 and 2, in being concerned with comparing index languages in relation to language control or indexing descriptions in relation to description exhaustivity; but they also represent studies of indexes and indexing in new environments: the NASA test for instance involved online searching. The underlying continuity of Cleverdon's work is also apparent in his interest in costs, which repeatedly figures in his discussions of system evaluation. In the research community there were two direct responses to Cranfield 2. One was the application of Cranfield 2 principles and practices in other tests, usually selectively, or with modifica- tions, but perceptibly: the Inspec tests31 and Keen's ISILT project32 are good examples of this trend. Other projects, like that of Sparck Jones33, utilized Cranfield 2 performance representation methods, for example. The second response was to exploit the Cranfield 2 collection, once it had been made available in machine-readable form. A great many Smart Project experiments were carried out with the Cranfield data34, and the collection was also used by, for instance, Svenonius35 in the US and by Sparck Jones and Bates36 and van Rijsbergen and Croff37 in the UK. It is more difficult to assess the indirect influence of Cranfield 2 than of Cranfield 1. Substantively, Cranfield 2 suggested that retrieval systems may not work very well and are difficult to upgrade significantly. But this was being suggested by other tests in the later 1960s as well. Methodologically, it is very probable that Cranfield 2 encouraged care in the analysis of systems and the conduct of experiments, but by the later 1960s the same point was being made by other projects too. Cranfield 2 did not, moreover, lead directly into work on those topics which were regarded as most important in the 1 970s, namely the effects of searching on performance and the design of mechanized systems. But that its indirect influence was great can be inferred from the fact that so many research workers in the late 1960s and [OCRerr]970s referred to Cranfield.