CRANV2 Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2 Conclusions chapter Cyril Cleverdon Michael Keen Cranfield An investigation supported by a grant to Aslib by the National Science Foundation. Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. - 252 - CHAPTER 8 CONCLUSIONS The first step in testing a theory (qua theory) is to examine it to see what deductions can be made from it - to set up post- ulates which may be tested either experimentally or by observ- ations of the 'real-life' situation. That is to say. the first step in testing a theory is to state the practical consequences of it. If the deduced practical consequences (operational definitions) are proved to be unsustained, the theory is discre- dited. No theory can ever be proved to be true; it is held for so long as no better theory can be found. L.T. Wilkins: Social Deviance Page 36 Although the results presented in this volume inevitably represent only a condensation of the tens of thousands of individual results which have been obtained, it is hoped that they are in sufficient detail for anyone interested to make their own interpretation. It might, therefore, be argued that much of this final chapter is redundant, and that it would be better to leave readers to reach their own conclusions. However, the following comments are offered as a personal contribution, with the hope -and expectation - that others will feel free to deduce and argue. The results have been presented in three main ways. Firstly, there are the details of the search results for the various index languages, recall and precision devices and search rules as obtained with the conventional coordin- ation level cut-off. Secondly, some of these results have been regrouped to illustrate various aspects of the test and thirdly, many of the test results have been re-calculated by the document output cut-off method based on simulated ranking. While the opinions presented in this chapter may be illustrated by referring to a particular set of figures, they are not usually based on a single result. Within the definition as given in Chapter 2 of Volume 1, every set of figures supports the original hypothesis of an inverse relationship between recall and precision. It is immaterial which variable is changed to give a new system; it may be the coordination level (e.g. Fig. 4.100T), the exhaustivity of indexing (e.g. Fig.4,912P),the recall devices (e.g. Fig. 6/0T),the precision devices (e. g. Fig. 6. t 7]-), the search programmes (e. g. Fig. 4. 850T), or the relevance decisions (e.g. Fig. 6.3P); it has been impossible to find any exception to what can be claimed as a basic rule. Quite the most astonishing and seemingly inexplicable conclusion that arises from the project is that the single term index languages are superior to any other type. This is mainly evidenced by the results based on the normalised recall ratios of Fig. 5.15T, but also, although less obviously, by the comparison of different systems using the conventional coordination level cut-off (see Fig. 6.2P). This conclusion is so controversial and so unexpected that it is bound to throw considerable doubt on the methods which have been used to obtain these results, and our own first reaction was to doubt the evidence. A complete recheck has failed to reveal any discrepancies, and unless one is prepared to say that the whole test conception is so much at fault that the results are completely distorted, then there is no other course except to attempt to explain the results which seem to offend against every canon on which we were trained as librarians.