CRANV2 Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2 Citation indexing and bibliographic coupling chapter Cyril Cleverdon Michael Keen Cranfield An investigation supported by a grant to Aslib by the National Science Foundation. Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. - 243 - CHAPTER 7 CITATION INDEXING AND BIBLIOGRAPHIC COUPLING It is true [OCRerr]hat librarians do an almost religious job of storing information; it is 9]aced on record, but without evaluation, and much of it is not worth its rental of space. Each oper- ational unit tends to use its own special language, and translators are very few. In warfare, the questioning of returning forces is regarded as a highly skilled speciality. Would social action (and perhaps social research) not gain much if an analogous speciality could be created to assess and consolidate relevant information? Such an organization could not confine its attention to information retrieval, no matter }low efficient such a retrieval process might be. Storage and retrieval systems do not represent evaluation and consolidation of information . L.T. Wilkins: Social Deviance Page 111 In Chapter 7 of Vol. I, an account was given of the compilation of a citation index, and the subsequent preparation of bibliographic coupling groups, with a view to an evaluation being made of this index. As stated in this earlier account, it is a matter for some argument as to how this type of index can be tested in an experimental situation. In carrying out a test in an operational cnvironment, there would be no difficulty beyond the effort required, but although several different ways of presenting the results of this test have been considered, there does not appear to be any procedure which can be considered entirely satisfactory. For this reason two sets of figurcs are being given; the first method probably results in a performance which is better (in comparison with the result[OCRerr] obtained with conventional systems) than should be the case, but it has the major advantage that it does not involve any manipulation of the test results and therefore permits direct comparison to be made between different subsets of questions, different relevance decisions etc. If an evaluation of a citation index can be carried out in an experimental environ- ment, then the second method of presenting results is probably nearer the real performance of the system, and is used for comparison with the results of the conventional index languages. As described in some detail in Vol. I (page 110 and Fig. 7.5) the score sheets for each question gave the results with coupling strength from 1 to 7. The basic scoring at the seven coupling levels for the 42 aerodyn- amics questions with the 1400 document collection is shown in Fig. 7.1T and the results for this set of questions are presented in Fig. 7.2T. Fig. 7.3T presents the results for the 42 questions dealing with structures, while Fig. 7.4T gives the results for the 35 questions having 7 starting terms. A comparative plot of these three question sets is given in Fig, 7.5P. All the results so far shown are obtained with documents of relevance grades 1-4; Figs. 7.6T, 7.7T and 7.8T show the results for documents of relevance grades 1-3, relevance grades 1-2 and relevance grade 1, with the 42 aerodynamics questions, Fig. 7.9P plots the results of the four grades of relevance.