CRANV2 Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2 Methods for presentation of results chapter Cyril Cleverdon Michael Keen Cranfield An investigation supported by a grant to Aslib by the National Science Foundation. Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. - 56 - important practical advantage, namely the comparative ease of calculation. To have used the method of the average of ratios would have increased the calculations forty-fold; work that has taken hundreds of hours would have taken hundreds of weeks. The really important matter in any test is to know which method is being used and to use it consistently in all situations. Method of totalling results Apart from deciding on whether to use the average of ratios or the average of numbers, we were faced with the additional problem which is involved in totalIing results of sets of searches where tim co- ordination level cut-off is employed; an idea of what is involved in this problem can be seen in Appendix 4A. There are given the performance results, in actual figures, for the 221 questions (subset 3), being tested on Language I. la (single terms, natural Ianguage and coordination), Exhaustivity 3, Search rule type A, Document Relevance 1 4 and searched on the 1400 document collection. The questions are arranged in numerical order, and for each question is given the total number of relevant documents in the whole collection, followed by the relevant and non-relevant documents actually retrieved at each coordination level. In the final column is given the sum of the total number of postings for the search terms; the total must, of course, equal the sum of the relevant and non-relevant documents at all coordination levels. The variations between the 221 questions that affect the problem of arriving at a single result of a single performance curve for the 221 questions can be seen in tabular form in Fig. 3.20T. Here the two characteristics of the 221 questions are listed, namely the numbers of terms initially selected from the search question and used as search terms (starting terms), and the number of retrieving terms, that is the maximum number of starting terms which, used in logical product coordination, may be put to the index and will still retrieve documents (whether relevant or non- relevant). The table shows how, for this particular.test, the starting terms ranged from 2 to 15, and the retrieving terms varied from 2 to 10. Within this 14 x 9 matrix the actual number of questions involved is recorded, so it can be seen, for example, that of the 35 questions having seven starting terms (column headed 7) only three of these questions could coordinate all seven terms and still retrieve some documents. The figures in the table refer.only to the particular index language in use, and a different index language such as index language 1.5 which includes synonyms, word endings and quasi-synonyms, would alter the distribution of the questions in relation to the retrieving terms, while any test involving a different basic index language (such as simple concepts as compared to single terms) would alter the starting term groups also.