CRANV2 Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2 Test Environment chapter Cyril Cleverdon Michael Keen Cranfield An investigation supported by a grant to Aslib by the National Science Foundation. Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. - 26 - comparison was made of 42 questions on aerodynamics and 42 questions on structures. Fig. 2.13 is a chart showing the distribution of the number of relevant documents throughout the 221 questions, from which it can be seen that the range is from six questions each having only one relevant document to one question which had forty relevant documents. Sample Precision Results At low precision ratios, the clerical work involved in obtaining correct figures was so great that in some cases it did not appear to be justified. This was due to the large number of non-relevant documents which would be retrieved and therefore had to be recorded. With index language I. la, results were obtained down to the single term level but with other index languages the decision was taken that, with the searches in the 1400-document collection, no attempt would be made to obtain precision figures below 5[OCRerr]0. This. however, introduced a variation between questions, since for a question having six starting terms, a precision figure lower than 5% might not be reached until the coordination level was down to two terms. However, with a ten starting term question, this figure might be reached by the coordination of four terms. In the presentation of the test results, note has been taken of this point and also the additional point regarding the number of questions capable of giving results, this being dependent on the number of starting terms which each question had. This can be best illustrated by referring to Fig. 2.14, which presents condensed results for 221 questions on the 1400-document collection with Index Language 1.2a. The column headed 'z' presents the figures for the number of questions that were potentially capable of giving results, and it can be seen that at a coordination level of 2, every question came'in this category. However, at a coordination level of 3, the total has dropped to 220, this indicating that there is one question which had only two starting terms. At a coordination level of 4, the total drops to 212, showing that there are eight questions with only three starting terms. As the coordination level rises, so the number of questions drops until, at a level of 15, it is seen that only three questions have thisnumber of starting terms. The column headed 'y' shows the number of questions which actually contributed figures for the calculation of the precision and fallout ratios - not. it should be noted, for the recall ratio which was always checked down to single term level. In Fig. 2.14, y is equal to z from a coordination level of 15 down to a coordination level of 7, and therefore the precision and fallout ratios can be based on complete data. However, at a coordination level of 6, only 161 questions were searched, and the precision and fallout ratios have been calculated on the basis of the non-relevant documents retrieved in these 161 searches. To indicate this, an asterisk