CRANV2
Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2
Supplementary tests and results
chapter
Cyril Cleverdon
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
- 228 -
Coord -
ination
Level
1
2
3
4
5
6
7
FIGURE 6.7T.
of
numbers
1.1%
2.7%
5.2%
13.5%
23.8%
37.2%
50.0%
Average of Ratios
1 2 3
(Total divided (Total divided (Total divided
by 35) by figure by figure
shown in shown in
brackets) brackets)
1.4% 1.4%(35) 1,4%(35)
4.3% 4.3%(35) 4.4%(34)
8.0% 8.2%(34) 9.0%(31)
16.0% 20.1%(28) 24.4%(23)
21.7% 42.2%(18) 54.3%(14)
11.0% 55.0%(7) 64.2%(6)
6.7% 77.8%(3) 77.8%(3)
PRECISION RATIOS OBTAINED BY THREE
DIFFERENT AVERAGE OF RATIOS PROCEDURES.
use the average of numbers.
Comparison of documents dealing with aerodynamics and structures
The main sets of test results in Chapter 4 were concerned with a
subset of 42 questions all of which dealt with aerodynamics rather than
structures. For comparison purposes, a set of 42 questions on structures
was prepared. Searched on the 1400 document collection, with index
language I.l.a, the tests results are given in Fig. 6.8T. Comparison is
made in Fig. 6.9P with the results as given in Fig. 4.120T for the 42
aerodynamic questions under the same conditions. This plot shows an
unusual characteristic, in that at the higher recall levels, the structure
questions have superior precision, but at a recall ratio of about 25%, the
curves cross over, and the aerodynamic questions have the better
performance.
There are two" reasons why one would expect the structure questions to
do better. Firstly there are more relevant documents, and therefore the
generality number is higher, namely 4.3 as against 3.4. Secondly, although
to calculate the generality number N is presumed to be 1400, real N must
(as argued on pages 71 - 76) be considerably less than this number.
If the position at a coordination level of 3 is considered, the perform-
ance figures are as follows:
Aerodynamics Structures
(As'Fig. 4.120T) (As Fig. 6.9T)
Recall Precision Fallout Recall Precision Fallout
Ratio Ratio Ratio Ratio Ratio Ratio
66.7% 3.2% 6.790% 67.5% 8.6% 1.732%
To allow for the difference in the generality number, the precision
ratio for the aerodynamic questions can be adjusted by the equation given
on page 73 and this would result in a new precision ratio of 4.1% which
continues to be well below the comparable figure for the structures
questions.