CRANV2
Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2
Supplementary tests and results
chapter
Cyril Cleverdon
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
- 230 -
Since this direct adjustment on the basis of generality does not equate
performance, it is therefore necessary to consider whether N should be revised
for the structures questions. It has already been established (on pages 71 to
76} that real N for the aerodynamics questions is not 14',)0, but in the region of
1027, at which figure the true generality number is 4.6. One might first
hypothosise that the remaining 373 documents represent N for the structures
questions; the corresponding generality number would be 16.3. To match
this new figure, the adjusted precision ratio for the aerodynamics questions
would now be 10.6%. which is higher than the figure {8.6%) for the structures
questions. It therefore appears that in the collection of 1400 documents,
there must be a subset which is common to both. Using the methods given
on pages 71 to 76, N for the structures questions is shown to be probably
at least 474, which gives a generality number of 12.8. Adjusted to this
generality number the precision ratio for the aerodynamics questions is now
8.6%, the same as for the structures questions. The fallout ratios also
now match; for the aerodynamics questions, where N = 1027, the fallout
ratio is 9.2%; for the structures questions, where N = 474, the fallout ratio
is 9.270%.
The phrase "probably at least 474" was used because no account has
been taken of the possibility that the performance figures will be affected by
the comparative firmness of the terminology of aerodynamics and structures.
The phrase, in fact, implied a belief that aerodynamics has the mushier or
more imprecise language, and that for this reason, one would expect the
set of structures questions to provide the better performance.
However, the matter is complicated even if this latter point is ignored.
At a coordination level of five, the structures questions have a performance
of 16.1% recall and 18.1% precision. No exact matching figures can be
obtained from Fig. 4.120T, but reference to 4.125P shows that, for the
aerodynamic questions, at 16% recall, precision would be approximately
25%. Adjusted for generality.on the basis worked out earlier, this would
increase tile precision ratio to 62%, which is far in advance of the figures
for the structures questions. On the other hand at a single term level, it is
found that the 42 structures questions have retrieved a total of 22,929
documents, which is an average of 538 documents for each question. This is
a figure larger than the 474 documents earlier hypothosised as representing N.
The above discussion is neither clear nor conclusive, and offers no
explanation for the crossover in the performance figures of the two sets of
questions {which is probably an 'otyerration caused by the relatively small
number of results). Rather it serves to point up some of the difficulties
which are involved in- attempting to compare performance in different subject
areas by the coordination level cut-off, and emphasises the necessity for
further research in this and related fields.
Performance comparison by coordination levels
In Chapter 4, all the tables of results and accompanying performance
curves were based on the variation of coordination level. From these
tables, sets of figures are extracted where the coordination level is held
constant while the variable is the index language. Figs. 6.10T and 6.11T
deal with the Single Term index languages at a coordination level of 3 and
6. Figs. 6.12T and 6.13T present the results at coordination levels of
2 and 4 for the Simple Concept index languages, while Figs. 6.14T and
6.15T present results at the same coordination levels for the Controlled
Term index languages.