CRANV2
Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2
Methods for presentation of results
chapter
Cyril Cleverdon
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
- 33 -
majority of cases, the most straightforward and most meaningful method
is the Recall/Precision curve. These are, in general, the measures used
in this report, although to illustrate certain points various other measures
and methods of presentation are used.
A detailed account of the Cranfield work on performance measures
has been presented in a thesis by M. Keen, but the following is a resum6
of the more important points which led to the decisions; other matters
relevant to the presentation of results in this volume are also considered.
In tests of experimental systems, it is essential that measures should
be used that accurately reflect the changes in the particular component
being tested, which primarily, in this particular test, was a range of index
language devices. In addition, there is the strong desirability, if not
the absolute necessity, that it should be possible to make direct comparison
between different sets of test results.
Measures of retrieval performance may be used in experimental
tests of information retrieval systems when the following requirements are
met: -
1.
2.
A document collection of known size to be used in the test;
A set of questions, together with decisions as to exactly
which documents are relevant to each question;
3. A set of results of searches made in the test; these
usually give the numbers of documents retrieved in the
searches, divided into the relevant and non-relevant
documents.
The successive dichotomies of the total collection have been displayed by
B.C, Vickery (Ref. [OCRerr], page 174) by the following table:-
TOTAL COLLECTION
RELEVANT NON-RELEVANT
NOT RETRIEVED RETRIEVED NOT RETRIEVED
(c) (a) (b) (d)
The more usual way to present the categories is in the form of a
2 x 2 contingency table as shown in Fig. 3:2. The notation given in
this figure will be used throughout the remainder of this report.