CRANV2
Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2
Methods for presentation of results
chapter
Cyril Cleverdon
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
-35
of situations can be taken from these tests. Relevance decisions were
based on four levels of relevance; ff we consider Relevant 1 documents,
there are 12 such documents relevant to the 42 questions of subset 2.
Relevance 1 and 2 documents come to 57, Relevance 1, 2 and 3
documents total 154 and Relevance 1-4 documents come to 198. It
can be "seen that changing the decision as to the relevant documents
(a + c) materially alters the proportion of relevant documents in the
complete collection (N).
On the other hand, the collection size can be changed. Originally
there were 1400 documents in the collection. A subset of the collection
was formed which consisted of 200 documents; a characteristic of this
subset was that it retained all of the 198 documents that were relevant
to the 42 questions of subset 2.
While the number of relevant documents is now held constant, the
proportion changes because of the reduction in the document collection
from 1400 to 200 documents, it is convenient to express this variation
as a parameter, and this is the aforementioned Generality lmmber i.e.
1000 (a + c)
, the total relevant documents divided by the collection
N
size, with a constant. This parameter is not a measure of retrieval
performance, but one which reflects the environment of the relevance
decisions made; e.g. if the generality number for a set of questions
is 5, this means that there are, for each question, an average of five
relevant documents for every thousand documents in the collection,
irrespective of what the actual size of the collection might be. For the
example given above, the change from the larger to the smaller collection
size (bearing in mind that there are 42 questions} changes the generality
1000 x 198 1000 x 198
number from 42 x 1400 - 3.4 to 42 x 4-4-90"[OCRerr]=c[OCRerr] 23.6. Therefore,
as far as retrieval performance is concerned, the significance of a change
in either the relevance decisions or the collection size is that in both
cases it is the generality number which alters.
The single performance measures that can be used can be listed
as follows:-
a
a+c
usually known as Recall Ratio; at Western Reserve University
it is called 'Sensitivity'. and has also been called 'Hit Rate'.
12
a ÷ C
complementary to recall ratio. Called by Fairthorne, 'Snobbery
Ratio',
a
a+b
now generally known as Precision Ratio, formerly called by
Cranfield 'Relevance Ratio', "Also described as 'Pertinency
Factor' or 'Acceptance Rate'.