SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Retrieval of Partial Documents
chapter
A. Moffat
R. Sacks-Davis
R. Wilkinson
J. Zobel
National Institute of Standards and Technology
D. K. Harman
results to see if it was helpful to use the rank of
the whole documents along with the rank of its
component parts.
Experiment 1: Rank full documents against
the queries using standard cosine measure.
Experiment 2: Split documents into sections.
Measure similarity of each section against
the queries using standard cosine mea-
sure. Order documents based on the highest
ranked section.
Experiment 3: Split documents into sections.
Measure similarity of each section against
the queries using standard cosine measure.
Order documents based on
0.5[OCRerr][OCRerr]1wt(s,,)
where S,, is the flth highest ranked section in
the document. The effect of this formula is
that a document's weight is determined by
a decay formula using the weight of all of a
document's sections. (Values other than 0.5
were tried, but gave poorer performance.)
Experiment 4: Split documents into sections.
Measure similarity of each section against
the queries using standard cosine measure.
Then weight each section using its type, for
example (introduction) or (address). Order
documents based on
tw(i[OCRerr]pe(s,, ))0.5"[OCRerr]1wt(s,,)
where S,, is the flth highest ranked section in
the document, type(s,,) is the type of the sec-
tion, and tw(t[OCRerr]) is the weight of the type t[OCRerr]
The weights of the types of sections were ob-
tained by conducting a set of experiments
where all types but one were given a weight
of 1, and in turn, each type was given a
weight of 2. Using these experiments it was
determined that (purpose) and (summary)
were each more helpful, and that (misc) was
less helpful. As a result, in this experi-
inent tw((purpose)) = tw((summary)) = 2,
tw((misc)) = 0.5, and other weights were set
to 1.
Experiment 5: Rank the documents, and rank
the sections. Form a new rank based on the
average rank of these two ranks.
189
Other experiments were carried out using best
two sections, and formulas that more closely ap-
proximated the cosine measure. None of these
experiments achieved better results than the ones
displayed here. The obvious conclusion is that if
documents are available for ranking as whole doc-
uments, then for this collection it is preferable to
do so.
3.3 Section retrieval
For very long documents it may be desirable to
return relevant sections rather than relevant doc-
uments. We were interested to see whether it
might be useful to know about the rank of the
containing document. In the first experiment
documents were ranked, and sections shown in
document order. This produced very poor re-
sults. Next, we still ranked sections in higher
ranked documents ahead of lower ranked docu-
ments, but used section ranking for sections in
the same document. This was reasonable but
there were still many irrelevant sections being ex-
amined. Finally, we attempted to delete these ir-
relevant sections by using document ranking, and
then section ranking, but this time discarding sec-
tions that had a section rank of greater than 200.
Experiment 6: Rank sections against the
queries using standard cosine measure.
Experiment 7: Rank full documents against
the queries using standard cosine measure.
Order the sections by their appearance
within documents.
Experiment 8: Rank full documents against
the queries using standard cosine measure.
Order sections, first by document, then by
rank within documents.
Experiment 9: As in experiment 3, but then
delete all but the 200 highest ranked sec-
tions.
These experiments show that ranking both
documents and sections does help to find more
relevant sections. This result is in contrast to
the earlier investigation of finding relevant docu-
ments. We are able to find relevant sections much
easier if the rank of both the sections, and the
containing documents are taken into account.
3.4 Paged versus section retrieval
Our results showing that retrieving documents
based on section ranking was not as useful as