ISR10
Scientific Report No. ISR-10 Information Storage and Retrieval
The Query-Document Matching Function
chapter
Joseph John Rocchio
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
Doc. No. Corr. Doc. No. Corr. Doc. No. Corr. Doc. No. Corr.
52 1.00 52 .71 52 .71. 123 .71
311 .57 *[OCRerr]177 .70 264 .69 177 .70
77 .49 264 .69 171 .67 264 .70
53 .48 171 .67 123 .66 171 .68
264 .36 123 .66 77 .62 127 .67
171 .34 77 .62 174 .62 263 .66
174 .33 174 .62 127 .58 259 .63
123 .33 * 263 .61 259 .58 52 .63
259 .32 * 127 .58 53 .58 225 .62
47 .32 259 .58 47 .57 361 .60
309 .31 53 .58 225 .55 174 .60
236 .30Th * 361 .58 311 .54 403 .59
cutoff 47 .57 403 .49 77 .58
225 .55 336 .40 47 .57
340 .54 345 .38 (cutoff)
311 . .54 . a 53 .51
S
[OCRerr] 336 .47
__________ b.. dl
cutoff 1 345 .45
g
* - previously ne 311 .44
clustered
Initial clas- b) Correl. distri- c) Partition d) Correl. distri-
sification subset. bution of vector class of class. bution of class.
Correl. of doc.#52. formed from subset vector formed vector formed from
of part (a). from subset of subset of part (c).
part (a).
Progression of Categories and Correlation Distributions
Figure 4.7 t)1'
I,