ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
On Some Clustering Techniques for Information Retrieval
chapter
J. D. Broffitt
H. L. Morgan
J. V. Soden
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
i =0
W=D
10
if- i = l,m
C = fi)
I Wn==Wl[OCRerr]C
?0=W
= C(n)
5k
D
F
w
Ix-9
= (jJs3 = 1 in "lasttt similarity matrix)
k
= (ild.E D, i=1,.. .,m)
1
= list of clusters found
= subset of D from which clusters are to
be dra[OCRerr]m
C = possible cluster currently being formed
P = list of documents currently being con-
n
sldered for inclusion in C
j = no. of clusters found
m = no. of d.E D
1
n = no. of documents currently in C
P = S fl P
In k n-[OCRerr]
P =[OCRerr] N
n
Y.
<$4~Cc~?FN
[m=nC:lC(n)
ifflz=nC++lPn(l)
j =j+l
C
F=F+T
Y
n = 0 N k = C(n) F
9
Y N____
10 __________
LL-j=%l[OCRerr]%l(l
LIST THEj
CLUSTERS, Ti, I
\½[OCRerr]NF
Bonner's Cluster Building Algorithm II
Figure 2