ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Information Analysis and Dictionary Construction
chapter
G. Salton
M. E. Lesk
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
Iv-70
Case 1 5ynOfl3TmO[OCRerr]5 te[OCRerr]ns
v1 = ( 3, 0, 0, 5, 1, 0
= ( 2, 0, 1, 5, 2, 0
[OCRerr](2, 0, 0, 5,
= [OCRerr](3, 0, 0,
1, 0) 8
5, 1, 0) 9
[OCRerr]2, 0, 0, 5,
C _ -
31 [OCRerr](2, 0, 1,
1, 0) 8
5, 2, 0) 10
Assuming cut-off K [OCRerr] [OCRerr].
Case 2 unrelated terms
v1 = ( 3, 0[OCRerr] 0, [OCRerr]` 0)
= ( 0, 1, 3, 0,1, 0)
... = 1 C.. 1
-13 [OCRerr] -31 [OCRerr]
and C.. > K
31
For cut-off K = 0.7[OCRerr]c.. and %. [OCRerr] <K
13
Case 3 term i is a parent of term j
_ = ( 3, 0, 0, 5, 1, 0
_ = ( 1, 0, 1, 3, 2, 0
... = 6
-13 9
c.. =6
-31 -
7
Here c..<K and C.. >K[OCRerr]term i isparentof j
-13 -13
Sample Automatic Hierarch[OCRerr] Formation
Fig. 22