ISR10
Scientific Report No. ISR-10 Information Storage and Retrieval
The Indexing Function
chapter
Joseph John Rocchio
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
2-11
Keyword Set: £A,B,C,I),E,F,G,H[OCRerr]
a) The Property Space
Keyword
A
B
E
R
I
Relative
Frequency (Si[OCRerr]i£ic,ance)
1
3
10
5
b) .[OCRerr]..Tabular[OCRerr]J)oc[OCRerr]ent?:-:Repreaentation
p
d = [OCRerr]E,H,B,A}
c) Document Ima[OCRerr] as a Set
d = (1,1,0,0,1,0,0,1)
d) Document lma[OCRerr] as a Binary Property Vector
(Keywords in lexical order)
d = (1 ,3,0,0,10,0,0,5)
*e) Document Image as a Numeric Property Vector
(Keywords in lexical order)
5
Alternative Property Space Index Representations
Fi[OCRerr]ure 2.2