Scientific Report No. IRS-13: Information Storage and Retrieval
Thesaurus, Phrase and Hierarchy Dictionaries
E. M. Keen
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
                                                                  Word Stems Appearing in
Collection and        Distinct      Concepts in   Average Words Per   More Than One Concept
Dictionary            Text Words    Thesaurus     Thesaurus Concept   Total      Percent
-------------------------------------------------------------------------------------------
  IRE-3 Thesaurus-2   *5,477        511           10.72               451         8.2%
  IRE-3 Thesaurus-3   *5,477        686            7.98               159         2.9%
  CRAN-1 Thesaurus-1   3,291        377            8.73               155         4.7%
  CRAN-1 Thesaurus-2   3,291        495            6.65               389        11.8%
+ CRAN-2 Thesaurus-3  *7,449        736           10.1                 78         1.1%
  ADI Thesaurus-1      8,099        541           14.97                54         0.7%
  ADI Thesaurus-SAl    8,099        289           28.02               416         5.1%
-------------------------------------------------------------------------------------------
* Estimated Values
+ Data for the use of this thesaurus with CRAN-1 are not available.
Grouping Characteristics of Seven Thesaurus Dictionaries
Fig. 1
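
The two derived columns of Fig. 1 can be checked against the counts in the same rows. The sketch below is illustrative only and is not part of the original report: it assumes that the average is the number of distinct text words divided by the number of thesaurus concepts, and that the percentage is the number of multi-concept word stems divided by the number of distinct text words. The names ROWS, derived, and max_gaps are hypothetical. Under these assumptions the recomputed values agree with the reported ones to within rounding; the largest gap is in the CRAN-2 row, whose word count is an estimated value.

```python
# Rows transcribed from Fig. 1:
# (dictionary, distinct text words, concepts, reported average,
#  word stems in more than one concept, reported percent)
ROWS = [
    ("IRE-3 Thesaurus-2", 5477, 511, 10.72, 451, 8.2),
    ("IRE-3 Thesaurus-3", 5477, 686, 7.98, 159, 2.9),
    ("CRAN-1 Thesaurus-1", 3291, 377, 8.73, 155, 4.7),
    ("CRAN-1 Thesaurus-2", 3291, 495, 6.65, 389, 11.8),
    ("CRAN-2 Thesaurus-3", 7449, 736, 10.1, 78, 1.1),
    ("ADI Thesaurus-1", 8099, 541, 14.97, 54, 0.7),
    ("ADI Thesaurus-SAl", 8099, 289, 28.02, 416, 5.1),
]


def derived(distinct_words, concepts, multi_concept_stems):
    """Recompute (average words per concept, percent of stems in >1 concept).

    Assumption: both ratios are taken over the distinct text words.
    """
    average = distinct_words / concepts
    percent = 100.0 * multi_concept_stems / distinct_words
    return average, percent


def max_gaps(rows):
    """Largest absolute gap between recomputed and reported values."""
    avg_gap = 0.0
    pct_gap = 0.0
    for _, words, concepts, rep_avg, stems, rep_pct in rows:
        average, percent = derived(words, concepts, stems)
        avg_gap = max(avg_gap, abs(average - rep_avg))
        pct_gap = max(pct_gap, abs(percent - rep_pct))
    return avg_gap, pct_gap


if __name__ == "__main__":
    for name, words, concepts, rep_avg, stems, rep_pct in ROWS:
        average, percent = derived(words, concepts, stems)
        print(f"{name:20s} avg {average:6.2f} (reported {rep_avg}) "
              f"pct {percent:5.2f} (reported {rep_pct})")
```

Because the average is taken over distinct words, a word stem assigned to several concepts is counted only once in it, so the mean concept size including repeated stems would be slightly larger than the tabulated figure.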