MONO91
NIST Monograph 91: Automatic Indexing: A State-of-the-Art Report
Table of Contents
table of contents
Mary Elizabeth Stevens
National Bureau of Standards
Page
5. Automatic classification and categorization 106
5.1 Factor analysis 108
5.2 The theory of clumps 110
5.3 Latent class analysis 113
5.4 Examples of other proposed classificatory techniques 113
6. Other potentially related research 114
6.1 Thesaurus construction, use and up-dating 114
6.2 Statistical association techniques 118
6.2.1 Devices to display associations: EDIAC 119
6.2.2 Statistical association factors - Stiles 119
6.2.3 The association map - Doyle and related work at SDC 122
6.2.4 Work of Giuliano and associates, the ACORN devices 124
6.2.5 Spiegel and others at Mitre Corporation 126
6.3 Clues to index-term selection from automatic syntactic analysis
6.4 Probabilistic indexing and natural language text searching
127
132
6.4.1 Probabilistic indexing - Maron, Kuhns and Ray 133
6.4.2 Natural language text searching - Swanson 134
6.4.3 Full text searching - legal literature 135
6.5 Other examples of related research in linguistic data processing
6.6 Machine assistance in translations of subject content indications
to special search and retrieval language
6.7 Example of a proposed indexing-system utilizing related research
techniques
136
140
142
7. Problems of evaluation 143
7.1 Core problems 145
7.2 Bases and criteria for evaluation of automatic indexing procedures 149
7.2.1 The Cranfield project 150
7.2.2 O'Connor investigations 151
7.2.3 Questions of comparative costs 153
7.2.4 Summary: potential advantages as bases for evaluation 156
7.3 Findings with respect to inter-indexer and intra-indexer consistency
7.4 Special factors and other suggested bases for evaluation
157
160
8. Operational considerations 164
8.1 Questions of input 164
8.2 Examples of processing considerations 168
8.3 Output considerations 171
9. Conclusion: Appraisal of the state of the art in automatic indexing 173
v