CRANV1P1
ASLIB Cranfield Research Project: Factors Determining the Performance of Indexing Systems: VOLUME 1. Design, Part 1. Text
Indexing Procedures
chapter
Cyril Cleverdon
Jack Mills
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
-47 -
by a mixture of the devices already described - by hierarchical linkage, by confound-
Lugword forms, etc. "L'Unite" uses none of these, although the last-mentioned will
in fact normally be a prominent accompaniment of its class definition, since different
word forms will usually appear in the same alphabetical cluster. So it must be ad-
mitted that it forms a discrete index device in its own right.
The above eleven devices may be compared with the list of techniques for con-
trolling index languages given by B. C. Vickery in his book 'On retrieval system
theory' (Ref. 9 ) (see Chapter 1). Of those listed above, three devices, namely Bib-
liographic coupling, Associative machine indexing, and L'Unite, were not mentioned
by Vickery. The others include all the techniques given by Vickery, since a number
of these were variants of the more broadly defined devices above.
We have tried to distinguish the basic device itself, as a method of class definition,
from the different ways in which it might be implemented in different index languages.
The latter may be regarded as different amalgams of the various devices, with fur-
ther differences resulting from the various methods of file organization.
The different ways in which coordination is applied in preeoordinate and post-
coordinate systems, have already been mentioned. The fundamental difference is
that in the former a limited number of coordinations are made and the resultant com-
pound headings are then filed in linear order, and rules observed (as to citation order,
etc. } to allow determination of the exact position of any particular combination. This
difference has repercussions for the other devices. Even in a largely single-entry pre-
coordinate system, for example, 'weighting' of an elementary kind is implicit in
the restrictions placed on the number of entries which can be recognized. Links are
fundamental to a precoordinate system by 'partitioning', e.g., separate entries
would be made for Copper-Hardness and for Titanium-Conductivity.
Roles are indicated in a precoordinate system by citation order or by explicit
syntactical devices; in a faceted index an index description such as
Wings - High aspect ratio - Drag - Low angle of attack
conveys by its citation order of Thing (Wing, etc. ) - Property (Aspect ratio, etc. ) -
Process (Load, Drag, etc} - Condition (Aerodynamic parameters, Angle of attack,
etc. ) that High specifies Aspect ratio and Low specifies Angle of attack. In an Alpha-
betical Subject Catalogue, a similar function is served by such headings as Children in art,
Pressure vessels - Heat transfer,
Acceleration - Psychological effects.
Synonyms are treated with varying strictness of interpretatio[OCRerr] in different systems.
Often, it is a reflection of the degree of specificity sought, as when one system dis-
tinguishes Potential flow from Irrotational flow and another confounds them. The
problems of synonymity occurring at the level of multiple term descriptions raises
a particular problem for post coordinate systems, since it implies a degree of pre-
coordination at some point in the system; e.g. a Ground effect machine is a synonym
for Air cushion vehicle, although there is no synonymity between the individual con-
stituent terms.
Confounding of word forms may be implemented at the indexing stage, via a con-
trolled vocabulary (e. g. , Conducting see Conduction}, or by search rules (e. g., accept
Conducting + Conduction + Conductor}. In a precoordinate system, where the dif-
ferent forms are separated in different categories, confounding may be possible only
at the search stage; e.g. , by consulting the A/Z index of a classified index and ob-
serving the variant forms used.