CRANV2 Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2 Test Environment chapter Cyril Cleverdon Michael Keen Cranfield An investigation supported by a grant to Aslib by the National Science Foundation. Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government. -22- two subsets which had 19 questions with 7 starting terms and 17 questions with 11 starting terms. QUESTIONS 1. Relevance assessments, 4 grades 2. Differing number of starting terms and retrieving terms 3. Differing totals of relevant documents 4. Two sources of questions, 'basic' and ,supplementary' 5. Question sets of different sizes, picked according to different criteria, searched on collections of varying sizes. COLLECTION SIZE 1. 1400 documents 2. 350 documents from the 1400,documents. 3. 200 documents from the 350 document subset. SUBJECT TERMINOLOGY 1. Aerodynamics 2. Aircraft Structures. FIGURE 2.11 SUMMARY OF MAIN ENVIRONMENTAL FACTORS By the time we came to investigate the simple concept languages and the controlled term languages, the clerical effort involved in carrying out searches precluded the use of the full sets of questions, and accordingly a set of 42 questions was prepared, consisting entirely of questions in the field of aerodynamics. It is this set which is used for presenting the majority of the test results in Chapter 4. At a later stage, this subset was extended to 77 questions in the field of aerodynamics; finally an additional set of 42 questions in the field of structures was compiled for purposes of comparison, with the aerodynamic question set of similar sizes. The subsets of questions are all numbered, and details of these appear in Fig. 2.12. Lists of the question numbers for subsets 1, 2 and 3 were given in Vol. I, Appendix 3E; the remaining subsets are shown in Appendix 3.2 of this volume. Reduced collection sizes were also used for reasons of the effort involved in testing. This was not only the clerical effort involved in the searching, but also the intellectual effort involved in compiling word lists for the various index languages. When it was decided to test simple concepts, a set of 200 documents was chosen, and the initial task involved re-formulating the indexed concepts from the original