IRE Information Retrieval Experiment Opportunities for testing with online systems chapter Elizabeth D. Barraclough Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. The opportunities of real life Systems 131 ilic current file. Thus any recall and precision assessments must be done at I lic s[OCRerr]ime time to ensure that the same part of the database is used otherwise I[OCRerr]iiI1)arisons are not valid. System providers can also add new databases to flicir files and hence extend the search possibilities; again, by making [OCRerr]`[OCRerr]ii11parisons at the same time, this problem can be avoided. Iii vestigation of the facilities provided by the system can be affected by Ii[OCRerr]inges that the system providers may make over time. Thus they could offer hiore facilities for inspecting the dictionary and any statistics collected over %iich a change would be invalid. Perhaps the most serious constraint imposed by the system on the user and hence on the test possibilities, is the time pressure caused by the method of charging. Many users, or their librarian intermediaries, are very conscious of tuc cost and therefore look for a quick solution rather than taking time to consider how to achieve a better result. The system providers are in some (Iirnculty here; they have to implement a charging mechanism which covers both the cost of running the system and the cost of creating and making 4ivailable the databases. They are also constrained by the facilities that are provided by the machine manufacturers for implementing a charging scheme. As a result of these constraints most systems base their charges on (he time the user is connected to the system and the number of references retrieved and printed. The second parameter is probably quite justifiable. The first parameter also has benefits for the system provider as it encourages users to access the system at slack times as an overloaded system will run more slowly and thus be more expensive. From the user's point of view, this method of charging is most unfortunate. He is charged for thinking time, term selection and searching in the database at precisely the same rate. He ought, at least, to be able to think `freely'. Also many facilities that only require access to the dictionary are not costly in computer time or storage, but would, if provided cheaply, offer the means of formulating much better searches. For users who are accessing the systems from remote locations, particularly in the UK, the charging is confounded by the fact that telecommunications costs also have a component which is duration dependent in addition to the charges related to the volume of data. The effect of this crude method of charging for system use is to force users to take a gross view of the system rather than considering the effectiveness of different parts of the system. This naturally imposes the same limitations on the tests that can be attempted, unless the constraint can be removed by providing free searches or a local charging mechanism. 7.6 The opportunities of real life Systems The advantages of testing in a real environment, for many purposes, outweigh the constraints that they impose. The most immediate advantage is the possibility that tests carried out with real systems can bring direct benefit to the users in a realistic time-scale as opposed to testing on static systems where the application of the results is not immediately obvious. The feature of online systems, which cannot be overestimated for testing and evaluation, is the ability to observe user behaviour. This can be done in