IRE Information Retrieval Experiment The pragmatics of information retrieval experimentation chapter Jean M. Tague Butterworth & Company Karen Sparck Jones All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, including photocopying and recording, without the written permission of the copyright holder, application for which should be addressed to the Publishers. Such written permission must also be obtained before any part of this publication is stored in a retrieval system of any nature. 100 The pragmatics of information retrieval experimentation 5.10 Decision 10: How to present results? Information retrieval experiments should be written up as experiments. This rather obvious recommendation is not always followed in practice. In many reports, one does not realize an experiment has been performed until half way through the paper. The first part is all background. The various aspects of an experiment are generally described in the following order: Purpose of the experiment. Background for the experiment. Methodology. Presentation of results. Summary and conclusions. The purpose of the experiment should be described both in general and specific terms, i.e. the general problem or hypothesis being investigated and its realization, for this experiment, in terms of operational variables. The background section should provide justification for the experiment. What previous work has been carried out in this area? Why is the present study needed? What led the investigator to undertake the work? Only references that specifically relate to the problem under study should be included. Methodology can usually be subdivided into two sections: the test environment and the test procedures. The environment refers to the characteristics of the documents, document surrogates, queries, users, searchers, equipment, etc., used in the test. These should be characterized in detail, as the generality of the results depend on these aspects. Procedures relate to the actual methods used to select the sample (experimental design), run the experiment, collect the data, and analyse the results. Procedures should be described in sufficient detail that another experimenter can repeat the experiment. However, aspects which have been described previously, in generally accessible documents (i.e. not in private communications), such as search algorithms or statistical tests, should simply be referenced. In the results section, the investigator attempts to summarize verbally what the experimental results have shown, not just present pages of tables. Detailed computations or mathematical derivations should be relegated to an appendix and their conclusions only incorporated in the text. Similarly, detailed results and analyses, such as a query by query failure analysis, should also be in the appendix. The final section should serve to review, reiterate, and summarize what has gone before. Remember that this section is all many people read! Some small but important matters remain to mention. Symbols should not be introduced without precise definition. Even conventional symbols can have several interpretations: for example, it can represent either a mathematical constant, or a product operation, or system precision. Both the horizontal and vertical axes in graphs should be labelled and the scale indicated. Unless results are really voluminous, graphs should be accompanied by tables showing the specific values used to construct them. All graphs and other figures should have legends. Other investigators may