SP500207 NIST Special Publication 500-207: The First Text REtrieval Conference (TREC-1) Query Improvement in INformation Retrieval Using Genetic Algorithms - A Report on the Experiments of the TREC Project chapter J. Yang R. Korfhage E. Rasmussen National Institute of Standards and Technology Donna K. Harman 10. Further Studies In dealing with these large databases, our algorithm shows many interesting phenomena which we could not deal with due to the short time period of TREC, and which could form the agenda for further studies. (1) The document term weights Although we use the vector space model mentioned in Section 3, unweighted keyterms are assigned to describe the documents. We have mentioned the two factors based on which we made the decision. Moreover, the databases consist of a variety of subjects which may make the general term weights less reliable than in most experimental databases, which deal with only a narrow area. While our Cranfield study supports this decision, the effect of weighted and unweighted document keywords in large databases should be further studied. (2) Window size for document retrieval Although we established the maximum size of the retrieved document set as 40, it would be interesting to set the window size smaller than 40 at the beginning of the process and gradually increase the size after the query individuals move toward convergence. Various thresholds and cutoff points are often arbitrarily set. Further experimentation is planned to investigate the effect of changes in these values. (3) Document classification In this experiment we found many cases in which different query individuals retrieved different documents. This may arise because the database contains relevant documents that belong to different classes, that can be retrieved only by using different queries. This fact would never be discovered by systems that rely on a single query. Our present algorithm aims at convergence to a single query form. We need a mechanism which will detect the development of distinct classes, and automatically separate the query into two or more distinct queries. Similar ideas, not based on genetic algorithms, have been discussed in document classification (Worona, 1971) and in query splitting (Borodin, et. al., 1971). (4) Term addition and deletion In our feedback process, no terms were added to or removed from the original query. This may cause lower recall. However, the determination of which terms should be added or removed is not an easy task (e.g., Harman, 1988, 1992). It would be interesting to see how this problem can be handled in the genetic process. Reference Aalbersberg, I.J. (1992). Incremental relevance feedback. Proceedings of the lSth Annual International SIGIR Conference, pp 11-22. Copenhagen, Denmark (June, 1992). 56