SP500215 NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2) Appendix B: System Features Appendix National Institute of Standards and Technology D. K. Harman N AUTOMATICALLY BUILT QUERIES (ROUTING) lID. QUERY CONSTRUCTIO- seconds per query -- encompassing the parsing of all known relevant documents, thesaurus extraction, and merging of the thesaurus and query terms. Thesaurus extraction over the set of relevant documents results in a set of terms to add to a query. The Importance Coefficient (descrileed ahove) is derived from the `discourse location' of terms in the topic. Statistics for IDFfUF calculation are derived from the training documents. All terms extracted from known relevant documents via thesaurus extraction are assigned an Importance Coefficient of 0.5. Yes. (This is necessary in order to extract statistics.) submitted two routing runs. Isiri simply used the text of the query topics to construct a routing filter. lsi[OCRerr] - used the text of all relevant documents for routing filter for that topic. me required to take vector sum of terms in query (1sirl) or relevant documents (1sir2). Yes Subset of 1 GB from WSJ1, APi, DOE, FRI, ZIFi. Subset of them.