SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman

D. QUERY CONSTRUCTION: AUTOMATICALLY BUILT QUERIES (ROUTING)

SYSTEM NAME: Dortmund | Cornell | Berkeley | Rutgers | Siemens | UMASS

Automatically Built Queries (routing): Yes | None | [1] | NM
Topic Fields Used: Everything except Definitions | Everything except Definitions | All | See [1]
Total Computer Time to Build (cpu seconds): 28/50 for each query | 212/50 (crnlR1), 50/50 (crnlC1) per query | 0.7 seconds per query | 4623 for 50 queries

Methods Used in Building Query
- Terms Selected from (*): 1, 3 | 1, 3 | 1 | 3
- Term Weighting with weights based on (*): 1, 2 | 1, 2 | 1 | 3
- Phrase Extraction from (*): 1, 3 | 1, 3 | No | No
- Syntactic Parsing of (*): No | No
- Word Sense Disambiguation using (*): No | No
- Proper Noun Identification Algorithm from (*): No | No
- Tokenizer from (*): No | No
  -- Which Patterns?
- Heuristic Associations to add terms from (*): No | No
- Expansion of Queries Using Previously Constructed Data Structure: History of term occurrence in relevant docs | History of term occurrence in relevant docs | No | No
  -- Which Structure?
- Automatic Addition of Boolean Connectors/Proximity Operators Using Information from (*): No | No
- Other: Additional term specific weights added for 2nd routing run | None

(*) (1) Topic (2) All training documents (3) Documents with relevance judgements