SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Appendix B: System Features
Appendix
National Institute of Standards and Technology
D. K. Harman
`B. CONSTRUCTION OF INDICES, KNOWLEDGE BASES AND OThER DATA STRUCTURES -- STATISTICS ON DATA SThUC[OCRerr]IU
Pre-indexing -- most of the helow don't apply.
[OCRerr]d SMART's pre-processing to construct a term[OCRerr]ocument matrix for input to the SVD. This took ahout 9-10 hours. After this, SMART is used only to acce
[OCRerr] do not store an inverted index, since we use the 151-space for matching and retrieval.
4 MB (0.55 GB WSJ text); 101 MB (0.3 GB SJMN text).
stem creates a network. Files created are descrihed in B.5 (special routing structures) and B.6 (other data structures).
Proper noun, complex nominal, and text structure index - 1,000 MB for WSJ and SJM.
conceptual graph - 284 MB (WSJ).
Proper noun, complex nominal, and text structure index - 24 hours (for both WSJ and SJM).
conceptual graph -?