SP500207
NIST Special Publication 500-207: The First Text REtrieval Conference (TREC-1)
Appendix C: System Features
appendix
National Institute of Standards and Technology
Donna K. Harman
3. oilier tenn weights (where do they come from?)
augnorm, c([OCRerr]mputed by SMART using the above factors
IV. What machine did you couduci ilie TREC experilneilt on? DECstati()n 5()([OCRerr][OCRerr][OCRerr] Model 25
How much RAM did it h[OCRerr][OCRerr]ve? 4([OCRerr]M bytes
What wa.'; tile clock rate of [OCRerr]e CPU? MIPS R3()()([OCRerr] at 25MHz
At the end of our work f[OCRerr],r the submission, we finally had 3Gbytes of disk storage to work
with.
V. Some systems [ire rese'.irch protolypes t.[OCRerr]nd other"; we c()Inlnerci[d.
To help COInpL.[OCRerr]C tiiese systems:
1. How much "soflwaie eugiucerilig" went into tile development of your system?
We began with the 1983 version of SMART, and have enhanced it. We tried to use
the new version ([OCRerr]f SMART on an RSI6(i(N[OCRerr] but could not get reliable results and so
went back to our older version. We underwent extensive software development since
May but due to lack to disk space could m[OCRerr]t use most of what we developed for the
submission.
2. Give'i appropnate resources, could your system be m'.ide to run f[OCRerr][OCRerr]ter? By how much
(estimate)?
With m([OCRerr]re disk space we could have used the inverted tile option and that would
have made things much faster. That would have allowed real time interactive
searching. Also, with Ill()[OCRerr]C disk space, we c(Juld have used an RSI[OCRerr][OCRerr]O, a[OCRerr]uming
SMART could l)e ported and made fully operational.
3. What features is your system missing thL.a it would benefit by if it had them?
Because ([OCRerr]f the disk space problem we were n([OCRerr]t al)le t([OCRerr] do many of the efforts we
wanted to do. Work will continue this fall if disks are received in time. Among the
features:
- phrase identification and matching
- building "decisi[OCRerr][OCRerr]n trees" after training with a sufficient set of relevance
judgments
- implementing the CE() model ([OCRerr]f P. Thompson and trying it out in a
variety of ways to combine results from a variety of runs and
indexing schemes (that could include stemming andlor
IliorphEJIogical analysis).
512