SP500207
NIST Special Publication 500-207: The First Text REtrieval Conference (TREC-1)
Appendix C: System Features
appendix
National Institute of Standards and Technology
Donna K. Harman
9. document length
10. completeness (what % of the query terms are present)
12. word specificity (i.e., animal vs. dog vs. poodle)
To provide a coarse-grain ranking we ran several queries per topic, to provide
increasing levels of recall. The five methods above were used in addition to Boolean
logic, numeric ranging, and word order.
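The coarse-grain ranking described above can be illustrated with a minimal sketch. It assumes that each topic's queries are run from strictest to loosest and that a document's rank tier is the first query that retrieves it. The function name, the tier numbering, and the alphabetical ordering within a tier are hypothetical choices made for illustration; the source does not describe how the result sets were actually merged.

```python
def coarse_grain_rank(result_sets):
    """Merge per-query result sets into a tiered ranking.

    result_sets: list of iterables of document ids, ordered from the
    strictest query (highest precision) to the loosest (highest recall).
    This ordering is an assumption, not something the source specifies.
    Returns a list of (doc_id, tier) pairs, best tier first.
    """
    ranked = []
    seen = set()
    for tier, hits in enumerate(result_sets, start=1):
        hits = set(hits)
        # Only documents not already retrieved by a stricter query
        # enter at this tier; ties within a tier are sorted by id.
        for doc_id in sorted(hits - seen):
            ranked.append((doc_id, tier))
        seen |= hits
    return ranked


if __name__ == "__main__":
    strict = ["d2"]
    medium = ["d1", "d2"]
    loose = ["d3", "d1", "d4"]
    for doc_id, tier in coarse_grain_rank([strict, medium, loose]):
        print(tier, doc_id)
```

Documents within one tier are indistinguishable under this scheme, which is why the resulting ranking is only coarse-grained.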
IV. What machine did you conduct the TREC experiment on? Sun-3/160 with FDF2000 and C-51 disk array
How much RAM did it have? 8 MB
What was the clock rate of the CPU?
A Sun-3/160 is a couple of MIPS. Note that the Sun is just the host; the FDF does
the actual pattern matching. The FDF2000 model used for TREC clocks at around
12 MHz.
V. Some systems are research prototypes and others are commercial.
To help compare these systems:
1. How much "software engineering" went into the development of your system?
No special programming was done for the TREC conference. The FDF system itself
was the result of extensive prior development.
2. Given appropriate resources, could your system be made to run faster? By how much
(estimate)?
How fast would you like it to go? The system used to execute the TREC queries was
20% of a full-up system. We're currently working on software that will
automatically coordinate multiple FDF systems to work in parallel. We're also
considering faster FDF chips and data transfer methods.
3. What features is your system missing that it would benefit by if it had them?
The next generation of FDF systems have an in-hardware term weighting capability
that can be used, in combination with the existing features, to return a numeric score
for a document. This would allow for finer grain in ranking. New model prototypes
were not available for this effort.
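As a rough software illustration of how term weighting can yield a numeric score per document, the sketch below sums the weights of the query terms found in a document. It is not the FDF hardware method, which the source does not describe. The function name, the weight values, and the whitespace tokenization are all assumptions.

```python
def weighted_score(doc_text, term_weights):
    """Return the sum of weights of the query terms present in doc_text.

    term_weights: dict mapping a query term to a numeric weight
    (hypothetical values chosen by the searcher).
    Each term counts once, no matter how often it occurs.
    """
    # Naive tokenization: lowercase and split on whitespace (an assumption).
    words = set(doc_text.lower().split())
    return sum(w for term, w in term_weights.items() if term.lower() in words)


if __name__ == "__main__":
    weights = {"poodle": 3.0, "dog": 2.0, "animal": 1.0}
    print(weighted_score("The poodle is a dog", weights))
```

Sorting documents by such a score gives a finer-grained ordering than membership in a query's result set alone.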