SP500215
NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2)
Latent Semantic Indexing (LSI) and TREC-2
chapter
S. Dumais
National Institute of Standards and Technology
D. K. Harman
5. Onward to TREC-3
We were quite pleased that we were able to use many
of the existing LSIISVD tools on the ThEC-1 and
TREC-2 collections. The most Important findmg in
this regard was that the large, sparse SVD problems
could be computed without numerical or convergence
problems. We modified the preprocessing
substantially for [OCRerr]fl[OCRerr]C-2, now bave many of the basic
tools in place and should be able to conduct more
experiments comparing various indexing and query
matching ideas using the same underlying LSI engine.
Bigger SVDs, faster query matching, Improving
precision, and interactive interface issues are the
major areas targeted for Improvement.
[8] Dumais, S. T. and Schmitt, D. 0. Iterative
searcbing in an oniine database. In
Proceedings of Human Factors Society 35th
Annual Meeting, 1991, 398402.
[9] Evans, D., Leffe[OCRerr], R., Grefenstette, 0.,
Handerson, S., Hersh, W., and Archbold, A.
CLM[OCRerr] fl[OCRerr]EC Design, experiments and
results. lii D. Harman (Ed.) The First Text
REtrieval Conference (TREC-1), NIST Special
Publication 500-207, 1992,251-286.
[10] Foltz, P. W. and Dumals, S. T. Personalized
information delivery: An analysis of
information filtering methods.
Communications of the ACM, Dec. 1992,
35(12), 51-60.
6. References
[1]
Berry, M. W.
computations.
Supercomputer
49.
Large scale singular value
International Journal of
Applications, 1992, 6(1), 13-
[2] Buckley, C., Allan, J., and Salton, 0.
Automatic routing and ad-hoc retrieval using
SMART: IREC 2. To appear in: Proceedings
of TREC-2.
[3] Buckley, C. and Salton, 0. Automatic
retrieval with locality information using
SMART. In D. Harman (Ed.) The First Text
REtrieval Conference (TREC-1), NIST Special
Publication 500-207, 1992,59-72.
[4] Cullum, JX. and Willoughby, R.A. Lanczos
algorithms for large symmetric eigenvalue
computations - Vol 1 Theory, (Chapter 5: Real
rectangular matrices). Brii:hauser, Boston,
1985.
[5] Deerwester, S., Dumais, S. T., Landauer, T.
K., Furnas, 0. W. and Harshinan, R. A.
Indexing by latent semantic analysis. Journal
of the Society for Information Science, 1990,
41(6), 391-407.
[6] Dumais, S. T. Improving the retrieval of
information from exteiiial sources. Behavior
Research Methods, Instruments and
Computers, 1991,23(2), 229-236.
[7] Dumais, S. T. LSI meets ThEC: A staus
report. In D. Harman (Ed.) The First Text
REtrieval Conference (TREC-1). NIST special
publication 5O([OCRerr]207, 137-152.
114
[11] Furnas, 0. W., Deerwester, S., Dumals, S. T.,
Landaner, T. K., Harshman, R. A., Streeter, L.
A., and Lochbaum, K. E. Information retrieval
using a singular value decomposition model of
latent semantic structure. In Proceedings of
SIGIR, 1988,465-480.
[12] Gallant, S., Hecht-Nielson, R., Cald, W., Qing,
K., Carleton, J., Sudbeck, D. TIPSThR Panel -
`INC's MatchPlus System. In D. Harman
(Ed.) The First Text REtrieval Conference
(TREC-1), NIST Special Publication 500-207,
1992, 107-112.
[13] Jacobs, P, Krupka, 0., and Rau, L. A Boolean
approxImation method for query construction
and topic assignment in TREC. In D. Harman
(Ed.) The First Text REtrieval Conference
(TREC-1), NIST Special Publication 500-207,
1992,297-308.
[14] Kane-Esrig, Y., Streeter, L., Dumais, S. T.,
Keese, W. and Casella, G. The relevance
density method for multi-topic queries in
information retrieval. In Proceedings of the
23rd Symposium on the Interface, E.
Keramidas (Ed.), 1991,407-410.
[15] Nelson, P. Site report for the Text REtrieval
Conference. In D. Harman (Ed.) The First
Text REtrieval Conference (TRFC-1), NIST
Special Publication 500-207, 1992,287-2%.
[16] Salton, 0. and McGill, M.J. Introduction to
Modern Information Retrieval. McGraw-Hill,
1983.
[17] Voorhees, E. On expanding query vectors
with lexically related words. To appear in:
Proceedings of TREC-2.