Indexing
Tokenization
might include identifying phrases
might include identifying other lexical structures such as names, amounts, etc
Remove “stop words”
Perform stemming
Weight terms
Previous slide
Next slide
Back to first slide
View graphic version