- small web task:
- Are best methods in ad hoc task best for Web?Can links be exploited?
- approx. 2 GB subsample of VLC2
- 50 ad hoc topics
- standard relevance assessments & trec_eval evaluation
- large web task:
- How do methods scale?
- full VLC2 collection (100 GB)
- 10,000 queries from Alta Vista & Electronic Monk
- judged top 20 docs for 50 questions; Prec(20) evaluation