Data: Formation of training/test document sets
Each of 10 NIST information analysts chose one set of newswire/paper articles of each of the following types:
- A single event with causes and consequences
- Multiple distinct events of a single type
- Subject (discuss a single subject)
- One of the above in the domain of natural disasters
- Biographical (discuss a single person)
- Opinion (different opinions about the same subject)
-
Each set contains about 10 documents (mean=10.2, std=2.1)
All documents in a set to be mainly about a specific “concept”