Evaluation (3)
There are some difficulties:
the number of cases for each -/S/S is not the same for all topics
This can be used to establish confidence levels for differences between systems
it is a Friedman type test.
Previous slide
Next slide
Back to first slide
View graphic version