Rating scale | Category observations ≥ 10 | Monotonic average measures | Outfit MNSQs < 2.0 | Monotonic threshold calibrations | Thresholds 1.4–5.0 logits apart | Peaked probability curves |
---|---|---|---|---|---|---|
Fluency | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Accuracy | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Complexity | X | ✓ | ✓ | ✓ | X | ✓ |
Interaction | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Effectiveness | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |