Rating scale | Category observations ≥ 10 | Monotonic average measures | Outfit MNSQs < 2.0 | Monotonic threshold calibrations | Thresholds 1.4–5.0 logits apart | Peaked probability curves |
---|---|---|---|---|---|---|
Fluency | X | ✓ | ✓ | ✓ | X | X |
Accuracy | X | ✓ | ✓ | X | X | X |
Complexity | X | ✓ | ✓ | ✓ | X | X |
Interaction | X | X | X | X | X | X |
Effectiveness | X | X | ✓ | ✓ | X | X |