Post hoc evaluation of analytic rating scales for improved functioning in the assessment of interactive L2 speaking ability

Language Testing in Asia

Table 6 Summary of adherence to Linacre’s (2002a) guidelines for the 5-point rating scales

Rating scale	Category observations ≥ 10	Monotonic average measures	Outfit MNSQs < 2.0	Monotonic threshold calibrations	Thresholds 1.4–5.0 logits apart	Peaked probability curves
Fluency	✓	✓	✓	✓	✓	✓
Accuracy	✓	✓	✓	✓	✓	✓
Complexity	X	✓	✓	✓	X	✓
Interaction	✓	✓	✓	✓	✓	✓
Effectiveness	✓	✓	✓	✓	✓	✓
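
The column headings of Table 6 can be read as simple pass/fail checks on each scale's category statistics. The following is a minimal sketch of five of those checks, using the cut-offs given in the table header. The function name `check_scale`, its parameter names, and the input values are hypothetical and purely illustrative; they are not the study's data or analysis code. The sixth criterion (peaked probability curves) is usually judged from the category probability plot and is left out here.

```python
from typing import Dict, List


def check_scale(counts: List[int],
                avg_measures: List[float],
                outfit_mnsq: List[float],
                thresholds: List[float]) -> Dict[str, bool]:
    """Check one rating scale against five of the criteria listed in Table 6.

    All arguments are ordered from the lowest to the highest category
    (or, for thresholds, from the lowest to the highest step).
    """
    def increasing(values: List[float]) -> bool:
        # Strictly increasing from one category (or threshold) to the next.
        return all(a < b for a, b in zip(values, values[1:]))

    # Distances in logits between adjacent threshold calibrations.
    gaps = [b - a for a, b in zip(thresholds, thresholds[1:])]

    return {
        "observations_ge_10": all(n >= 10 for n in counts),
        "monotonic_average_measures": increasing(avg_measures),
        "outfit_mnsq_lt_2": all(m < 2.0 for m in outfit_mnsq),
        "monotonic_thresholds": increasing(thresholds),
        "threshold_gaps_1.4_to_5.0": all(1.4 <= g <= 5.0 for g in gaps),
    }


# Made-up values for a 5-point scale (five categories, four thresholds).
# These are NOT taken from the study.
example = check_scale(
    counts=[8, 25, 60, 40, 12],
    avg_measures=[-2.1, -0.9, 0.2, 1.3, 2.6],
    outfit_mnsq=[1.1, 0.9, 1.0, 0.8, 1.2],
    thresholds=[-3.0, -1.2, -0.3, 2.5],
)

for criterion, passed in example.items():
    print(f"{criterion}: {'pass' if passed else 'fail'}")
```

With these invented inputs, the scale fails the observation-count check (one category has fewer than 10 observations) and the threshold-distance check (one pair of adjacent thresholds is less than 1.4 logits apart), and passes the other three.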