Group | Label (notation) | Notes and explanation | Specification or formula | Variant labels in literature |
---|---|---|---|---|
 | Types (T) | Number of different words | T | NDW |
 | Tokens (N) | Total running words | N |  |
I | TTR | Types divided by Tokens | T/N | Â |
 | RootTTR | square root adjustment of TTR | \( T/\sqrt{N} \) | Guiraud Index |
 | LogTTR | Logarithm adjustment of TTR | log(T)/log(N) | Herdan C |
 | D | A lexical diversity measure based on iterated calculation of TTR (Malvern & Richards, 1997) | \( TTR=\frac{D}{N}\left(\sqrt{1+2\frac{N}{D}}\kern0.5em -1\right) \) |  |
 | Uber | A quantity based on Types and Tokens (Dugast, 1979) | \( \frac{\log^2T}{\log (N)-\log (T)} \) | Maas |
 | V1 | Number of hapax legomena | V1 | V(1,N) |
 | V2 | Number of dis legomena | V2 | V(2,N) |
II | V1TR | V1 token ratio | V1/N | V1 Ratio |
 | V2TR | V2 token ratio | V2/N | V2 Ratio |
 | Honore | A quantity involving V1, Types, and Tokens (Honored, 1979) | 100 log(N)/(1-V1/N) | R |
 | Sichel | a characteristic constant proposed by Sichel (1975) | V2/T | S |
 | Entropy(E) | A quantity measuring the complexity of a text (Shannon, 1951) | \( \mathrm{E}=-{\sum}_{i=1}^T{p}_i\log {p}_i \) |  |
III | Relative entropy (RE) | Entropy scaled by the maximum entropy of a text | RE=E/log(N) | Â |
 | Yule K | A characteristic constant proposed by Yule (1944) | \( {10}^4\left(-\frac{1}{N}+{\sum}_i{V}_i{\left(\frac{i}{N}\right)}^2\right) \) |  |
 | Yulk I | An algebraic transformation of Yule K | \( \frac{10^4}{\mathrm{Yule}\ \mathrm{K}} \) |  |
 | Vm | A modification of Yule K proposed by Herdan (1960) | Vm2=Yule K + (1/N - 1/T ) |  |