Skip to main content

Table 1 LR measures studied in the current paper

From: How effective are lexical richness measures for differentiations of vocabulary proficiency? A comprehensive examination with clustering analysis

Group

Label (notation)

Notes and explanation

Specification or formula

Variant labels in literature

 

Types (T)

Number of different words

T

NDW

 

Tokens (N)

Total running words

N

 

I

TTR

Types divided by Tokens

T/N

 
 

RootTTR

square root adjustment of TTR

\( T/\sqrt{N} \)

Guiraud Index

 

LogTTR

Logarithm adjustment of TTR

log(T)/log(N)

Herdan C

 

D

A lexical diversity measure based on iterated calculation of TTR (Malvern & Richards, 1997)

\( TTR=\frac{D}{N}\left(\sqrt{1+2\frac{N}{D}}\kern0.5em -1\right) \)

 
 

Uber

A quantity based on Types and Tokens (Dugast, 1979)

\( \frac{\log^2T}{\log (N)-\log (T)} \)

Maas

 

V1

Number of hapax legomena

V1

V(1,N)

 

V2

Number of dis legomena

V2

V(2,N)

II

V1TR

V1 token ratio

V1/N

V1 Ratio

 

V2TR

V2 token ratio

V2/N

V2 Ratio

 

Honore

A quantity involving V1, Types, and Tokens (Honored, 1979)

100 log(N)/(1-V1/N)

R

 

Sichel

a characteristic constant proposed by Sichel (1975)

V2/T

S

 

Entropy(E)

A quantity measuring the complexity of a text (Shannon, 1951)

\( \mathrm{E}=-{\sum}_{i=1}^T{p}_i\log {p}_i \)

 

III

Relative entropy (RE)

Entropy scaled by the maximum entropy of a text

RE=E/log(N)

 
 

Yule K

A characteristic constant proposed by Yule (1944)

\( {10}^4\left(-\frac{1}{N}+{\sum}_i{V}_i{\left(\frac{i}{N}\right)}^2\right) \)

 
 

Yulk I

An algebraic transformation of Yule K

\( \frac{10^4}{\mathrm{Yule}\ \mathrm{K}} \)

 
 

Vm

A modification of Yule K proposed by Herdan (1960)

Vm2=Yule K + (1/N - 1/T )

Â