Exploration of vocational high school students experiencing difficulty in cloze test performances: a mixed-methods study in Taiwan

Feng, Kuo-Zheng

doi:10.1186/s40468-024-00274-4

Research
Open access
Published: 26 April 2024

Kuo-Zheng Feng (ORCID: orcid.org/0000-0002-8394-5944)

Language Testing in Asia volume 14, Article number: 16 (2024)

Abstract

This study addressed a gap in existing research on Multiple-Choice (MC) cloze tests by focusing on the learners’ perspective, specifically examining the difficulties faced by vocational high school students (VHSs). A nationwide sample of 293 VHSs participated, providing both quantitative and qualitative data through a self-developed questionnaire. The results revealed that vocabulary and grammar posed the greatest challenges in the MC cloze test, while sentence patterns were perceived as the least difficult by VHSs. Factors contributing to these difficulties included the need for increased focus on vocabulary and grammar learning. Some participants attributed challenges to personal perceptions of intellectual capability, while others highlighted the influential role of teachers’ attitudes on their learning motivations and outcomes. The study suggested implications for test designs and teaching approaches. Despite these contributions, the study acknowledged limitations and offered suggestions for future research directions.

Introduction

Vast numbers of publications in English have produced a growing awareness of language learners’ needs for comprehending various types of texts (Huttner, 2008). Constructing a high-stakes test with different types of reading passages could further strengthen this notion, and a positive backwash could be expected from those tests (Hughes & Hughes, 2020; Madsen, 1983). Most test takers in EFL contexts believe that the process and preparation for English learning should be largely directed toward the contents of those high-stakes tests (Kohonen, 1999), where reading comprehension is perceived to play a significant role (Lee, 2004). High-stakes tests in Taiwan, such as the General Scholastic Ability Test (GSAT) and the Technological and Vocational Education Joint College Entrance Examination (TVE-JCEE), are recognized as having an enormous influence in terms of language teaching and learning (Hughes & Hughes, 2020).

In the GSAT and TVE-JCEE English tests, multiple-choice (MC) cloze tests are among the most common types of reading comprehension tests (Brown, 2002). This type of MC cloze test has been observed to be uncomplicated (Jonz, 1976). In MC cloze tests, students’ English comprehension is tested by requiring them to select the best answer from four possible options to fill in each blank in the passage so that the sentence is semantically coherent and syntactically complete (Hao, 2011; Tabatabaei & Shakerin, 2013). Because several fundamental competencies are usually embedded in MC cloze tests, students are likely to fail to provide correct answers if their comprehension ability and logical thinking ability are not well developed (Luo, 2022). That is, several parts of a passage serve as logical and comprehension clues that contribute to the meaningfulness of the whole passage; most students have found that this type of test is the most difficult of all exams. The use of contextual clues can be problematic for students, and a series of other difficulties may co-occur (Katalin, 2000; Luo, 2022). Therefore, the need to investigate the issues and factors involved in performing and constructing MC cloze tests has begun to attract many scholars’ attention (Bachman & Palmer, 2010; Chou & Chen, 2009; Tabatabaei & Shakerin, 2013).

Many studies have been done on these topics with different types of participants, including the exploration of the use of both senior and vocational high school students’ (VHSs’) strategies (Ai, 2015; Chen, 2013; Cheng, 2008; Joe, 1993; Kuo, 2003), the effects of scaffolding instructions on both senior and VHSs (Luo, 2022; Wang, 2018a), factors that affect junior high school and college students’ performance on cloze tests (Azimi, 2016; Kumazawa, 2016; Trace et al., 2017), and features of the cloze test (Wang, 2018a). In addition, a few researchers (Kuo, 2003; Wang, 2018a) have found from observations that high school students encounter difficulties when taking MC cloze tests, although these conclusions were not based on scientific methods. Only a few studies have examined the difficulties that learners face from their own perspective. Although the reasons for both senior and VHSs’ difficulties in performing well on cloze tests remain unexplored, the issue is more urgent for VHSs. Most VHSs are directed to gain specific skills due to the curriculum design, which is aimed at providing certification. Thus, confidence in learning English is gradually lost and huge discrepancies will appear in English competence as their peers at senior high schools continue to advance (Chang et al., 2007).

In Taiwan, there is a large number of VHSs, after decades of this type of instruction (Chou, 1995; Xu, 1999). Currently, 345,225 VHSs are studying in Taiwan, according to MOE statistics. In 2022, 79,292 VHSs took the TVE-JCEE, only slightly fewer than the number of senior high school students taking the GSAT. Test performance in vocational high school educational systems remains under-examined. Accordingly, this study investigated VHSs’ difficulties in performing an MC cloze test, as well as whether there were any differences among the difficulties identified by VHSs. Finally, the factors that affect VHSs’ performance difficulties in MC cloze tests were examined and contrasted with the findings of previous studies. In particular, this study investigated the following research questions:

1. What types of difficulties do VHSs perceive in taking MC cloze tests?
2. What are the differences among the types of difficulties that are perceived by VHSs in taking MC cloze tests?
3. What factors affect VHSs’ difficulties in taking MC cloze tests?

Through this study, it is hoped that significant and insightful implications will be derived in both theoretical and pedagogical terms. Theoretically, the difficulties that VHSs have in taking MC cloze tests should be given closer attention in studies of language assessment. From a pedagogical point of view, educators and test designers may understand where to focus in order to promote students’ testing strategies and performance in cloze tests. From this, better teaching procedures and curricula can be developed and designed. Most importantly, work along these lines will produce positive backwash, that is, beneficial effects of tests on teaching and learning (Hughes & Hughes, 2020).

Literature review

Background of cloze test development

The cloze procedure, a technique used to assess text readability and communication effectiveness, was introduced by Wilson Taylor in 1953 (Bickley et al., 1970; Kumazawa, 2016). Unlike the testing concept of closure (Rankin, 1959), in which a missing gap is filled to complete a whole, as in Parviz and Sorayya (2012), the cloze procedure involves the systematic deletion of words from preselected texts to evaluate readers’ competence by having them provide the precise words that were removed. From that point on, interest in and attention to research on the cloze procedure have increased, including studies of the effectiveness of the cloze test (Ajideh & Mozaffarzadeh, 2012; Akmedovna, 2022; Alderson, 1990), factors in cloze test performance (Tabatabaei & Shakerin, 2013), and item difficulty (Brown, 1989). Separate from these research topics, the cloze procedure has become distinctive as a tool for conducting reliability research (Taylor, 1953). The results obtained with such tools were diverse: reliability values ranged from 0.13 to 0.96 (Bachman, 1985; Brown, 1989; Pike, 1973), and criterion-related validity values ranged from 0.06 to 0.91 (Bachman, 1985; Brown, 1989). At the same time, a group of researchers began to focus their attention on the various types of cloze procedure, such as the C-test, developed by Raatz and Klein-Braley (1981), and MC cloze tests, developed by Jonz (1976). In addition to these two major types, several other types of cloze procedure appeared, including a fixed-ratio cloze test (Cohen, 1994), a rational cloze test (Alderson, 2000), a conversational cloze (Brown, 1983), and a matching cloze (Baldauf & Propst, 1979). Scholars have also held different points of view with respect to the types of cloze test. For example, Alderson (2000) considered the rational cloze to be a gap-filling test, while the random cloze type was restricted by the term cloze, meaning that it was only a low method to measure English proficiency. In addition, Bachman (1990) indicated that the types of cloze procedure should include rational deletion. Among the various classifications of cloze types, the MC cloze test is the only type that VHSs face in the TVE-JCEE, so this study focuses on that type.

Construction of the MC cloze test

Drawing on Goodman’s (1967) psychological perspective, Boonsathorn (1987) developed the MC cloze test; its underlying principle is the belief that readers engage all levels of processing at once. Due to the disadvantages of the C-test, it was expected that the MC cloze test could better test students’ overall ability (Wonghiramsombat, 2013). Regarding the MC cloze test, three critical aspects should be taken into consideration: text passages, word deletion, and the distribution of testing points. Each aspect is presented and discussed in the following.

Text passages

First, text passages are crucial for constructing MC cloze tests (Ajideh & Mozaffarzadeh, 2012; Tavakoli et al., 2011). Tabatabaei and Shakerin (2013), for example, investigated the effect of content familiarity on the cloze test performance of 60 Iranian EFL learners. A statistically significant difference was found between the results of MC cloze tests with familiar and unfamiliar content, with familiar content linked to successful performance. Likewise, Tavakoli et al. (2011) examined the effects of genre familiarity on an MC cloze test and a C-test. The results showed a significant impact of genre familiarity on both the MC cloze test and the C-test. More recently, Trace (2023) investigated how passage cohesion affected the function of the items. The results showed that passage factors and item function are closely linked, and the study concluded that, aside from content and grammatical structure, test designers should investigate the impact of cohesion in potential cloze passages. Hughes and Hughes (2020) provided suggestions and measures for developing a relevant MC cloze test. First, the difficulty level of the selected passages should match the test takers’ level of proficiency. Once the level is appropriately controlled, several passages should be included in the trialling. Second, the text style should match the level of language ability that is being tested. Third, as words are systematically deleted, it is critical to have native speakers closely inspect the test and provide their opinions on the predetermined answers. Fourth, clear instructions on how to respond should be provided, so that the influence of irrelevant factors can be diminished. Fifth, descriptions could be given to better interpret the scores on the MC cloze test. In light of this, the text passage is an important factor in constructing an MC cloze test.

Deletions of words

For deletions of words in an MC cloze test, Cohen (1980) remarked that “A cloze test in its form is a passage from which after every certain number of words a word is deleted” (p. 91). However, Bachman (1985) believed that systematic and unsystematic deletions are both possible methods to use in making an MC cloze test. From reviews of the existing literature, a systematic approach to deletion appears to be used more widely. In general, deletions are made on every nth word (Brown, 2002; Dhyaaldian et al., 2022; Tabatabaei & Shakerin, 2013), and various word counts have been advocated for the purpose of systematic deletion, including the deletion of roughly every fifth word (Yaseen & Rasheed, 2022), the deletion of every seventh word (Tavakoli et al., 2011), deletions of every sixth to eighth word (Tabatabaei & Shakerin, 2013), deletions between the fifth and tenth word (Azimi, 2016), deletions of every eighth or tenth word (Hughes & Hughes, 2020), and deletions of every twelfth word (Brown, 1989). Hughes and Hughes (2020) reported that, in deletion, a few sentences at the beginning and at the end of the passage should remain untouched so that any clues in this text can be referenced as test takers seek to complete the MC cloze test. In summary, MC cloze tests in the TVE-JCEE appear to adopt the approach of deleting words within a certain range, with around 7 to 8 words on average between blanks. This approach is more reasonable because repeated or irrelevant testing constructs may still be included when exact nth-word deletion is applied strictly.
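The every-nth-word deletion described above can be illustrated with a short sketch. It is not the procedure used by the TVE-JCEE or by any of the cited authors; the function name `make_cloze`, its parameters, and the sample passage are hypothetical, and the sentence splitter is deliberately naive. It blanks every nth word while leaving the opening and closing sentences intact, as Hughes and Hughes (2020) recommend, and it does not generate the four answer options that an MC cloze item would also need.

```python
import re


def make_cloze(passage, n=7, keep_first=1, keep_last=1):
    """Blank every n-th word, leaving the first and last sentences untouched.

    Returns the gapped passage and the list of deleted words (the answer key).
    All names and defaults here are illustrative assumptions.
    """
    # Naive split on whitespace that follows ., ! or ? (an assumption;
    # passages with abbreviations would need a proper sentence splitter).
    sentences = re.split(r"(?<=[.!?])\s+", passage.strip())
    if len(sentences) <= keep_first + keep_last:
        return passage.strip(), []  # too short to delete anything

    cut = len(sentences) - keep_last
    head, body, tail = sentences[:keep_first], sentences[keep_first:cut], sentences[cut:]

    answers = []
    count = 0  # running word count across the body sentences
    gapped_body = []
    for sentence in body:
        words = sentence.split()
        for i, word in enumerate(words):
            count += 1
            core = word.rstrip(".,;:!?")  # keep punctuation outside the blank
            if count % n == 0 and core:
                answers.append(core)
                words[i] = f"({len(answers)}) ____{word[len(core):]}"
        gapped_body.append(" ".join(words))

    return " ".join(head + gapped_body + tail), answers


sample = (
    "Tea is popular in Taiwan. Many people drink it every day with friends "
    "and family after a long day at work. It is sold almost everywhere."
)
gapped, key = make_cloze(sample, n=7)
print(gapped)
print(key)
```

A range-based variant, closer to the 7 to 8 word spacing noted above, would let a test writer pick the word to delete within each window rather than taking the exact nth word.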

Distribution of testing points

The other critical aspect in constructing an MC cloze test is the development of relevant item constructs and sub-skills for testing. Due to constant changes in pedagogical beliefs, questions of what skills should be included and how far each construct should be incorporated have dynamically altered the accepted means of formulating MC cloze tests. Constructs can be equally divided into five items for the grammar aspect and the vocabulary aspect. Imbalanced testing constructs have been observed between the two aspects according to reviews of MC cloze tests on previous TVE-JCEE tests. Lu (2003) examined the item distribution across five consecutive years of MC cloze test items, from 1998 to 2002. The questions mainly assessed test takers’ comprehension within a relatively limited set of texts. The beliefs and conventions applied in making high-stakes tests began to change as the 108 curricula were brought forward and preferred. After the implementation of these curricula, which stressed an orientation toward competency, a tendency to construct test items more comprehensively was perceived. As a result of this educational policy, texts have become longer, and test takers may need to apply more than one skill to complete the tests. Whether test takers’ competencies are accurately assessed simply by increasing text length and task complexity remains debatable, as test takers appear to be becoming less competent in completing the tests. In addition, concrete descriptions of students’ test-taking difficulties have been blurred by the recent emergence of new test types.

Test takers’ difficulty in performing cloze test

Many studies have been performed hitherto on MC cloze test difficulty, indirectly and obliquely indicating test takers’ difficulties in performing this test (Abraham & Chapelle, 1992). Hughes and Hughes (2020) called for all tests to be carefully designed to allow test takers to know what to refer to. This indicates that an MC cloze test would be difficult with fewer clues. Boonsathorn (1987) determined the reliability of the C-test and the MC-Test using comparisons. Whether different starting points of deletion would affect the difficulty was further explored. To investigate this, two forms of test, a C-test and an MC-Test, were created. Four tests were given to L1 and L2 participants. Both types of test were highly reliable. The MC-Test appeared to be more challenging for both L1 and L2 participants because this type of test required a greater than usual reading comprehension process, as well as better discrimination. In the same vein, Kumazawa (2016) inspected factors influencing score variance in MC cloze tests. In particular, the study investigated linguistic and textual effects on an MC cloze test. The results identified interactions among those factors, and the reliability of the MC cloze test was established. Although the primary goals of those two studies did not focus directly on test takers’ difficulties in taking MC cloze tests, relevant ideas were implicitly revealed and could be inferred from the results.

For more direct results, Han (2022) investigated the relationships among vocabulary ability, use of vocabulary learning strategies, and cloze test performance. The participants were Korean college students. The results indicated a positive correlation between students’ vocabulary ability and their performance on a cloze test. Although this study highlighted the importance of vocabulary competence, additional factors were left unexplored, and the application of a quantitative method prevented deeper insight. Most importantly, VHSs were ignored. In a study targeting VHSs as participants, Wang (2018a, 2018b) reported that VHSs indeed had difficulty performing an MC cloze test. From her observations and teaching experience, the vocabulary of most VHSs was excessively limited, and they were unfamiliar with grammatical concepts and sentence patterns. For this reason, they were unable to comprehend the reading passages used in the MC cloze test. Most VHSs relied heavily on rote learning, and they reported that there were too many targeted words and rules to remember. Ultimately, most VHSs described by Wang (2018b) gave up on learning English. Their frustrations and difficulties were vividly portrayed; however, it cannot be denied that evidence from observations may not match VHSs’ inner thoughts. As the idea of a learner-centered approach to language assessment literacy has attracted significant attention recently (Butler et al., 2021; Lee & Butler, 2020), explorations of VHSs’ test-taking difficulties can be scientifically conducted through the direct involvement of students as participants, with proper instruments to elicit their inner voices.

Using learners’ perspectives as a lens to investigate the difficulties of an MC cloze test, Ajideh and Mozaffarzadeh (2012) investigated whether the MC cloze test and C-test were appropriate to assess learners’ reading comprehension. In addition, opinions and reflections on these two types of tests were further explored via a retrospective study. The results indicated that the MC cloze test was much more applicable for measuring test takers’ reading comprehension than the C-test. Regarding participants’ views of these two types of tests, it was found that the MC cloze test was easier to complete than the C-test, and these results are reasonable and predictable. Surprisingly, participants remarked that the probability of guessing the correct answers was greater than 50% for both tests. Even though scientific methods were used to justify the results, involving advanced learners inevitably made the results less convincing. Most importantly, it is urgent to explore VHSs’ test-taking difficulties to provide timely support to them, as TVE-JCEE English scores can be a decisive factor in being able to enroll in better colleges. Thus, the significance of exploring VHSs’ difficulties in performing MC cloze tests through learners’ perspectives should be noted.

Method

Participants

The participants were Taiwanese VHSs studying at different grade levels. A total of 309 VHSs completed the online questionnaire. Incomplete questionnaires and those where the same value was chosen for all items were discarded. After the deletion of 16 invalid or incomplete responses, 293 questionnaires were included in this study. In the final group, there were 119 male students and 174 female students. In all, 73 students were in grade 10, 131 students were in grade 11, and 89 students were in grade 12. Most were studying at public vocational schools (n = 250), and the remaining students were at private vocational schools (n = 43). Table 1 presents participants’ demographic information. These students mainly used English textbooks published by San Min Book Co. or Longteng Education. All students had five English lessons a week, each lasting 50 min.
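The screening rule described above (discarding incomplete questionnaires and those with the same value on every item) can be sketched as follows. This is only an illustration: the study does not describe its data format, so the representation of a response as a list of ratings with `None` for an unanswered item, the function names, and the toy data are all assumptions. The final assertions simply check that the counts reported in this paragraph are mutually consistent.

```python
def is_valid(response):
    """Return True if a response is complete and not straight-lined.

    `response` is assumed to be a list of numeric ratings, with None
    marking an unanswered item (a hypothetical encoding).
    """
    if not response or any(rating is None for rating in response):
        return False  # empty or incomplete questionnaire
    if len(set(response)) == 1:
        return False  # the same value was chosen for all items
    return True


def screen(responses):
    """Keep only the responses that pass the validity check."""
    return [r for r in responses if is_valid(r)]


# Toy data, not taken from the study.
toy = [[4, 2, 5, 3], [3, 3, 3, 3], [1, None, 2, 4]]
kept = screen(toy)  # only the first response survives

# Consistency check on the counts reported in the text.
submitted, discarded = 309, 16
retained = submitted - discarded
assert retained == 293
assert 119 + 174 == retained      # male + female
assert 73 + 131 + 89 == retained  # grades 10, 11, 12
assert 250 + 43 == retained       # public + private schools
```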

Table 1 Demographic information of participants
