International Journal of Language Testing
International Journal of Language Testing, Volume 14, , Issue 1, March 2024
مقالات
حوزه های تخصصی:
The present study explored the effects of interactionist Dynamic Assessment (DA) and age on the writing accuracy of IELTS candidates. This research investigated the existence of an interactional effect between the age of learners and the type of treatment they received. First, 140 IELTS candidates were split into two age groups, adult and young. Each group was divided into two equal smaller groups, randomly assigned to two types of DA. All four groups took a writing pre-test. After a nine-session treatment of interactionist DA in study groups and interventionist DA in control groups, all groups took a writing post-test. The data collected during the post-test were analyzed by two-way ANOVA and compared with the pre-test data. The findings showed that implementation of interactionist DA significantly improved the learners’ writing accuracy while the learners’ age had no significant effect on their writing accuracy. With respect to the study findings, EFL teachers could use interactionist DA to enhance the accuracy of learners' writing skills by providing more opportunities for the learners to interact.
A Cognitive Diagnostic Modeling Analysis of the Reading Comprehension Section of an Iranian High-Stakes Language Proficiency Test
حوزه های تخصصی:
The purpose of this study was to compare the functioning of five restrictive CDMs, including DINA, DINO, A-CDM, LLM, and RRUM, against the G-DINA model to identify the best-fitting CDM which can better explain the interaction underlying the attributes of the reading comprehension section of an Iranian high-stakes language proficiency test. To achieve this aim, item responses of 1152 examinees to the items of the test were examined. The six CDMs were initially compared in terms of relative and absolute fit statistics at test-level to choose the best model. It was found that the G-DINA model outperformed compared to the restrictive models; thus, it was selected for the second phase of the study. Concerning the second purpose of the study, the G-DINA was used to identify strengths and weaknesses of the examinees. The results revealed that making an inference and vocabulary are the hardest attributes for examinees of the test, and understanding the specific information is the easiest attribute. Finally, the models were also compared at item-level. The presence of a combination of L2 reading attributes was found.
New Perspectives on IELTS Authenticity: An Evaluation of the Speaking Module
حوزه های تخصصی:
This study represents the inaugural attempt at assessing the authenticity of the tasks encompassed in the IELTS Speaking Module. The evaluation is conducted from the vantage points of applied linguistics and general education, and serves to enhance comprehension of authenticity and authentic assessment. In order to achieve this objective, an analysis was conducted on the Speaking Module tasks using Bachman and Palmer's (1996; 2010) model of test usefulness from the discipline of applied linguistics, as well as Herrington and Herrington's (1998; 2006) inventory of the fundamental attributes of genuine assessment from the realm of general education. The results of both task analyses revealed low indices of authenticity of the tasks. The high degree of the convergence of the results from the analyses could open new horizons to experts in applied linguistics to exchange ideas about authentic language assessment with those in general education. Furthermore, such characteristics as problem-solving skills, higher-order thinking skills, integrated assessment, and multiple indicators of learning, are not included in Bachman and Palmer’s framework.
Applying IRT Model to Determine Gender and Discipline-based DIF and DDF: A Study of the IAU English Proficiency Test
حوزه های تخصصی:
The purpose of this study was to examine gender and discipline-based Differential Item Functioning (DIF) and Differential Distractor Functioning (DDF) on the Islamic Azad University English Proficiency Test (IAUEPT). The study evaluated DIF and DDF across genders and disciplines using the Rasch model. To conduct DIF and DDF analysis, the examinees were divided into two groups: Humanities and Social Sciences (HSS) and Non-Humanities and Social Sciences (N-HSS). The results of the DIF analysis showed that four out of 100 items had DIF across gender, and two items had discipline DIF. Additionally, gender DDF analysis identified one item each for Options A, B, and C, and four items for Option D. Similarly, the discipline DDF analysis revealed one item for Option A, three items for Option B, four items for Option C, and three items for Option D. The findings of this study have significant implications for test developers. The identification of potential biases in high-stakes proficiency tests can help ensure fairness and equity for all examinees. Furthermore, identifying gender DIF can shed light on potential gender-based gaps in the curriculum, highlighting areas where male or female learners may be disadvantaged or underrepresented in terms of knowledge or skills.
Examining Local Item Dependence in a Cloze Test with the Rasch Model
حوزه های تخصصی:
Local item dependence (LID) refers to the situation where responses to items in a test or questionnaire are influenced by responses to other items in the test. This could be due to shared prompts, item content similarity, and deficiencies in item construction. LID due to a shared prompt is highly probable in cloze tests where items are nested within a passage. The purpose of this research is to examine the occurrence and magnitude of LID in a cloze test. A cloze test was analyzed with the Rasch model and locally dependent items were identified with the residual correlations. Findings showed that three pairs of items were locally dependent. When these items were removed from the analysis, test reliability dropped but item fit and unidimensionality improved. Removing the three locally dependent items did not affect person ability mean and standard deviation, though. The findings are discussed in terms of LID detection and modeling in the context of cloze test and language testing.
Unveiling Indonesian EFL Teacher’s Perceptions and Challenges of Technology-based Assessment as and for Learning
حوزه های تخصصی:
The significance of assessment as a learning component has led to the development of a sound assessment system that supports the teaching and learning objectives. Achievement of learning assessment should prioritize assessment as learning (AaL) and assessment for learning (AfL) rather than the assessment of learning (AoL). The appropriate AaL and AfL have influenced the process of AoL. Almost all teachers in Indonesia have faced the challenge of implementing Aal and AfL. This study used a qualitative study to report English as a Foreign Language (EFL) teachers' perceptions and challenges regarding technology-based AfL and AaL. The design of the study was content analysis using the coding procedure and the grounded theory framework as the analytical foundation. Data gathered involved conducting structured interviews with 60 EFL teachers from three central provinces within Indonesia. The findings revealed that EFL teachers in Indonesia had different perspectives about technology-based AfL and AaL as flourishing assessment approaches. Most of the EFL teachers' perceptions said that due to a lack of time, large classes, and lack of knowledge and training on technology-based AfL and AaL principles and practices in EFL contexts, implementation of technology-based AfL and AaL was viewed as challenging. The study has implications for EFL teachers and trainers in developing an instructional model of technology-based AfL and AaL.
Attaining C1-level Scores in TOEFL iBT: From Challenges to Needs
حوزه های تخصصی:
This study sought to identify the challenges and needs of TOEFL iBT candidates in achieving C1 level scores in the speaking and writing sections of the exam. To this end, the researcher employed a mixed-method approach to collect data from a population of 46 students, both male and female, between the ages of 22 to 30. The participants were enrolled in two TOEFL preparation classes at Universidad Nacional de Educación. The data were collected through a survey, class observations, and TOEFL speaking and writing mock tests, and were subsequently analyzed both qualitatively and quantitatively. The findings indicated that the participants encountered challenges in discussing unfamiliar topics fluently, struggled with producing language at the C1 level while employing a variety of grammar and vocabulary, and demonstrated reduced accuracy due to influences from their first language (L1). To address these challenges, it is recommended to offer a bridging course prior to exam preparation courses wherein the primary focus is devoted to enhancing candidates’ accuracy and communicative competence in discussing unfamiliar topics through corrective feedback. The findings of this study could offer valuable insights to TOEFL iBT test preparation instructors, enabling them to better understand the challenges students face and plan strategies to address these challenges effectively within preparation courses.
Incremental Theory of Intelligence and Writing Performance of Iranian IELTS Candidates
حوزه های تخصصی:
Incremental intelligence plays an important role in self-regulating and enhancing writing performance among language learners. The present study aimed to investigate the relationship among variables of the incremental theory of intelligence, self-regulatory writing strategies, implicit theory of writing, and writing performance in EFL learners. To do so, 320 Iranian IELTS candidates were invited to participate in the study. Three questionnaires, namely incremental intelligence, self-regulatory writing strategy, and implicit theory of writing, were utilized. The participants included male and female IELTS candidates aged 19-27 years with a minimum band score of 6.5 in their writing skill. Furthermore, their educational qualifications included BA and MA degrees. The collected data were analyzed using SPSS and structural equation modeling (SEM). The correlation analysis showed that the incremental theory of intelligence had a significant relationship with self-regulatory writing strategy, implicit theory of writing, and writing performance. Moreover, the obtained results of SEM confirmed the findings of the correlational analysis, indicating statistically significant positive correlations among the variables of the study. The findings revealed that a growth-oriented mindset could affect students’ use of four types of self-regulatory writing strategies. Self-regulatory writing strategies also played a pivotal role in guiding, stimulating, motivating, and sustaining learners’ efforts, thus predicting writing performance. The results of this study can help EFL learners learn how to regulate and manage themselves in writing skills. Learners can assess the extent of their professionalism and endeavor to identify their weaknesses in the learning process.
Expanding Novice and Experienced Iranian EFL Teachers’ Assessment Identity Landscape: Does Online Explicit Instruction Make a Difference?
حوزه های تخصصی:
Teacher assessment identity (TAI) is an integral element of quality assessment practice and professionalism, which has been widely overlooked in teacher education research. To bridge this gap, the present study explored the dynamism of TAI through an explicit training course. To do so, 22 novice and experienced Iranian EFL teachers completed a questionnaire on TAI before and after the course. Then both novice and experienced teachers attended a 16-hour treatment in which various assessment dimensions were taught. The participants also completed tasks reflecting TAI components. Moreover, five novice and five experienced EFL teachers were interviewed. The results of the independent samples t-test indicated that novice and experienced Iranian EFL teachers did not significantly differ regarding their assessment identity before and after the treatment despite improvements in their mean scores. However, the results of paired samples t-test revealed that the formal intervention has developed the TAI of both novice and experienced groups. This was substantiated by qualitative findings which demonstrated that, owing to the training, novice teachers manifested TAI in 16 domains while experienced teachers showed their identity as assessors in five domains. The study offers promising implications for EFL teachers and trainers, who can realize the dynamism of TAI and its core dimensions in light of instruction.
Perceptual (mis)matches between learners’ and teachers’ rating criteria in the Iranian EFL writing self-assessment context
حوزه های تخصصی:
As a formative assessment procedure, self-assessment aims to converge learners’ and teachers’ views in assessment. Hence, reducing the perceptual mismatches between the learners’ and the teachers’ assessment would positively affect the learning process. For this aim, the present study investigated to what extent the learners’ assessment of their writing before and after being provided with a list of rating criteria, agrees that of their teachers. Therefore, a body of 6 EFL writing teachers and 27 EFL learners participated in this study. The learners were asked to rate their writing before and after being provided with rating criteria developed by the researchers. The teachers also rated the students’ writings following the same criteria. The obtained results showed a significant difference between the students' scores on the first and second assessment occasions. The teachers’ and the students’ assessment on the second time also were found to significantly correlate. Moreover, the analysis of the students’ comments showed that while they rated their writing on some limited aspects of writing in the first rating occasion, they assessed their essays using more components in the second assessment phase. Overall, the findings revealed that providing the learners with rating criteria would not only reduce the perceptual mistaches between the students’ and the teachers’ assessment but through involving the students’ voices in their assessment would promote democratic classroom assessment. Pedagogical implications of the study are discussed.
An Investigation of Written Corrective Feedback in EFL Writing Assessment: How Teachers’ Feedback Practices Meet Students’ Expectations
حوزه های تخصصی:
Written corrective feedback (WCF) becomes an essential part of students writing development. The efficacy of WCF for EFL learners has been the subject of numerous studies. However, the studies neglected the idea of the feedback literacy. Thus, this study aimed to investigate how two English teachers provide WCF, what students expect from their teachers, and whether or not both students and teachers share common grounds seen from the feedback literacy framework. Thus, thirty EFL students and two teachers from EFL writing classes of one Indonesian Islamic university participated in this study. The data related to students' expectations of WCF and teachers' practices were gathered through open ended questionnaire, interview, and observations. Positive attitudes toward WCF were shared by students and teachers. It was discovered that students place a higher value on direct feedback rather than indirect feedback. Students perceived their learning motivation is best preserved through a positive tone of feedback, such as interaction with teachers and teachers' compliments on their strength instead comments on deficiencies. The study indicated that there was a gap between students’ expectation and teacher practices about the type and relevance of feedback. Students perceived indirect feedback as less helpful in addressing their specific writing issues. While teachers have demonstrated feedback literacy in their practices, students' readiness and ability to fully engage in feedback literacy may vary depending on their proficiency levels and developmental stages. To bridge this gap, both students and teachers need to have good literacy in feedback. The findings highlight the potential significance of further exploring the impact of cultural factors in driving WCF decisions.