Content related to the keyword

Item response theory


1.

On the Use of Offline Short Tests for Scoring and Classifying Purposes (Ministry of Science accredited article)

Keywords: Offline short form; Item response theory; Item parameter; Conventional test

In response to the increasing interest in and need for a practical brief measure in language testing, this study explored the properties of an offline short-form test (OSF) versus a conventional lengthy test. From a total of 98 vocabulary items pooled from the Iranian National University Entrance Exams, 60 items were selected for the conventional test (CT). To build the OSF, we created an item bank by examining the item response theory (IRT) parameter estimates. Data for the IRT calibration included the responses of 774,258 examinees. Based on the results of the item calibration, 43 items with the highest discrimination power and minimal guessing values across different levels of ability were selected for the item bank. Then, using the responses of 253 EFL learners, we compared the measurement properties of the OSF scores with those of the CT scores in terms of score precision, score comparability, and consistency of classification decisions. The results revealed that although the OSF generally did not achieve the same level of measurement precision as the CT, it still achieved a desired level of precision while lessening the negative effects of a lengthy test. The results also indicated an excellent degree of correspondence between OSF and CT scores and classification results. In all, the findings suggest that the OSF can stand as a reasonable alternative to a longer test, especially when conditions dictate that a very short test be used.
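As a rough illustration of the kind of item-bank screening described above, the following Python sketch (not the authors' actual calibration pipeline; all parameter values and cut-offs are hypothetical) computes the three-parameter logistic (3PL) response probability and keeps only items with high discrimination and low guessing:

```python
import numpy as np

def p_3pl(theta, a, b, c):
    """3PL probability of a correct response: P = c + (1 - c) / (1 + exp(-a * (theta - b)))."""
    return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical calibrated parameters: discrimination a, difficulty b, pseudo-guessing c
items = [
    {"id": 1, "a": 1.8, "b": -0.5, "c": 0.12},
    {"id": 2, "a": 0.6, "b": 0.3, "c": 0.28},
    {"id": 3, "a": 1.4, "b": 1.1, "c": 0.15},
]

# Keep only highly discriminating, low-guessing items (illustrative cut-offs)
bank = [it for it in items if it["a"] >= 1.0 and it["c"] <= 0.20]
print([it["id"] for it in bank])                     # -> [1, 3]
print(round(p_3pl(0.0, a=1.8, b=-0.5, c=0.12), 3))   # probability of success at theta = 0
```

In a real calibration, the a, b, and c estimates would come from fitting the 3PL model to the large examinee response matrix rather than being set by hand.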
2.

The Retrofit of an English Language Placement Test Used for Large-scale Assessments in Higher Education

Keywords: English in higher education; item quality; Item response theory; large-scale assessment; placement tests

Language placement tests (LPTs) are used to assess students’ proficiency in the target language in a progressive manner. Based on their performance, students are assigned to stepped language courses. These tests are usually considered low stakes because they do not have significant consequences in students’ lives, which is perhaps the reason why studies conducted with LPTs are scarce. Nevertheless, tests should be regularly examined, and statistical analysis should be conducted to assess their functioning, particularly when they have a medium or high-stakes impact. In the case of LPTs administered on a large scale, the logistical and administrative consequences of an ill-defined test may lead to an economic burden and unnecessary use of human resources, which can also affect students negatively. This study was undertaken at one of the largest public institutions in Latin America, where nearly 1700 students sit an English LPT every academic semester. A diagnostic statistical analysis revealed a need for revision. To retrofit the test, a new test architecture and blueprints were designed in adherence to the new curriculum, and new items were developed and tried out gradually in several pilot studies. Item Response Theory (IRT) was used to examine the functioning of the new test items. The aim of this study is to show how the test was retrofitted and to compare the functioning of the retrofitted version of the English LPT with the previous one. The results show that the quality of the new items was higher than that of the items in the former English LPT.
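One common IRT lens for comparing item quality before and after a retrofit is the item information function. The sketch below (a minimal illustration under the two-parameter logistic model with hypothetical parameter values, not the study's actual analysis) contrasts a weakly discriminating item with a more discriminating replacement:

```python
import numpy as np

def info_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta: I = a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a ** 2 * p * (1.0 - p)

theta = np.linspace(-3, 3, 121)
old_item = info_2pl(theta, a=0.7, b=0.2)   # hypothetical weakly discriminating item
new_item = info_2pl(theta, a=1.6, b=0.2)   # hypothetical retrofitted item

# A higher information curve means more measurement precision near the placement cut score
print(f"peak information, old item: {old_item.max():.2f}")
print(f"peak information, new item: {new_item.max():.2f}")
```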
3.

Distractor Analysis in Multiple-Choice Items Using the Rasch Model

Keywords: Distractor analysis; Item response theory; Multiple-choice items; Rasch model

The multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher education assessments and found many flawed items, especially in terms of distractors. Such faulty items lead to misleading conclusions about student performance and, consequently, to flawed final decisions. Distractor analysis is typically conducted in educational assessments with multiple-choice items to ensure that high-quality items are used as the basis of inference. Item response theory (IRT) and Rasch models have received little attention for analyzing distractors. For that reason, the purpose of the present study was to apply the Rasch model to a grammar test in order to analyze the distractors of its items. To achieve this, the current study investigated the quality of 10 instructor-written MC grammar items used in an undergraduate final exam, using the item responses of 310 English as a foreign language (EFL) students who had taken part in an advanced grammar course. The results showed acceptable fit to the Rasch model and high reliability, and malfunctioning distractors were identified.
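For readers unfamiliar with distractor analysis, the sketch below (a classical-style illustration with hypothetical data, complementary to the Rasch-based approach used in the study) shows the basic idea: tally how often each option is chosen and by whom:

```python
import numpy as np

# Hypothetical responses of 8 examinees to one 4-option MC item (key = "B"),
# together with their total test scores (used here as a rough ability proxy)
choices = np.array(["B", "A", "B", "C", "B", "D", "A", "B"])
totals  = np.array([ 9,   4,   8,   5,   10,  3,   6,   7 ])

for option in ["A", "B", "C", "D"]:
    picked = choices == option
    share = picked.mean()
    mean_total = totals[picked].mean() if picked.any() else float("nan")
    print(f"option {option}: chosen by {share:.0%}, mean total score {mean_total:.1f}")

# A distractor that is rarely chosen, or chosen mainly by high-scoring examinees,
# is flagged as malfunctioning, mirroring the kind of misfit a Rasch analysis detects.
```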