مطالب مرتبط با کلیدواژه
۲۱.
۲۲.
۲۳.
۲۴.
۲۵.
۲۶.
۲۷.
۲۸.
۲۹.
۳۰.
۳۱.
Validity
حوزههای تخصصی:
The objective of this study was to validate a bilingual Spanish-English version of the Vocabulary Size Test (VST) considering its potential use as a discriminator between learners in terms of language competence. This version was designed based on the two forms available on one of the creators’ websites as well as considering practices recommended regarding the elimination of cognates and loans. A one-way ANOVA test was used to confirm the test’s capacity to discriminate among learners of different linguistic competence. Additionally, Principal Axis Factoring (PAF) was conducted to revise the existence of only one underlying variable. As a result of this study, a VST version for Spanish speakers consisting of 9 vocabulary frequency levels is shared. This version is in line with validation standards put forward in previous research. It is expected that this instrument will help future studies that seek to measure Spanish speakers’ competence in English as a foreign or second language without having to deal with the interference of other intervening factors.
Psychometric Evaluation of Cloze Tests with the Rasch Model
منبع:
International Journal of Language Testing, Volume ۱۲, Issue ۲, Summer and Autumn ۲۰۲۲
95 - 106
حوزههای تخصصی:
Cloze tests are gap-filling tests designed to measure overall language ability and reading comprehension in a second language. Due to their ease of construction and scoring, cloze tests are widely used in the context of second and foreign language testing. Previous research over the past decades has shown the reliability and validity of cloze tests in different contexts. However, due to the interdependent structure of cloze test items, item response theory models have not been applied to analyze cloze tests. In this research, we apply a method to circumvent the problem of local dependence for analyzing cloze tests with the Rasch model. Using this method, we applied the Rasch model to a cloze test composed of eight passages each containing 8-15 gaps. Findings showed that the Rasch model fits the data and thus it is possible to scale persons and cloze passages on an interval unidimensional scale. The test had a high reliability and was well-targeted to the examinees. Implications of the study are discussed.
Reliability and Validity of Self-Assessments among Iranian EFL University Students
منبع:
International Journal of Language Testing, Volume ۱۳, Issue ۱, Winter and Spring ۲۰۲۳
225 - 235
حوزههای تخصصی:
Modern teaching practices emphasize learner autonomy and learner-centered approaches to language learning. Such teaching methods require corresponding assessment approaches. Self-assessment is viewed as an assessment mode which matches modern learner-centered teaching methodologies. However, the validity and reliability of self-assessments are not yet conclusively established. This study aimed to provide validity and reliability evidence for self-assessments among Iranian EFL university learners. The Common European Framework of Reference (CEFR) Self-Assessment Grid was translated into Persian and was given to a sample of Iranian undergraduate students of English. A C-Test battery containing four passages was used as a criterion for concurrent validation. Self-assessments of university EFL learners were examined for internal consistency and test-retest reliability. Findings showed that while self-assessments are highly reliable they lack validity as evidenced with low correlations between components of self-assessment grid and the C-Test. The implications of the study for the application of self-assessments in foreign language education are discussed.
Validity and reliability of the Iranian force plate(مقاله علمی وزارت علوم)
منبع:
Sport Sciences and Health Research, Volume ۱۴, Issue ۱,۲۰۲۲
109 - 114
حوزههای تخصصی:
Background: Force plates are widely used in biomechanics and sports sciences to measure various aspects of human movement. The accuracy and reliability of force plate measurements are critical for valid data interpretation. Aim: The purpose of the study was to evaluate the validity and reliability of the Iranian force plate in the vertical, anterior-posterior, and medial-lateral directions using two manual dynamometers and a load cell. Materials and Methods: In this study, the force plate device utilized had a frequency of 1200 Hz and was manufactured by the Danesh Salar Iranian Company. Additionally, to determine the device's validity, we used Lafayette hand-held dynamometers manufactured in the United States and a load cell by Zemik. Pearson's correlation coefficient was employed to determine the validity of the force plate, while the internal consistency coefficient (ICC) was used to assess the force plate's reliability. Results: The study findings indicated a significant and high level of reliability between the maximum force obtained from the force plate device and manual dynamometer devices and load cell. Additionally, the internal consistency coefficient was found to be excellent (very high) for 20 trials in the three directions of vertical (0.98), anterior-posterior (0.96), and medial-lateral (0.97). Conclusion: The study demonstrated that the Iranian force plate is a reliable device for measuring maximum force in the three directions of vertical, anterior-posterior, and medial-lateral, with very high validity.
Investigating Psychometric properties of the Scale of Emotional Experience towards the Spouse(مقاله علمی وزارت علوم)
حوزههای تخصصی:
The objective of the present study was to establish and assess the psychometric properties of the scale of emotional experience towards the spouse in 2018-19. For this purpose, all the married women in the city of Isfahan were considered as the statistical population from which 300 married women were selected as the statistical sample using convenience sampling. The research instruments included the scale of emotional experience towards the spouse, extroversion and introversion subscales of NEO Personality Inventory (Costa & McCrae, 1992), and triangulation (Dehghan and Yousefi, 2019). The data were analyzed using descriptive statistics, mean and standard deviation) and inferential statistics (correlation analysis, exploratory factor analysis, and norm determination. Convergent validity and divergent validity results revealed that the subscale of negative emotional experience towards the spouse was significantly positively related to neuroticism and triangulation (convergent validity), but negatively related to extroversion (divergent validity). The subscale of positive emotional experience towards the spouse, on the other hand, had a positive relationship to extroversion (convergent validity) and a significantly negative relationship to neuroticism and triangulation (divergent validity). Exploratory factor analysis showed two basic factors called positive emotional experience towards the spouse and negative emotional experience towards the spouse. Test-retest coefficients, at a three-week interval, confirmed test-retest reliability. Thus, based on what the results revealed, this test can be used to assess the scale of emotional experience towards the spouse in the married women both in research and psychotherapy.
A systematic review of validity and reliability assessment of measuring Spasticity Evaluation Tool and Wheelchair Skills Tests at the level of international classification of functioning, disability and health (ICF) in people with spinal cord injury(مقاله علمی وزارت علوم)
منبع:
Sport Sciences and Health Research, Volume ۱۵, Issue ۲, ۲۰۲۳
203 - 217
حوزههای تخصصی:
Background: Assessment of spasticity and wheelchair skills performance is important in both clinical practice and research.Aim: The present study aimed to systematically review the psychometric properties (reliability and validity) of outcome measures used to assess spasticity and wheelchair skill tests in people with spinal cord injury.Materials and Methods: A search was conducted using terms through PubMed, Embase, Scopus, and Web of Science databases. Related articles included measures of spinal cord injury patients published in English from 2010 to 2021.To determine the publication quality of studies COSMIN checklist was used.Results: A total of 2150 potentially eligible studies were retrieved from four databases. The remaining 20 full-text studies were retrieved for complete review. Finally, 12 studies involving a total of 658 participants were included in the systematic review.Conclusion: Ethical, safety, and psychological issues were considered during the test for people with disabilities. According to previous studies, the Spasticity Evaluation Tool has been suggested as a reliable tool for assessing spasticity in SCI subjects. However, due to the variety of tests and the elimination of selected tools, wheelchair skills tests cannot be recommended.
Validation of C-Test among Iraqi EFL University Students
حوزههای تخصصی:
This study aims to assess the performance of university students through the C-Test and to analyze the extent to which this test is valid in measuring language ability. A standardized C-Test has been created with four brief passages, each containing 20 gaps. The length of each passage varied from 95 to 109 words. Throughout each passage, only the first and last sentences were not changed. The test was taken by 100 students; 39 were male and 61 were female at Al-Nisour University/Department of English in Baghdad, Iraq. The sample consists of two groups. Both groups come from the same school and would receive similar educational input in both cases based on their grade level. The validity and reliability of the C-Test were investigated using various techniques. The study analyzed the performance of Stage 4 and Stage 3 students on the Common Language Proficiency Test in Iraq. The results showed that the test discriminates well between high-ability and low-ability examinees, with no significant difference between the two groups. The Rasch model separation reliability was relatively high, and the data were one-dimensional. The students faced difficulties in guessing the most appropriate words due to their limited English proficiency. The results suggest that developing and implementing this test could significantly improve students' academic achievements in basic foreign language classes in Iraq.
Normative Study and Psychometric Properties of the Digital Quotient Test in Children and Adolescents Aged 8-18 in the Iranian Community
حوزههای تخصصی:
Objective: Digital Quotient (DQ) refers to a comprehensive set of digital competencies derived from universal ethical values that aim to enhance human interaction with, control, and create technology. The present study aimed to establish norms and examine the psychometric properties of the Digital Quotient Test in children and adolescents aged 8-18 in the Iranian community.Methods: This study's statistical population included students of the First and Second Elementary Schools and the First and Second Secondary Schools of Tehran in the academic year 2020-2021. A total of 521 students (277 girls and 244 boys) were examined using a convenience sampling method. To analyze the data obtained from the test, inferential statistics to determine construct validity, Pearson correlation matrix, and test-retest reliability using SPSS software version 26.Results: The results indicated that the construct validity of the Digital Quotient Test, using the internal consistency between its eight domains and the total score as evidence for this validity, was found to be appropriate (P < 0.05). Using the test-retest method with a coefficient of 0.872, the test reliability was estimated to be appropriate (P < 0.01).Conclusion: The Digital Quotient Test has appropriate validity and reliability in children and adolescents aged 8-18 years in Iranian community.
Validity of the Persian translation of the COVID-19 Attitudes and Behaviors (ACAB)(مقاله علمی وزارت علوم)
حوزههای تخصصی:
Introduction: Of particular global concern is the coronavirus disease of 2019 (COVID-19) outbreak. All Persian versions of COVID-19 measures assess the intrapsychic aspects of it, and there is a crucial need to measure the intergroup aspects of this pandemic. Aim: The current study aims to validate the Persian version of COVID-19 attitudes and behaviors in the Iranian sample. Method: The participants included 250 people from all over Iran in cyberspace who were selected availability (177 men and 73 women). They voluntarily participated in the study by filling out questionnaires that were made available through Google Forms and then disseminated online. Results: The ACAB scale had satisfactory reliability and validity according to content, face, and construct validity tests except for the first subscale (social distancing adjustment). Consequently, confirmatory factor analysis supported the ACAB with 12-item and three subscales. Therefore, three subscales remained, including self-prioritization, prosocial behaviors, and belief in conspiracies, and social distancing adjustment was eliminated because the factor loading values of its items were less than 0.4. Conclusion: Results indicated that the ACAB is a reliable and helpful tool in research, especially for governmental surveys to understand why people do not cooperate in vaccination or prosocial behaviors.
The validity and reliability of the Persian version of passions athlete adults
حوزههای تخصصی:
The passion scale mainly focuses on the passion for achievement or becoming good in some area/theme/skill. This study aimed to translate the passion scale and assess reliability and content and construct validity for the passion scale in athletic adults in Tehran city. A cross-cultural translation was used to generate a Persian-English version of the passion scale. A total of 200 athletes adults with age 26/79 ± 5/01 completed Persian version of passion scale (PS), enabling us to investigate its feasibility, content validity, internal consistency, construct validity and test-retest reliability. 30 athletes adults stated that all the questionnaire items were simple, clear, and related to the objectives. The overall pattern of results suggests that the scale for passion presented here is applicable for the age studied. The calculated CVI and CVR were 0.94 and 0.91, respectively. All individual item scores correlated positively with the total score, with correlations ranging from 0.67 to 0.81. The Cronbach's alpha value for the standardized items was 0.88. Pearson correlations coefficient between total score passion scale and Grit-S scale were 0.53 for athletic adults. Intra class correlation coefficients (ICCs) between test and retest scores for the total score was 0.92. The results of this study showed that this Persian version of passion scale in athletes adults has a good validity and reliability and can be used in investigating passion of athletes adults.
Examining Indicators of Validity in Online Formative Assessment: Insights from Iranian EFL Teachers(مقاله پژوهشی دانشگاه آزاد)
منبع:
The Journal of English Language Pedagogy and Practice, Vol.۱۶, No.۳۳, Fall & Winter ۲۰۲۳
201 - 223
حوزههای تخصصی:
Valid online formative assessments are crucial for accurate measurement of students' progress and effective pedagogical decision-making in digital learning. This quantitative-based study followed two primary aims. First, it aimed to investigate the extent to which Iranian EFL teachers working in universities and language institutes apply indicators of online formative assessment validity. Twenty-one online classrooms were observed in three sessions using a checklist. The second aim of this study was to determine the effect of EFL teachers’ place of living on the validity of online formative assessment. To this end, 316 Iranian EFL teachers from diverse EFL settings, including public schools, private schools, language institutes, and universities were asked to fill out online formative assessment validity scale developed by Maleki et al. (2023). The sample included both male and female teachers with varying age group ranges and academic degrees. The findings of the study indicated that Iranian EFL teachers in universities and language institutes tend to overlook indicators associated with the learner-centered aspects of online formative assessment validity. Furthermore, it was revealed that EFL teachers’ place of living could impact the validity of online formative assessment. This study has several implications for online EFL teachers and policymakers. The findings of this study emphasize the context-bound nature of validity in online formative assessment. Besides, it helps Iranian EFL teachers identify specific areas that need more attention and improvement in order to enhance the validity of their online formative assessments.