Robust Scale Transformation Methods In Irt True Score Equating Under Common Item Nonequivalent Groups Design

Download Robust Scale Transformation Methods In Irt True Score Equating Under Common Item Nonequivalent Groups Design PDF/ePub or read online books in Mobi eBooks. Click Download or Read Online button to get Robust Scale Transformation Methods In Irt True Score Equating Under Common Item Nonequivalent Groups Design book now. This website allows unlimited access to, at the time of writing, more than 1.5 million titles, including hundreds of thousands of titles in various foreign languages.
Robust Scale Transformation Methods in IRT True Score Equating Under Common-item Nonequivalent Groups Design

Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which usually leads to enlarged random equating error and inadequate content representation of common items. New robust scale transformation methods based on robust regression, the robust Deming regression method, the robust Haebara method, and the least absolute values (LAV) method, were proposed. In simulation studies, performances of the proposed methods were compared to the Stocking-Lord method which yields the least equating errors among the traditional method and to outlier removal methods. The results indicate: 1) the robust Haebara method and the LAV method usually outperform the robust Deming regression method, 2) the robust Haebara method and the LAV method perform as well as the Stocking Lord method under the condition of No outlier, 3) the robust Haebara method and the LAV method perform better than the Stocking-Lord method when a single outlying common item is simulated, 4) the LAV method and the robust Haebara method are better than, or at least comparable to, the existing outlier removal methods in the presence of a single outlying common item, and 5) the LAV method and the robust Haebara method have smaller equated scores than the Stocking-Lord method using the CBASE data of English and Mathematics.
D-scoring Method of Measurement

D-scoring Method of Measurement presents a unified framework of classical and latent measurement referred to as D-scoring method of measurement (DSM). Provided are detailed descriptions of DSM procedures and illustrative examples of how to apply the DSM in various scenarios of measurement. The DSM is designed to combine merits of the traditional CTT and IRT for the purpose of transparency, ease of interpretations, computational simplicity of test scoring and scaling, and practical efficiency, particularly in large-scale assessments. Through detailed descriptions of DSM procedures, this book shows how practical applications of such procedures are facilitated by the inclusion of operationalized guidance for their execution using the computer program DELTA for DSM-based scoring, equating, and item analysis of test data. In doing so, the book shows how DSM procedures can be readily translated into computer source codes for other popular software packages such as R. D-scoring Method of Measurement equips researchers and practitioners in the field of educational and psychological measurement with a comprehensive understanding of the DSM as a unified framework of classical and latent scoring, equating, and psychometric analysis.
Test Equating, Scaling, and Linking

Author: Michael J. Kolen
language: en
Publisher: Springer Science & Business Media
Release Date: 2013-03-14
Test equating methods are used with many standardized tests in education and psychology to ensure that scores from multiple test forms can be used interchangeably. In recent years, researchers from the education, psychology, and statistics communities have contributed to the rapidly growing statistical and psychometric methodologies used in test equating. This book provides an introduction to test equating which both discusses the most frequently used equating methodologies and covers many of the practical issues involved. This second edition expands upon the coverage of the first edition by providing a new chapter on test scaling and a second on test linking. Test scaling is the process of developing score scales that are used when scores on standardized tests are reported. In test linking, scores from two or more tests are related to one another. Linking has received much recent attention, due largely to investigations of linking similarly named tests from different test publishers or tests constructed for different purposes. The expanded coverage in the second edition also includes methodology for using polytomous item response theory in equating. The themes of the second edition include: * the purposes of equating, scaling and linking and their practical context * data collection designs * statistical methodology * designing reasonable and useful equating, scaling, and linking studies * importance of test development and quality control processes to equating * equating error, and the underlying statistical assumptions for equating "Given the perennial debates about academic standards and grade inflation, it is my view that tools such as those described in this book should be adopted much more widely by the academic community than they are at present. This book provides an excellent overview, and I strongly recommend it." Short Book Reviews of the ISI, April2005 "I highly recommend this book to everybody who has any interest in equating and linking, be they a student, practitioner, or researcher." Psychometrika, 2006