A Comparison Of Observed Score Approaches To Detecting Differential Item Functioning Among Multiple Groups

Download A Comparison Of Observed Score Approaches To Detecting Differential Item Functioning Among Multiple Groups PDF/ePub or read online books in Mobi eBooks. Click Download or Read Online button to get A Comparison Of Observed Score Approaches To Detecting Differential Item Functioning Among Multiple Groups book now. This website allows unlimited access to, at the time of writing, more than 1.5 million titles, including hundreds of thousands of titles in various foreign languages.
A Comparison of Observed Score Approaches to Detecting Differential Item Functioning Among Multiple Groups

"The overall purpose of this dissertation was to compare various observed score approaches in detecting differential item functioning among multiple examinee groups simultaneously. Specifically, this study contributes to the literature base by investigating a lasso-constraint observed score method (i.e., logistic regression lasso; LR lasso) in the context of multiple groups as well as features of test design related to test information targets. Given that a lasso-constraint method has not been extended for multiple groups using observed scores, comparisons are made with other observed score techniques (i.e., generalized Mantel-Haenszel x2 and generalized logistic regression) while using item response theory to generate data (thus avoiding model-data congruity complications in the study design). Multiple variables were manipulated in a simulation study at the test-level (e.g., the location of the test information target relative to the central tendency of the examinee population, and the shape of the test information function), item-level (e.g., the location of DIF items relative to the test information target, and the percentage of DIF items), and for simulees (e.g., the amount of impact and sample size balance). The relative lack of literature which explores DIF as it relates to target test information functions provided the exigency for exploring it within this study, along with its typical absence in literature using IRT generation models. Practitioners may find the results useful in judging the merit of adopting the newer lasso method for detecting DIF within multiple groups as opposed to pre-existing methods. Furthermore, the test design features of this study allow for the interpretation to be less theoretical in nature and better aligned with standard operational practices, such as building exams to be optimized at test information targets, for example. The results provide consilience that the LR lasso method has inflated type I error overall with no additional benefit in power. In fact, even when type I error rates are comparable across methods, LR lasso has a lower hit rate in many instances (i.e., higher type II error rate). The sensitivity of LR lasso to detecting DIF items seems to be substantially influenced by having an increased number of DIF items on a form. Recommendations for practitioners, as well as limitations and directions for future research, are provided as well. Taken collectively, the results of the simulation study can be interpreted to support the claim that LR lasso fails to perform comparably with more established methods for multiple groups DIF detection across numerous instances but could potentially have merit in practical application in situations that have yet to be explored. While some limitations of LR lasso were noted within this study, there are a variety of other conditions which need to be explored before practitioners discard the method altogether (a few such studies are suggested). It may well be the case that the added complexity afforded by the regularization in estimating the group-specific model parameters through lasso constraints may confound the detection of the DIF items."--Abstract from author supplied metadata. [This abstract has been edited to remove characters that will not display in this system. Please see the PDF for the full abstract.]
Differential Item Functioning

Test fairness is a moral imperative for both the makers and the users of tests. This book focuses on methods for detecting test items that function differently for different groups of examinees and on using this information to improve tests. Of interest to all testing and measurement specialists, it examines modern techniques used routinely to insure test fairness. Three of these relevant to the book's contents are: * detailed reviews of test items by subject matter experts and members of the major subgroups in society (gender, ethnic, and linguistic) that will be represented in the examinee population * comparisons of the predictive validity of the test done separately for each one of the major subgroups of examinees * extensive statistical analyses of the relative performance of major subgroups of examinees on individual test items.
Handbook of Research Methods in Abnormal and Clinical Psychology

The Handbook of Research Methods in Abnormal and Clinical Psychology presents a diverse range of areas critical to any researcher or student entering the field. It provides valuable information on the foundations of research methods, including validity in experimental design, ethics, and statistical methods. The contributors discuss design and instrumentation for methods that are particular to abnormal and clinical psychology, including behavioral assessment, psychophysiological assessment and observational methods. They also offer details on new advances in research methodology and analysis, such as meta-analysis, taxometric methods, item response theory, and approaches to determining clinical significance. In addition, this volume covers specialty topics within abnormal and clinical psychology from forensic psychology to behavior genetics to treatment outcome methods.