Presentation
Parameter estimation accuracy of the effort-moderated IRT model under multiple assumption violations
September 2020
By: James Soland, Joseph Rios

Description
This session from the 2020 National Council on Measurement in Education virtual conference presents new research findings on understanding and managing test-taking disengagement.
Soland, J.& Rios, J. (2020, September). Parameter estimation accuracy of the effort-moderated IRT model under multiple assumption violations. National Council on Measurement in Education 2020 virtual conference.
See MoreSee the presentation
Related Topics


MAP Growth theory of action
The MAP Growth theory of action describes key features of MAP Growth and its position in a comprehensive assessment system.
By: Patrick Meyer, Michael Dahlin
Products: MAP Growth
Topics: Equity, Measurement & scaling, Test design


Bayesian uncertainty estimation for Gaussian graphical models and centrality indices
This study compares estimation of symptom networks with Bayesian GLASSO- and Horseshoe priors to estimation using the frequentist GLASSO using extensive simulations.
By: Joran Jongerling, Sacha Epskamp, Donald Williams
Topics: Measurement & scaling


Changes in school composition during the COVID-19 pandemic: Implications for school-average interim test score use
School officials regularly use school-aggregate test scores to monitor school performance and make policy decisions. In this report, RAND researchers investigate one specific issue that may contaminate utilization of COVID-19–era school-aggregate scores and result in faulty comparisons with historical and other proximal aggregate scores: changes in school composition over time. To investigate this issue, they examine data from NWEA’s MAP Growth assessments, interim assessments used by states and districts during the 2020–2021 school year.
By: Jonathan Schweig, Megan Kuhfeld, Andrew McEachin, Melissa Diliberti, Louis Mariano
Topics: COVID-19 & schools, Measurement & scaling


Examining the performance of the trifactor model for multiple raters
Using simulations, this study examined the “trifactor model,” a recent model developed to address rater disagreement.
By: James Soland, Megan Kuhfeld
Topics: Measurement & scaling


BFpack: Flexible Bayes factor testing of scientific theories in R
In this paper we present a new R package called BFpack that contains functions for Bayes factor hypothesis testing for the many common testing problems. The software includes novel tools for (i) Bayesian exploratory testing (e.g., zero vs positive vs negative effects), (ii) Bayesian confirmatory testing (competing hypotheses with equality and/or order constraints), (iii) common statistical analyses, such as linear regression, generalized linear models, (multivariate) analysis of (co)variance, correlation analysis, and random intercept models, (iv) using default priors, and (v) while allowing data to contain missing observations that are missing at random.
By: Joris Mulder, Donald Williams, Xin Gu, Andrew Tomarken, Florian Böing-Messing, Anton Olsson-Collentine, Marlyne Meijerink-Bosman, Janosch Menke, Robbie van Aert, Jean-Paul Fox, Herbert Hoijtink, Yves Rosseel, Eric-Jan Wagenmakers, Caspar van Lissa
Topics: Measurement & scaling


Putting the individual into reliability: Bayesian testing of homogeneous within-person variance in hierarchical models
This research introduces a novel approach, using the Bayes factor, wherein a researcher can directly test for homogeneous within-person variance in hierarchical models. Additionally, we introduce a membership model that allows for classifying which (and how many) individuals belong to the common variance model.
By: Donald Williams, Stephen Martin, Phillipe Rast
Topics: Measurement & scaling


GGMnonreg: Non-regularized Gaussian graphical models in R
Graphical modeling has emerged recently in psychology (Epskamp et al. 2018), where the data is typically long or low-dimensional (p < n; Donald R. Williams et al. (2019),
Donald R. Williams & Rast (2019)). The primary purpose of GGMnonreg is to provide methods that were specifically designed for low-dimensional data (e.g., those common in the
social-behavioral sciences).
By: Donald Williams
Topics: Measurement & scaling