Presentation paper
A large-scale, long-term study of scale drift: The micro view and the macro view
2016

Description
The development of measurement scales for use across years and grades in educational settings provides unique challenges, as instructional approaches, instructional materials, and content standards all change periodically. This study examined the measurement stability of a set of Rasch measurement scales that have been in place for almost 40 years.
See MoreView Asset
Related Topics


MAP Growth theory of action
The MAP Growth theory of action describes key features of MAP Growth and its position in a comprehensive assessment system.
By: Patrick Meyer, Michael Dahlin
Products: MAP Growth
Topics: Equity, Measurement & scaling, Test design


Bayesian uncertainty estimation for Gaussian graphical models and centrality indices
This study compares estimation of symptom networks with Bayesian GLASSO- and Horseshoe priors to estimation using the frequentist GLASSO using extensive simulations.
By: Joran Jongerling, Sacha Epskamp, Donald Williams
Topics: Measurement & scaling


MAP Growth item parameter drift study
This report documents the results of an item parameter drift study designed to check the parameter stability of MAP Growth items.
By: Wei He
Products: MAP Growth
Topics: Computer adaptive testing


Changes in school composition during the COVID-19 pandemic: Implications for school-average interim test score use
School officials regularly use school-aggregate test scores to monitor school performance and make policy decisions. In this report, RAND researchers investigate one specific issue that may contaminate utilization of COVID-19–era school-aggregate scores and result in faulty comparisons with historical and other proximal aggregate scores: changes in school composition over time. To investigate this issue, they examine data from NWEA’s MAP Growth assessments, interim assessments used by states and districts during the 2020–2021 school year.
By: Jonathan Schweig, Megan Kuhfeld, Andrew McEachin, Melissa Diliberti, Louis Mariano
Topics: COVID-19 & schools, Measurement & scaling


Examining the performance of the trifactor model for multiple raters
Using simulations, this study examined the “trifactor model,” a recent model developed to address rater disagreement.
By: James Soland, Megan Kuhfeld
Topics: Measurement & scaling


BFpack: Flexible Bayes factor testing of scientific theories in R
In this paper we present a new R package called BFpack that contains functions for Bayes factor hypothesis testing for the many common testing problems. The software includes novel tools for (i) Bayesian exploratory testing (e.g., zero vs positive vs negative effects), (ii) Bayesian confirmatory testing (competing hypotheses with equality and/or order constraints), (iii) common statistical analyses, such as linear regression, generalized linear models, (multivariate) analysis of (co)variance, correlation analysis, and random intercept models, (iv) using default priors, and (v) while allowing data to contain missing observations that are missing at random.
By: Joris Mulder, Donald Williams, Xin Gu, Andrew Tomarken, Florian Böing-Messing, Anton Olsson-Collentine, Marlyne Meijerink-Bosman, Janosch Menke, Robbie van Aert, Jean-Paul Fox, Herbert Hoijtink, Yves Rosseel, Eric-Jan Wagenmakers, Caspar van Lissa
Topics: Measurement & scaling


Putting the individual into reliability: Bayesian testing of homogeneous within-person variance in hierarchical models
This research introduces a novel approach, using the Bayes factor, wherein a researcher can directly test for homogeneous within-person variance in hierarchical models. Additionally, we introduce a membership model that allows for classifying which (and how many) individuals belong to the common variance model.
By: Donald Williams, Stephen Martin, Phillipe Rast
Topics: Measurement & scaling