Concurrent validity of the independent reading level assessment framework and a state assessment
This study investigates the use of screening assessments within the increasingly popular Response to Intervention (RTI) framework, specifically seeking to collect concurrent validity evidence on one potential new screening tool, the Independent Reading Level Assessment (IRLA) framework.
By: Beth Tarasawa, Nicole Ralston, Jacqueline Waggoner, Amy Jackson
Topics: Empowering educators, Measurement & scaling, Reading & language arts
Students improve even amid evaluation controversy
Positive student achievement and growth results for students in New York suggest that improvements to the teacher evaluation process that emphasize the importance of strong evaluation procedures, the systematic collection of evidence of teacher performance, and the use of data to inform the process, have promise for improving educator effectiveness far more than a narrower punitive approach.
Modeling student test-taking motivation in the context of an adaptive achievement test
This study examined the utility of response time‐based analyses in understanding the behavior of unmotivated test takers. For the data from an adaptive achievement test, patterns of observed rapid‐guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent unmotivated test taking behavior.
Topics: Innovations in reporting & assessment, Measurement & scaling, School & test engagement
Are all biases bad? Collaborative grounded theory in developmental evaluation of education policy
By: Ross Anderson, Meg Guerreiro, Jo Smith
The major purpose of this paper is to investigate the effects of CAT test design and bank distribution on the content coverage and the efficiency of the tests.
By: Shudong Wang, Hong Jiao
Topics: Test design, Computer adaptive testing, Learning standards & alignment
A large-scale, long-term study of scale drift: The micro view and the macro view
This study examined the measurement stability of a set of Rasch measurement scales that have been in place for almost 40 years.
Rapid‐guessing behavior: Its identification, interpretation, and implications
The rise of computer‐based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple‐choice items.
By: Steven Wise
Topics: Measurement & scaling, Innovations in reporting & assessment, School & test engagement