Comparing different response time threshold setting methods to detect low effort on a large-scale assessment

Journal article

Comparing different response time threshold setting methods to detect low effort on a large-scale assessment

April 2021

Published in:

Large-scale Assessments in Education 9, 8 https://doi.org/10.1186/s40536-021-00100-w

By: James Soland, Megan Kuhfeld, Joseph Rios

Abstract

Low examinee effort is a major threat to valid uses of many test scores. Fortunately, several methods have been developed to detect noneffortful item responses, most of which use response times. To accurately identify noneffortful responses, one must set response time thresholds separating those responses from effortful ones. While other studies have compared the efficacy of different threshold-setting methods, they typically do so using simulated or small-scale data. When large-scale data are used in such studies, they often are not from a computer-adaptive test (CAT), use only a handful of items, or do not comprehensively examine different threshold-setting methods. In this study, we use reading test scores from over 728,923 3rd–8th-grade students in 2,056 schools across the United States taking a CAT consisting of nearly 12,000 items to compare threshold-setting methods. In so doing, we help provide guidance to developers and administrators of large-scale assessments on the tradeoffs involved in using a given method to identify noneffortful responses.

Visit the journal

This article was published outside of NWEA. The full text can be found at the link above.

Topics: School & test engagement

Related Topics

Date

Journal article

An investigation of examinee test-taking effort on a large-scale assessment

Most previous research involving the study of response times has been conducted using locally developed instruments. The purpose of the current study was to examine the amount of rapid-guessing behavior within a commercially available, low-stakes instrument.

By: Steven Wise, J. Carl Setzer, Jill R. van den Heuvel, Guangming Ling

Topics: Measurement & scaling, School & test engagement, Student growth & accountability policies

2013

Journal article

The utility of adaptive testing in addressing the problem of unmotivated examinees

This integrative review examines the motivational benefits of computerized adaptive tests (CATs), and demonstrates that they can have important advantages over conventional tests in both identifying instances when examinees are exhibiting low effort, and effectively addressing the validity threat posed by unmotivated examinees.

By: Steven Wise

Topics: Measurement & scaling, Innovations in reporting & assessment, School & test engagement

2014

Journal article

Effort analysis: Individual score validation of achievement test data

Whenever the purpose of measurement is to inform an inference about a student’s achievement level, it is important that we be able to trust that the student’s test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student’s test event is not unduly influenced by construct-irrelevant factors that could distort his score. This article examines one such factor—test-taking motivation—that tends to induce a person-specific, systematic negative bias on test scores.

By: Steven Wise

Topics: Measurement & scaling, Innovations in reporting & assessment, School & test engagement

2015

Journal article

Response time as an indicator of test taker speed: assumptions meet reality

The growing presence of computer-based testing has brought with it the capability to routinely capture the time that test takers spend on individual test items. This, in turn, has led to an increased interest in potential applications of response time in measuring intellectual ability and achievement. Goldhammer (this issue) provides a very useful overview of much of the research in this area, and he provides a thoughtful analysis of the speed-ability trade-off and its impact on measurement.

By: Steven Wise

Topics: Measurement & scaling, Innovations in reporting & assessment, School & test engagement

2015

Working paper

Modeling student test-taking motivation in the context of an adaptive achievement test

This study examined the utility of response time-based analyses in understanding the behavior of unmotivated test takers. For an adaptive achievement test, patterns of observed rapid-guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent unmotivated test taking behavior.

Topics: Measurement & scaling, Growth modeling, School & test engagement

2015

Journal article

Modeling student test-taking motivation in the context of an adaptive achievement test

This study examined the utility of response time‐based analyses in understanding the behavior of unmotivated test takers. For the data from an adaptive achievement test, patterns of observed rapid‐guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent unmotivated test taking behavior.

Topics: Innovations in reporting & assessment, Measurement & scaling, School & test engagement

2016

Journal article

Rapid‐guessing behavior: Its identification, interpretation, and implications

The rise of computer‐based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple‐choice items.

By: Steven Wise

Topics: Measurement & scaling, Innovations in reporting & assessment, School & test engagement

2017