Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 19 |
| Since 2007 (last 20 years) | 38 |
Descriptor
| Test Validity | 57 |
| Data Collection | 28 |
| Test Reliability | 28 |
| Test Construction | 26 |
| Data Analysis | 16 |
| Psychometrics | 12 |
| Scoring | 12 |
| Testing | 12 |
| Testing Programs | 11 |
| Achievement Tests | 10 |
| Item Response Theory | 9 |
| More ▼ | |
Source
Author
| Azin, Mariam | 1 |
| Benjawan Plengkham | 1 |
| Bonner, Cavan V. | 1 |
| Boone, William J. | 1 |
| Boyer, Michelle | 1 |
| Bruck, Margaret | 1 |
| Burkhardt, Amy | 1 |
| Calmettes, Guillaume | 1 |
| Care, Esther | 1 |
| Chen, Haiwen | 1 |
| Cizek, Gregory J. | 1 |
| More ▼ | |
Publication Type
Education Level
| Secondary Education | 10 |
| Early Childhood Education | 8 |
| Junior High Schools | 8 |
| Middle Schools | 8 |
| Elementary Education | 7 |
| Elementary Secondary Education | 7 |
| Grade 3 | 7 |
| Grade 4 | 7 |
| Grade 5 | 7 |
| Grade 6 | 7 |
| Grade 7 | 7 |
| More ▼ | |
Audience
| Practitioners | 3 |
| Researchers | 3 |
| Administrators | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
| National Assessment of… | 3 |
| Program for International… | 2 |
| Advanced Placement… | 1 |
| Conflict Tactics Scale | 1 |
| General Educational… | 1 |
| Graduate Record Examinations | 1 |
| National Teacher Examinations | 1 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Sandip Sinharay; Randy E. Bennett; Michael Kane; Jesse R. Sparks – Journal of Educational Measurement, 2025
Personalized assessments are of increasing interest because of their potential to lead to more equitable decisions about the examinees. However, one obstacle to the widespread use of personalized assessments is the lack of a measurement toolkit that can be used to analyze data from these assessments. This article takes one step toward building…
Descriptors: Test Validity, Data Analysis, Advanced Placement Programs, Art
Benjawan Plengkham; Sonthaya Rattanasak; Patsawut Sukserm – Journal of Education and Learning, 2025
This academic article provides the essential steps for designing an effective English questionnaire in social science research, with a focus on ensuring clarity, cultural sensitivity and ethical integrity. Developed from key insights from related studies, it outlines potential practice in questionnaire design, item development and the importance…
Descriptors: Guidelines, Test Construction, Questionnaires, Surveys
Valenza, Marco; Dreesen, Thomas; Kan, Sophia – UNICEF Office of Research - Innocenti, 2022
One tool that many families own, across the globe, is a basic mobile phone. The use of low-cost basic mobile phones for educational purposes in humanitarian settings is critical where access to connectivity and higher cost devices is limited. The portability of mobile phones, combined with their communication features, offers multiple uses to…
Descriptors: COVID-19, Pandemics, Telecommunications, Handheld Devices
Leighton, Jacqueline P. – Applied Measurement in Education, 2021
The objective of this paper is to comment on the think-aloud methods presented in the three papers included in this special issue. The commentary offered stems from the author's own psychological investigations of unobservable information processes and the conditions under which the most defensible claims can be advanced. The structure of this…
Descriptors: Protocol Analysis, Data Collection, Test Construction, Test Validity
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Ting Zhang; Paul Bailey; Yuqi Liao; Emmanuel Sikali – Large-scale Assessments in Education, 2024
The EdSurvey package helps users download, explore variables in, extract data from, and run analyses on large-scale assessment data. The analysis functions in EdSurvey account for the use of plausible values for test scores, survey sampling weights, and their associated variance estimator. We describe the capabilities of the package in the context…
Descriptors: National Competency Tests, Information Retrieval, Data Collection, Test Validity
Pentimonti, J.; Petscher, Y.; Stanley, C. – National Center on Improving Literacy, 2019
Sample representativeness is an important piece to consider when evaluating the quality of a screening assessment. If you are trying to determine whether or not the screening tool accurately measures children's skills, you want to ensure that the sample that is used to validate the tool is representative of your population of interest.
Descriptors: Sampling, Screening Tests, Measurement, Test Validity
Vista, Alvin; Kim, Helyn; Care, Esther – Center for Universal Education at The Brookings Institution, 2018
The changes in the economy and society in this century have placed a greater emphasis on the skills that citizens need to be successful. This diverse set of skills, often referred to as 21st century skills, and including critical thinking, creativity, problem solving, communication, and socio-emotional skills, among others, are in high demand as…
Descriptors: Data Use, 21st Century Skills, Educational Assessment, Decision Making
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
Flanagan, Agnes; Cormier, Damien C. – Communique, 2019
One of the areas subsumed under the data-based decision making and accountability practice identified in the National Association of School Psychologists' (NASP) "Model for Integrated School Psychological Services" is to collect information on psychological and educational variables to make decisions at a number of levels of service…
Descriptors: Test Bias, School Psychologists, Measurement, Data Collection
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Elturki, Eman – English Teaching Forum, 2020
Accrediting agencies for English language programs, such as the Commission on English Language Program Accreditation (CEA), require a plan in writing for monitoring and reviewing assessment practices. Nonetheless, web-search queries such as "assessing assessment," "how to assess assessment," "assessing assessment…
Descriptors: College Second Language Programs, English (Second Language), Student Evaluation, Test Reliability
Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016
Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…
Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation
Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017
We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…
Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction

Peer reviewed
Direct link
