Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 21 |
| Since 2007 (last 20 years) | 62 |
Descriptor
| Statistical Analysis | 82 |
| Test Validity | 82 |
| Test Reliability | 45 |
| Item Response Theory | 41 |
| Foreign Countries | 26 |
| Test Construction | 25 |
| Test Items | 25 |
| Correlation | 19 |
| Factor Analysis | 19 |
| Psychometrics | 18 |
| Multiple Choice Tests | 14 |
| More ▼ | |
Source
Author
| Graf, Edith Aurora | 2 |
| Liu, Ou Lydia | 2 |
| Adkins, Dorothy C. | 1 |
| Alavi, Seyed Mohammad | 1 |
| Alhaythami, Hassan | 1 |
| Allen, Sandra | 1 |
| Aron, Arthur | 1 |
| Aron, Elaine N. | 1 |
| Ashraf, Hamid | 1 |
| Assary, Elham | 1 |
| Baghaei, Purya | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
Location
| Australia | 4 |
| California | 4 |
| Germany | 4 |
| Turkey | 3 |
| Colorado | 2 |
| Iran | 2 |
| United Kingdom | 2 |
| Africa | 1 |
| Asia | 1 |
| Brazil | 1 |
| Canada | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Record Examinations | 2 |
| Trends in International… | 2 |
| Defining Issues Test | 1 |
| Family Assessment Device | 1 |
| Metropolitan Readiness Tests | 1 |
| Test of English as a Foreign… | 1 |
| Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020
In order to reach valid and reliable test scores, various test theories have been developed, and one of them is nonparametric item response theory (NIRT). Mokken Models are the most widely known NIRT models which are useful for small samples and short tests. Mokken Package is useful for Mokken Scale Analysis. An important issue about validity is…
Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics
Blazar, David; Braslow, David; Charalambous, Charalambos Y.; Hill, Heather C. – Educational Assessment, 2017
New systems that seek to evaluate teachers with regard to their classroom quality often rely on observation instruments that capture general instructional pedagogies. However, decades of research suggest that content-specific dimensions of instruction also are important to differentiate teachers and improve student outcomes. We explore the degree…
Descriptors: Mathematics Instruction, Teaching Methods, Observation, Factor Analysis
Smith, Tamarah; Smith, Samantha – International Journal of Teaching and Learning in Higher Education, 2018
The Research Methods Skills Assessment (RMSA) was created to measure psychology majors' statistics knowledge and skills. The American Psychological Association's Guidelines for the Undergraduate Major in Psychology (APA, 2007, 2013) served as a framework for development. Results from a Rasch analysis with data from n = 330 undergraduates showed…
Descriptors: Psychology, Statistics, Undergraduate Students, Item Response Theory
Castellano, Katherine E.; Duckor, Brent; Wihardini, Diah; Telléz, Kip; Wilson, Mark – Teacher Education Quarterly, 2016
With the adoption by most states of the Common Core State Standards (CCSS) for English language arts and literacy and for mathematics (CCSS Initiative, 2010a, 2010b) comes major changes in public education that will affect instructional practice, curriculum, and assessment across the nation. Heritage, Walqui, and Linquanti (2015) argued that the…
Descriptors: Elementary School Mathematics, Mathematics Teachers, Teacher Certification, Language Usage
Smith, Rhonda L.; Eklund, Katie; Kilgus, Stephen P. – School Psychology Quarterly, 2018
The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily…
Descriptors: Behavior Rating Scales, Test Validity, Progress Monitoring, Student Behavior
Longabach, Tanya; Peyton, Vicki – Language Testing, 2018
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…
Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency
Myers, Nicholas D.; Park, Sung Eun; Lefevor, G. Tyler; Dietz, Samantha; Prilleltensky, Isaac; Prado, Guillermo J. – Measurement in Physical Education and Exercise Science, 2016
The purpose of this study was to provide initial validity evidence for measuring multidimensional subjective well-being in a Hispanic sample with the Interpersonal, Community, Occupational, Physical, Psychological, Economic (I COPPE) Scale. Participants were 641 English-speaking adults who self-identified as Hispanic. Bi-factor analyses were used…
Descriptors: Well Being, Comparative Analysis, Correlation, Hispanic Americans
Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018
This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…
Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy
Turkan, Azmi; Cetin, Bayram – Journal of Education and Practice, 2017
Validity and reliability are among the most crucial characteristics of a test. One of the steps to make sure that a test is valid and reliable is to examine the bias in test items. The purpose of this study was to examine the bias in 2012 Placement Test items in terms of gender variable using Rasch Model in Turkey. The sample of this study was…
Descriptors: Item Response Theory, Gender Differences, Test Bias, Test Items
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Hooker, J. F.; Denker, K. J.; Summers, M. E.; Parker, M. – Journal of Computer Assisted Learning, 2016
Previous research into the benefits student response systems (SRS) that have been brought into the classroom revealed that SRS can contribute positively to student experiences. However, while the benefits of SRS have been conceptualized and operationalized into a widely cited scale, the validity of this scale had not been tested. Furthermore,…
Descriptors: Technology Uses in Education, Factor Analysis, Audience Response Systems, Handheld Devices
Pedrosa, Ignacio; Suárez-Álvarez, Javier; Lozano, Luis M.; Muñiz, José; García-Cueto, Eduardo – Journal of Psychoeducational Assessment, 2014
Adolescence is a critical period of life during which significant psychosocial adjustment occurs and in which emotional intelligence plays an essential role. This article provides validity evidence for the Trait Meta-Mood Scale-24 (TMMS-24) scores based on an item response theory (IRT) approach. A sample of 2,693 Spanish adolescents (M = 16.52…
Descriptors: Foreign Countries, Adolescents, Secondary School Students, Emotional Intelligence
Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017
We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…
Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction
Fiedler, Daniela; Tröbst, Steffen; Harms, Ute – CBE - Life Sciences Education, 2017
Students of all ages face severe conceptual difficulties regarding key aspects of evolution-- the central, unifying, and overarching theme in biology. Aspects strongly related to abstract "threshold" concepts like randomness and probability appear to pose particular difficulties. A further problem is the lack of an appropriate instrument…
Descriptors: College Students, Concept Formation, Probability, Evolution

Peer reviewed
Direct link
