Publication Date
| In 2026 | 0 |
| Since 2025 | 74 |
| Since 2022 (last 5 years) | 509 |
| Since 2017 (last 10 years) | 1084 |
| Since 2007 (last 20 years) | 2603 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 169 |
| Practitioners | 49 |
| Teachers | 32 |
| Administrators | 8 |
| Policymakers | 8 |
| Counselors | 4 |
| Students | 4 |
| Media Staff | 1 |
Location
| Turkey | 173 |
| Australia | 81 |
| Canada | 79 |
| China | 72 |
| United States | 56 |
| Taiwan | 44 |
| Germany | 43 |
| Japan | 41 |
| United Kingdom | 39 |
| Iran | 37 |
| Indonesia | 35 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Hammann, Marcus; Phan, Thi Thanh Hoi; Ehmer, Maike; Grimm, Tobias – Journal of Biological Education, 2008
This study is concerned with different forms of assessment of pupils' skills in experimentation. The findings of three studies are reported. Study 1 investigates whether it is possible to develop reliable multiple-choice tests for the skills of forming hypotheses, designing experiments and analysing experimental data. Study 2 compares scores from…
Descriptors: Multiple Choice Tests, Experiments, Science Process Skills, Skill Analysis
Julvez, Jordi; Forns, Maria; Ribas-Fito, Nuria; Mazon, Carlos; Torrent, Maties; Garcia-Esteban, Raquel; Ellison-Loschmann, Lis; Sunyer, Jordi – Early Education and Development, 2008
Research Findings: Few rating scales measure social competence in very young Spanish or Catalan children. We aimed to analyze the psychometric characteristics of the California Preschool Social Competence Scale (CPSCS) when applied to a Spanish- and Catalan-speaking population. Children were rated by their respective teachers within 6 months…
Descriptors: Social Class, Test Validity, Rating Scales, Factor Analysis
Gawronski, Bertram; Bodenhausen, Galen V. – Psychological Bulletin, 2006
Replies to commentaries by D. Albarracin, W. Hart, and K. C. McCulloch (see record 2006-10465-004), A. W. Kruglanski and M. Dechesne (see record 2006-10465-005), and R. E. Petty and P. Brinol (see record 2006-10465-006) on B. Gawronski and G. V. Bodenhausen's (2006; see record 2006-10465-003) recently proposed associative-propositional evaluation…
Descriptors: Criticism, Evaluation, Student Attitudes, Interrater Reliability
Wentzel, Carolyn – Journal of Science Education and Technology, 2006
INTEGRITY, an item analysis and statistical collusion detection (answer copying) online application, was reviewed. Features of the software and examples of program output are described in detail. INTEGRITY was found to be easily utilized with an abundance of well-organized documentation and built-in features designed to guide the user through the…
Descriptors: Item Analysis, Computer Software, Multiple Choice Tests, Costs
National Foundation for Educational Research, 2007
Statistical neighbour models provide one method for benchmarking progress. For each local authority (LA), these models designate a number of other LAs deemed to have similar characteristics. These designated LAs are known as statistical neighbours. Any LA may compare its performance (as measured by various indicators) against its statistical…
Descriptors: Benchmarking, Evaluation Methods, Evaluation Research, Research Tools
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007
Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…
Descriptors: Psychiatry, Patients, Error of Measurement, Test Length
Beevers, Christopher G.; Strong, David R.; Meyer, Bjorn; Pilkonis, Paul A.; Miller, Ivan R. – Psychological Assessment, 2007
Despite a central role for dysfunctional attitudes in cognitive theories of depression and the widespread use of the Dysfunctional Attitude Scale, form A (DAS-A; A. Weissman, 1979), the psychometric development of the DAS-A has been relatively limited. The authors used nonparametric item response theory methods to examine the DAS-A items and…
Descriptors: Measures (Individuals), Psychometrics, Depression (Psychology), Item Response Theory
Mullan, Mary; Lewis, Christopher Alan – Journal of Beliefs & Values, 2007
There are few self-report measures of morality. The Religious Status Inventory--"Being Ethical" subscale represents one approach. However, at present there is limited information on the psychometric properties of either the original 20-item version (RSInv-20) or the shortened embedded 10-item version (RSInv-S10). The aim of the present…
Descriptors: Item Analysis, Psychometrics, Ethics, Measures (Individuals)
Horst, S. Jeanne; Finney, Sara J.; Barron, Kenneth E. – Contemporary Educational Psychology, 2007
The current research explored the theory of social goal orientation. More specifically, we conducted three studies utilizing six-independent university student samples to evaluate the construct validity of the Social Achievement Goal Orientation Scale (SAGOS; Ryan & Hopkins, 2003), a measure representing the construct of social goal orientation.…
Descriptors: Measures (Individuals), Validity, Factor Structure, Construct Validity
Webb, Norman L. – Applied Measurement in Education, 2007
A process for judging the alignment between curriculum standards and assessments developed by the author is presented. This process produces information on the relationship of standards and assessments on four alignment criteria: Categorical Concurrence, Depth of Knowledge Consistency, Range of Knowledge Correspondence, and Balance of…
Descriptors: Educational Assessment, Academic Standards, Item Analysis, Interrater Reliability
Elosua, Paula; Lopez-Jauregui, Alicia – International Journal of Testing, 2007
This report shows a classification of differential item functioning (DIF) sources that have an effect on the adaptation of tests. This classification is based on linguistic and cultural criteria. Four general DIF sources are distinguished: cultural relevance, translation problems, morph syntactical differences, and semantic differences. The…
Descriptors: Semantics, Cultural Relevance, Classification, Test Bias
Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008
Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…
Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level
MacSwan, Jeff; Mahoney, Kate – Journal of Educational Research & Policy Studies, 2008
Construct validity concerns for the IPT I Oral Grades K-6 Spanish Second Edition (IPT-S) as a measure of native oral language proficiency are examined. The examination included describing a subset of items that contributes most to overall score and native-language proficiency designation. Correlations between this subset of items and the overall…
Descriptors: Language Research, Oral Language, Language Tests, Construct Validity
Prins, Esther; Toso, Blaire Willson – American Educational Research Journal, 2008
The Parent Education Profile (PEP) is an instrument used by family literacy programs to rate parents' support for children's literacy development. This article uses Critical Discourse Analysis to examine how the PEP constructs the ideal parent, the text's underlying assumptions about parenting and education, and its ideological effects. The…
Descriptors: Outcomes of Education, Parent Participation, Parent Education, Child Rearing
Fernet, Claude; Senecal, Caroline; Guay, Frederic; Marsh, Herbert; Dowson, Martin – Journal of Career Assessment, 2008
The authors developed and validated a measure of teachers' motivation toward specific work tasks: The Work Tasks Motivation Scale for Teachers (WTMST). The WTMST is designed to assess five motivational constructs toward six work tasks (e.g., class preparation, teaching). The authors conducted a preliminary (n = 42) and a main study among…
Descriptors: Multitrait Multimethod Techniques, Measures (Individuals), Secondary School Teachers, Teacher Motivation

Peer reviewed
Direct link
