Glossary of Assessment Terms (pdf version)
| A-B | C-D | E-F | G-H | I-J | K-L | M-N | O-P | Q-R | S-T | U-V | X-Z |
Accountability
Accountability
is often viewed as an important factor in
education reform. An assessment system connected
to accountability can help identify the needs
of schools so that resources can be equitably
distributed. In this context, accountability
assessment can include such indicators as
equity, competency of teaching staff, physical
infrastructure, curriculum, class size, instructional
methods, existence of tracking, number of
higher cost students, dropout rates, and parental
involvement as well as student test scores.
It has been suggested that test scores analyzed
in a disaggregated format can help identify
instructional problems and point to potential
solutions.
[Sourc - http://www.newhorizons.org/strategies/assess/terminology.htm]
Achievement
Test
A standardized test designed to efficiently
measure the amount of knowledge and/or skill
a person has acquired, usually as a result
of classroom instruction. Such testing produces
a statistical profile used as a measurement
to evaluate student learning in comparison
with a standard or norm.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Action
Research
School
and classroom-based studies initiated and
conducted by teachers and other school staff.
Action research involves teachers, aides,
principals, and other school staff as researchers
who systematically reflect on their teaching
or other work and collect data that will answer
their questions. It offers staff an opportunity
to explore issues of interest to them in an
effort to improve classroom instruction and
educational effectiveness. (Source: Bennett,
C.K. "Promoting teacher reflection through
action research: What do teachers think?"
Journal of Staff Development, 1994, 15, 34-38.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Alternative
Assessment
Many educators prefer the description "assessment
alternatives" to describe alternatives
to traditional, Many educators prefer the
description "assessment alternatives"
to describe alternatives to traditional, standardized,
norm- or criterion-referenced traditional
paper and pencil testing. An alternative assessment
might require students to answer an open-ended
question, work out a solution to a problem,
perform a demonstration of a skill, or in
some way produce work rather than select an
answer from choices on a sheet of paper. Portfolios
and instructor observation of students are
also alternative forms of assessment.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Alternative
Assessment applies to any and all assessments
that require students to demonstrate knowledge
and skills in ways other than through the
conventional - methods used within a classroom,
school, or district. (See also Conventional,
or Traditional, Assessment.)
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124 ]
Analytic
Scoring
A type of rubric scoring that separates
the whole into categories of criteria that
are examined one at a time. Student writing,
for example, might be scored on the basis
of grammar, organization, and clarity of ideas.
Useful as a diagnostic tool. An analytic scale
is useful when there are several dimensions
on which the piece of work will be evaluated.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Anchor
An
anchor is a descriptive point on a scale/continuum.
The highest-level anchor is called the exemplar.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Anchor
Test
1.
A common set of items administered with each
of two or more different forms of a test for
the purpose of equating the scores obtained
on these forms. 2. "Anchor papers provide
a connection between a rubric narrative and
student writing, and an example of what writing
at a certain score of the rubric should look
like." (From: the REEP Writing Assessment
Trainer's Manual.) (see also Benchmark tasks;
Benchmarking.)
[Source - http://www.sabes.org/assessment/glossary.htm]
Aptitude
Test
A
test intended to measure the test-taker's
innate ability to learn, given before receiving
instruction.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Assessment
The Latin root assidere means to sit beside.
In an educational context, the process of
observing learning; describing, collecting,
recording, scoring, and interpreting information
about a student's or one's own learning. At
its most useful, assessment is an episode
in the learning process; part of reflection
and autobiographical understanding of progress.
Traditionally, student assessments are used
to determine placement, promotion, graduation,
or retention.
In
the context of institutional accountability,
assessments are undertaken to determine the
principal's performance, effectiveness of
schools, etc. In the context of school reform,
assessment is an essential tool for evaluating
the effectiveness of changes in the teaching-learning
process.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Assessment
is a process of gathering information to meet
a broad range of evaluation needs.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Assessment
Literacy
The possession of knowledge about the
basic principals of sound assessment practice,
including terminology, the development and
use of assessment methodologies and techniques,
familiarity with standards of quality in assessment.
Increasingly, familiarity with alternatives
to traditional measurements of learning.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Assessment
Task
An illustrative task or performance opportunity
that closely targets defined instructional
aims, allowing students to demonstrate their
progress and capabilities.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Authentic
Assessment
Evaluating by asking for the behavior the
learning is intended to produce. The concept
of model, practice, feedback in which students
know what excellent performance is and are
guided to practice an entire concept rather
than bits and pieces in preparation for eventual
understanding. A variety of techniques can
be employed in authentic assessment.
The goal of authentic assessment is to gather evidence that students can use knowledge effectively and be able to critique their own efforts. Authentic tests can be viewed as "assessments of enablement," in Robert Glaser's words, ideally mirroring and measuring student performance in a "real-world" context. Tasks used in authentic assessment are meaningful and valuable, and are part of the learning process.
Authentic
assessment can take place at any point in
the learning process. Authentic assessment
implies that tests are central experiences
in the learning process, and that assessment
takes place repeatedly. Patterns of success
and failure are observed as learners use knowledge
and skills in slightly ambiguous situations
that allow the assessor to observe the student
applying knowledge and skills in new situations
over time.
[Source -
http://www.newhorizons.org/strategies/assess/terminology.htm]
Authentic
assessment engages students in applying knowledge
and skills in the same way they are used in
the "real world" outside school.
It is performance-based assessment that requires
a student to go beyond basic recall and demonstrate
significant, worthwhile knowledge and understanding
through a product, perfor- mance, or exhibition.
The assessment comprises an authentic task
and a scoring rubric that are tied to an outcome
or "big idea" and are made clear
to the students up front.
[Source -
http://www.essentialschools.org/cs/resources/view/ces_res/124]
Benchmark
Student
performance standards (the level(s) of student
competence in a content area.)
An actual measurement of group performance against an established standard at defined points along the path toward the standard. Subsequent measurements of group performance use the benchmarks to measure progress toward achievement.
Examples
of student achievement that illustrate points
on a performance scale, used as exemplars.
(See Descriptor, Cohort.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
A
detailed description of a specific level of
student performance expected of students at
particular ages, grades, or development levels.
Benchmarks are often represented by samples
of student work. A set of benchmarks can be
used as "checkpoints" to monitor
progress toward meeting performance goals
within and across grade levels. In ABE, SPLs
(Student Performance Levels) are examples
of benchmarks; targets for instruction.
[Source - http://www.sabes.org/assessment/glossary.htm]
A
benchmark translates the standard into what
the student should know and be able to do
at developmentally appropriate levels. Benchmarks
are models that teachers, parents, and students
can refer to when designing, implementing,
and assessing student outcomes.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Cohort
A
group whose progress is followed by means
of measurements at different points in time.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Compentency
Test
A
test intended to establish that a student
has met established minimum standards of skills
and knowledge and is thus eligible for promotion,
graduation, certification, or other official
acknowledgment of achievement.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Concept
An
abstract, general notion -- a heading that
characterizes a set of behaviors and beliefs.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Constructed
Response Item
An exercise for
which examinees must create their own responses
or products (performance assessment) rather
than choose a response from an enumerated
set (multiple choice).
[Source - http://www.sabes.org/assessment/glossary.htm]
Conventional
or Traditional Assessment
Conventional assessment refers to paper-pencil
testing (multiple-choice, true/false, matching,
short answer) that typically must be completed
within a specific amount of time.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Countable
Outcomes (ACLS)
Results
that can be quantified; all measures of student
outcomes except learning gains, including
executive function skills, and affective-related
measures. Learning gains are gains in speaking,
listening, reading, writing, and numeracy.
Executive function skills include problem-solving,
critical thinking, and metacognition. Affective-related
measures include self-esteem, self confidence,
and interpersonal communication. Examples
of Countable Outcomes include: number of people
who get jobs, number of people who register
to vote, number of people who achieve a GED.
[Source - http://www.sabes.org/assessment/glossary.htm]
Criteria
Criteria
- sometimes called performance standards -
are the qualitative or quantitative statements
used to measure whether the program standard
(competency achievement) has been met. The
nature of the criteria may vaiy depending
on the specific assessment tool being used.
However, for the process skill (competency),
Manage work responsibilities, for example,
one criterion for measuring a student's ability
in that area would be, Gets work done on time.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Criterion-Referenced
Test
A test in which the results can be used
to determine a student's progress toward mastery
of a content area. Performance is compared
to an expected level of mastery in a content
area rather than to other students' scores.
Such tests usually include questions based
on what the student was taught and are designed
to measure the student's mastery of designated
objectives of an instructional program. The
"criterion" is the standard of performance
established as the passing score for the test.
Scores have meaning in terms of what the student
knows or can do, rather than how the test-taker
compares to a reference or norm group. Criterion-referenced
tests can have norms, but comparison to a
norm is not the purpose of the assessment.
Criterion
referenced tests have also been used to provide
information for program evaluation, especially
to track the success or progress of schools
and student populations that have been involved
in change or that are at risk of inequity.
In this case, the tests are not used to compare
teachers, teams or buildings within a district
but rather to give feedback on progress of
groups and individuals.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
An
assessment where an individual's performance
is compared to a specific learning objective
or performance standard and not to the performance
of other students. Criterion-referenced assessment
tells us how well students are performing
on specific goals or standards rather that
just telling how their performance compares
to a norm group of students nationally or
locally. In criterion-referenced assessments,
it is possible that none, or all, of the examinees
will reach a particular goal or performance
standard.
[Source - http://www.sabes.org/assessment/glossary.htm]
Curriculum
Alignment
The degree to which a curriculum's scope
and sequence matches a testing program's evaluation
measures, thus ensuring that teachers will
use successful completion of the test as a
goal of classroom instruction.
[Source -
http://www.newhorizons.org/strategies/assess/terminology.htm]
Curriculum-embedded
or Learning-embedded Assessment
Assessment that occurs simultaneously
with learning such as projects, portfolios
and "exhibitions." Occurs in the
classroom setting, and, if properly designed,
students should not be able to tell whether
they are being taught or assessed. Tasks or
tests are developed from the curriculum or
instructional materials.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Cut
Score
Score used to determine the minimum performance
level needed to pass a competency test. (See
Descriptor for another type of determiner.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Descriptor
A
set of signs used as a scale against which
a performance or product is placed in an evaluation.
An example from Grant Wiggins' Glossary of
Useful Terms Related to Authentic and Performance
Assessments is taken from "the CAP writing
test where a 5 out of a possible 6 is described:
'The student describes the problem adequately
and argues convincingly for at least one solution...without
the continual reader awareness of the writer
of a 6.'"
Descriptors
allow assessment to include clear guidelines
for what is and is not valued in student work.
Wiggins adds that "[t]he word 'descriptor'
reminds us that justifiable value judgments
are made by know how to empirically describe
the traits of work we do and do not value."
(Emphasis this.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Dimension
Aspects
or categories in which performance in a domain
or subject area will be judged. Separate descriptors
or scoring methods may apply to each dimension
of the student's performance assessment.
[Source
- http://www.newhorizons.org/strategies/assess/terminology.htm]
Documentation
Documentation
is a naturalistic assessment process, which
involves recording classroom observations
over time, across learning modalities, and
in coordination with colleagues.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Essay
Test
Test that requires students to answer questions
in writing. Responses can be brief or extensive.
Tests for recall, ability to apply knowledge
of a subject to questions about the subject,
rather than ability to choose the least incorrect
answer from a menu of options.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Evaluation
Both qualitative and quantitative descriptions
of pupil behavior plus value judgments concerning
the desirability of that behavior. Using collected
information (assessments) to make informed
decisions about continued instruction, programs,
activities. Exemplar Model of excellence.
(See Benchmark, Norm, Rubric, Standard.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Formative
Assessment
Observations which allow one to determine
the degree to which students know or are able
to do a given learning task, and which identifies
the part of the task that the student does
not know or is unable to do. Outcomes suggest
future steps for teaching and learning. (See
Summative Assessment.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Assessment
that provides feedback to the teacher for
the purpose of improving instruction.
[Source
- http://www.sabes.org/assessment/glossary.htm]
Grade
Equivalent
Both qualitative and quantitative descriptions
of pupil behavior plus value judgments concerning
the A score that describes student performance
in terms of the statistical performance of
an average student at a given grade level.
A grade equivalent score of 5.5, for example,
might indicate that the student's score is
what could be expected of a average student
doing average work in the fifth month of the
fifth grade. This score allows for a theoretical
or approximate comparison across grades. It
ranges from September of the kindergarten
year (K.O.) to June of the senior year in
high school (12.9) Useful as a ranking score,
grade equivalents are only a theoretical or
approximate comparison across grades. In this
case, it may not indicate what the student
would actually score on a test given to a
midyear fifth grade class.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Grade
Level Equivalent
The school grade level for a given population
for which a given score is the median score
in that population. For example, if a test
was administered during the month of October
to a norming group of sixth grade students
and the median scale score obtained was 475,
then the grade equivalent for a scale score
of 475 on that test would be set at 6.1 -
6 representing Grade 6 and .1 representing
the month of October (September is taken as
the beginning of the school year and equals
to 0).
[Source - http://www.sabes.org/assessment/glossary.htm]
Graphic
Organizer
Graphic organizers are mental maps that help
students make their thinking visible. They
represent the process skills of sequencing,
comparing, contrasting, classifying, inferring,
drawing conclusions, problem solving, and
thinking critically.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
High
Stakes Testing
Any testing program whose results have important
consequences for students, teachers, schools,
and/or districts. Such stakes may include
promotion, certification, graduation, or denial/approval
of services and opportunity. High stakes testing
can corrupt the evaluation process when pressure
to produce rising test scores results in "teaching
to the test" or making tests less complex.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Test
used to provide results that have important,
direct consequences for examinees, programs,
or institutions involved in the testing. For
example, MCAS (K-12) is considered a high-stakes
test because children who do not pass the
examination do not receive a high school diploma,
regardless of their performance in other areas
of their school education.
[Source - http://www.sabes.org/assessment/glossary.htm]
Holistic
Method
In assessment, assigning a single score based
on an overall assessment of performance rather
than by scoring or analyzing dimensions individually.
The product is considered to be more than
the sum of its parts and so the quality of
a final product or performance is evaluated
rather than the process or dimension of performance.
A holistic scoring rubric might combine a
number of elements on a single scale. Focused
holistic scoring may be used to evaluate a
limited portion of a learner's performance.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Indicators
Indicators provide specific examples and explicit
definitions that can be used in rating students'
level of achievement relative to specified
skills, strategies, and knowledge.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
IQ
Test
The first of the standardized norm-referenced
tests, developed during the nineteenth century.
Traditional psychologists believe that neurological
and genetic factors underlie "intelligence"
and that scoring the performance of certain
intellectual tasks can provide assessors with
a measurement of general intelligence. There
is a substantial body of research that suggests
that I.Q. tests measure only certain analytical
skills, missing many areas of human endeavor
considered to be intelligent behavior. I.
Q is considered by some to be fixed or static;
whereas an increasing number of researchers
are finding that intelligence is an ongoing
process that continues to change throughout
life.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Item
Analysis
Analyzing each item on a test to determine
the proportions of students selecting each
answer. Can be used to evaluate student strengths
and weaknesses; may point to problems with
the test's validity and to possible bias.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Journals
Students' personal records and reactions to
various aspects of learning and developing
ideas. A reflective process often found to
consolidate and enhance learning.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Mean
One of several ways of representing a group
with a single, typical score. It is figured
by adding up all the individual scores in
a group and dividing them by the number of
people in the group. Can be affected by extremely
low or high scores.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Measurement
Quantitative description of student learning
and qualitative description of student attitude.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Median
The point on a scale that divides a group
into two equal subgroups. Another way to represent
a group's scores with a single, typical score.
The median is not affected by low or high
scores as is the mean. (See Norm.)
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Metacognition
The knowledge of one's own thinking processes
and strategies, and the ability to consciously
reflect and act on the knowledge of cognition
to modify those processes and strategies.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Multidimensional
Assessment
Assessment that gathers information about
a broad spectrum of abilities and skills (as
in Howard Gardner's theory of Multiple Intelligences.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Multiple-Choice
Test
A test in which students are presented with
a question or an incomplete sentence or idea.
The students are expected to choose the correct
or best answer/completion from a menu of alternatives.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Naturalistic
Assessment
Naturalistic assessment refers to evaluation
rooted in the natural setting of the classroom.
It involves observation of student performances
and behavior in an informal context. Naturalistic
observation is done as students go about their
daily work and is sometimes called kidwatching.
[Source -
http://www.essentialschools.org/cs/resources/view/ces_res/124]
Norm
A distribution of scores obtained from a norm
group. The norm is the midpoint (or median)
of scores or performance of the students in
that group. Fifty percent will score above
and fifty percent below the norm.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Norm
Group
A random group of students selected by a test
developer to take a test to provide a range
of scores and establish the percentiles of
performance for use in establishing scoring
standards.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Norm-Referenced
Test
A test in which a student or a group's performance
is compared to that of a norm group. The student
or group scores will not fall evenly on either
side of the median established by the original
test takers. The results are relative to the
performance of an external group and are designed
to be compared with the norm group providing
a performance standard. Often used to measure
and compare students, schools, districts,
and states on the basis of norm-established
scales of achievement.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
An
objective test that is standardized on a group
of individuals whose performance is evaluated
in relation to the performance of others;
contrasted with criterion-referenced test.
Most standardized achievement tests are referred
to as norm-referenced.
[Source - http://www.sabes.org/assessment/glossary.htm]
Norm
Curve Equivalent
A score that ranges from 1-99, often used
by testers to manipulate data arithmetically.
Used to compare different tests for the same
student or group of students and between different
students on the same test. An NCE is a normalized
test score with a mean of 50 and a standard
deviation of 21.06. NCEs should be used instead
of percentiles for comparative purposes. Required
by many categorical funding agencies, e.g.,
Chapter I or Title I.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Objective
Test
A test for which the scoring procedure is
completely specified enabling agreement among
different scorers. A correct-answer test.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
On-Demand
Assessment
An assessment process that takes place as
a scheduled event outside the normal routine.
An attempt to summarize what students have
learned that is not embedded in classroom
activity.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Outcome
An operationally defined educational goal,
usually a culminating activity, product, or
performance that can be measured.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
The
word outcome is often used interchangeably
with goal, purpose, demonstration of learning,
culmination, and end. Exit outcomes may be
used synonymously with such terms as comperencies.
knowledge, and orientations. Outcomes are
the "end-products" of the entire
instructional process. Outcomes can include
internal changes in the learner or observable
changes. In the Work and Family Resource Guides,
the outcomes are expressed by the intent or
goals of the Work and Family Life Program
described in the introductions to each guide
and by the positive actions students are asked
to take regarding the practical problems that
frame the guides.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
![]()
Percentile
A ranking scale ranging from a low of 1 to
a high of 99 with 50 as the median score.
A percentile rank indicates the percentage
of a reference or norm group obtaining scores
equal to or less than the test-taker's score.
A percentile score does not refer to the percentage
of questions answered correctly, it indicates
the test-taker's standing relative to the
norm group standard.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Performance-Based
Assessment
Direct, systematic observation and rating
of student performance of an educational objective,
often an ongoing observation over a period
of time, and typically involving the creation
of products. The assessment may be a continuing
interaction between teacher and student and
should ideally be part of the learning process.
The assessment should be a real-world performance
with relevance to the student and learning
community. Assessment of the performance is
done using a rubric, or analytic scoring guide
to aid in objectivity. Performance-based assessment
is a test of the ability to apply knowledge
in a real-life setting. Performance of exemplary
tasks in the demonstration of intellectual
ability.
Evaluation of the product of a learning experience
can also be used to evaluate the effectiveness
of teaching methods.
Stiggins
defines performance-based assessment as the
use of performance criteria to determine the
degree to which a student has met an achievement
target. Important elements of performance-based
assessment include clear goals or performance
criteria clearly articulated and communicated
to the learner; the establishment of a sound
sampling that clearly envisions the scope
of an achievement target and the type of learning
that is involved (use of problem-solving skills,
knowledge acquisition, etc.) Attention to
extraneous interference (cultural biases,
language barriers, testing environment, tester
biases) and establishment of a clear purpose
for the data collected during the assessment
before the assessment is undertaken, keeping
in mind the needs of the groups involved (teachers,
students, parents, etc.) (from an article
by Richard J. Stiggins, "The Key to Unlocking
High-Quality Performance Assessments."
Assessment: How Do We Know What They Know?
ASCD, 1992.
[Source -
http://www.newhorizons.org/strategies/assess/terminology.htm]
Performance
assessment is a broad term, encompassing many
of the characteristics of both authentic assessment
and alternative assessment. Generally, performance
assessments provide students with opportunities
to demonstrate their understanding and to
thoughtfully apply knowledge, skills, and
habits of mind in a variety of structured
and unstructured situations. These assessments
often occur over time and result in a tangible
product or observable performance. Terms of
Assessment that may or may not be enduring,
or endearing
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Performance
Criteria
A ranking scale ranging from a low of 1 to
a high of 99 with 50 as the median score.
A percentile rank iThe standards by which
student performance is evaluated. Performance
criteria help assessors maintain objectivity
and provide students with important information
about expectations, giving them a target or
goal to strive for.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
1.
A statement or description of a set of operational
tasks exemplifying a level of performance
associated with a more general content standard;
the statement may be used to guide judgements
about the location of a cut score on a score
scale; the term often implies a desired level
of performance. 2. Explicit definitions of
what students must do to demonstrate proficiency
at a specific level on the content standards;
for example, in Massachusetts' Curriculum
Frameworks in the area of 'reading', there
are six levels for each of four standards.
Under the standard "comprehension",
performance can range from "develop vocabulary"
to "interpret charts & graphs"
to "recognize a variety of genres &
styles."
[Source - http://www.sabes.org/assessment/glossary.htm]
Portfolio
A systematic and organized collection of a
student's work that exhibits to others the
direct evidence of a student's efforts, achievements,
and progress over a period of time. The collection
should involve the student in selection of
its contents, and should include information
about the performance criteria, the rubric
or criteria for judging merit, and evidence
of student self-relection or evaluation. It
should include representative work, providing
a documentation of the learner's performance
and a basis for evaluation of the student's
progress. Portfolios may include a variety
of demonstrations of learning and have been
gathered in the form of a physical collection
of materials, videos, CD-ROMs, reflective
journals, etc.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
portfolio
is a purposeful collection of student work
that exhibits the student's efforts, progress,
and achievement in one or more areas.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Portfolio
Assessment
Portfolios
may be assessed in a variety of ways. Each
piece may be individually scored, or the portfolio
might be assessed merely for the presence
of required pieces, or a holistic scoring
process might be used and an evaluation made
on the basis of an overall impression of the
student's collected work. It is common that
assessors work together to establish consensus
of standards or to ensure greater reliability
in evaluation of student work. Established
criteria are often used by reviewers and students
involved in the process of evaluating progress
and achievement of objectives.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Primary
Trait Method
A
type of rubric scoring constructed to assess
a specific trait, skill, behavior, or format,
or the evaluation of the primary impact of
a learning process on a designated audience.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Process
A
generalizable method of doing something, generally
involving steps or operations which are usually
ordered and/or interdependent. Process can
be evaluated as part of an assessment, as
in the example of evaluating a student's performance
during prewriting exercises leading up to
the final production of an essay or paper.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Process
Assessment
Process
assessment refers to assessing a student's
skills in progressing through a series of
actions or operations. Process skills that
teachers seek to assess relate to thinking
abilities, applications of procedural knowledge,
and interactions with others. Some examples
of process skills are critical thinking, creative
thinking, problem solving, decision making,
goal setting, cooperation, relating to others,
leadership, and management.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Product
The
tangible and stable result of a performance
or task. An assessment is made of student
performance based on evaluation of the product
of a demonstration of learning.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Product/Project
Assessment
Products
and projects are typically assigned to individuals
or groups of students on a topic related to
the curriculum. The project results in a product,
which is assessed. The processes used during
the assessment could also be assessed.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Profile
A
graphic compilation of the performance of
an individual on a series of assessments.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Project
A
complex assignment involving more than one
type of activity and production. Projects
can take a variety of forms, some examples
are a mural construction, a shared service
project, or other collaborative or individual
effort.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Quartile
The
breakdown of an aggregate of percentile rankings
into four categories: the 0-25th percentile,
26-50th percentile, etc.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Quintile
The
breakdown of an aggregate of percentile rankings
into five categories: the 0-20th percentile,
21-40th percentile, etc.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Rating
Scale
A
scale based on descriptive words or phrases
that indicate performance levels. Qualities
of a performance are described (e.g., advanced,
intermediate, novice) in order to designate
a level of achievement. The scale may be used
with rubrics or descriptions of each level
of performance.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Reliability
The
measure of consistency for an assessment instrument.
The instrument should yield similar results
over time with similar populations in similar
circumstances.
[Source
- http://www.newhorizons.org/strategies/assess/terminology.htm]
How
accurately a score will be reproduced if an
individual is measured again. The degree to
which the results of an assessment are dependable
and consistently measure particular student
knowledge and/or skills. Reliability is an
indication of the consistency of scores across
raters, over time, or across different tasks
or items that measure the same thing. Thus,
reliability may be expressed as (a) the relationship
between test items intended to measure the
same skill or knowledge (item reliability),
(b) the relationship between two administrations
of the same test to the same student or students
(test/retest reliability), or (c) the degree
of agreement between two or more raters (rater
reliability). An unreliable assessment cannot
be valid.
[Source - http://www.sabes.org/assessment/glossary.htm]
Rubric
Some of the definitions of rubric are contradictory.
In general a rubric is a scoring guide used
in subjective assessments. A rubric implies
that a rule defining the criteria of an assessment
system is followed in evaluation. A rubric
can be an explicit description of performance
characteristics corresponding to a point on
a rating scale. A scoring rubric makes explicit
expected qualities of performance on a rating
scale or the definition of a single scoring
point on a scale.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Specific
sets of criteria that clearly define for both
student and teacher what a range of acceptable
and unacceptable performance looks like. Criteria
define descriptors of ability at each level
of performance and assign values to each level.
Levels referred to are proficiency levels
which describe a continuum from excellent
to unacceptable product.
[Source - http://www.sabes.org/assessment/glossary.htm]
A
scoring rubric consists of fixed scales related
to a list of criteria describing performance.
Each scale is composed of anchors that describe
the various levels of performance com- plexity.
Assigned weights, which give the relative
value of each criterion, are used in the process
of sumniating scores to ascertain whether
the standard has been met. Rubrics promote
learning by offering clear performance targets
to students for agreed-upon standards. Rubrics
are presented to students along with the performance
task.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Sampling
A
way to obtain information about a large group
by examining a smaller, randomly chosen selection
(the sample) of group members. If the sampling
is conducted correctly, the results will be
representative of the group as a whole. Sampling
may also refer to the choice of smaller tasks
or processes that will be valid for making
inferences about the student's performance
in a larger domain. "Matrix sampling"
asks different groups to take small segments
of a test; the results will reflect the ability
of the larger group on a complete range of
tasks.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Scale
A
classification tool or counting system designed
to indicate and measure the degree to which
an event or behavior has occurred.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Scale
Score
Scores
based on a scale ranging from 001 to 999.
Scale scores are useful in comparing performance
in one subject area across classes, schools,
districts, and other large populations, especially
in monitoring change over time.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Score
A
rating of performance based on a scale or
classification.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Scoring
A
package of guidelines intended for people
scoring performance assessments. May include
instructions for raters, notes on training
raters, rating scales, samples of student
work exemplifying various levels of performance.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Scoring
Criteria
Rules
for assigning a score or the dimensions of
proficiency in performance used to describe
a student's response to a task. May include
rating scales, checklists, answer keys, and
other scoring tools. In a subjective assessment
situation, a rubric.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Self-Assessment
A
process in which a student engages in a systematic
review of a performance, usually for the purpose
of improving future performance. May involve
comparison with a standard, established criteria.
May involve critiquing one's own work or may
be a simple description of the performance.
Reflection, self-evaluation, metacognition,
are related terms.
[Source -
http://www.newhorizons.org/strategies/assess/terminology.htm]
Senior
Project
Extensive
projects planned and carried out during the
senior year of high school as the culmination
of the secondary school experience, senior
projects require higher-level thinking skills,
problem-solving, and creative thinking. They
are often interdisciplinary, and may require
extensive research. Projects culminate in
a presentation of the project to a panel of
people, usually faculty and community mentors,
sometimes students, who evaluate the student's
work at the end of the year.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Self-Assessment
A
process in which a student engages in a systematic
review of a performance, usually for the purpose
of improving future performance. May involve
comparison with a standard, established criteria.
May involve critiquing one's own work or may
be a simple description of the performance.
Reflection, self-evaluation, metacognition,
are related terms.
Content
Standard
Content
standards - also known as discipline standards
- comprise the knowledge and skills specific
to a given discipline. They describe information
and skills essential to the practice or application
of a particular discipline or content domain.
Curriculum
or Program Standard
Curriculum standards - sometimes referred
to as program standards - are best described
as the goals of classroom instruction. They
imply the curricular or instructional activities
that might be used to help students develop
skill and ability within a given content domain.
To a great extent, curriculum standards describe
the instructional means to achieve content
standards.
Performance
Standard
A lifelong learning standard is not specific
to any one discipline and can be used in many
situations throughout a person's lifetime.
This type of standard is not even specific
to academics; it is a skill that can be used
in virtually all aspects of life. For example,
one lifelong learning standard for students
might be, Make and carry out effective plans.
Lifelong
Learning Standard
A
lifelong learning standard is not specific
to any one discipline and can be used in many
situations throughout a person's lifetime.
This type of standard is not even specific
to academics; it is a skill that can be used
in virtually all aspects of life. For example,
one lifelong learning standard for students
might be, Make and carry out effective plans.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Standardized
Test
An
objective test that is given and scored in
a uniform manner. Standardized tests are carefully
constructed and items are selected after trials
for appropriateness and difficulty. Tests
are issued with a manual giving complete guidelines
for administration and scoring. The guidelines
attempt to eliminate extraneous interference
that might influence test results. Scores
are often are often norm-referenced.
[Source -
http://www.newhorizons.org/strategies/assess/terminology.htm]
A
test designed to be given under specified,
standard conditions to obtain a sample of
learner behavior that can be used to make
inferences about the learner's ability. Standardized
testing allows results to be compared statistically
to a standard such as a norm or criteria.
If the test is not administered according
to the standard conditions, the results are
invalid.
[Source - http://www.sabes.org/assessment/glossary.htm]
Standards
Agreed-upon
values used to measure the quality of student
performance, instructional methods, curriculum,
etc.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
Student
Performance Level (SPL)
A
standard description of a student's (ESOL)
language ability at a given level in terms
of speaking, listening, reading, writing,
and the ability to communicate with a native
speaker; a profile of skill levels for a student
can thus be assigned and used for placement,
instructional, or reporting purposes.
[Source
- http://www.sabes.org/assessment/glossary.htm]
Subjective
Test
A
test in which the impression or opinion of
the assessor determines the score or evaluation
of performance. A test in which the answers
cannot be known or prescribed in advance.
[Source
- http://www.newhorizons.org/strategies/assess/terminology.htm]
Summative
Assessment
Evaluation
at the conclusion of a unit or units of instruction
or an activity or plan to determine or judge
student skills and knowledge or effectiveness
of a plan or activity. Outcomes are the culmination
of a teaching/learning process for a unit,
subject, or year's study. (See Formative Assessment.).
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
A
culminating assessment, which gives information
on students' mastery of content, knowledge,
or skills.
[Source - http://www.sabes.org/assessment/glossary.htm]
Task
A
task is a complex activity requiring multiple
responses to a challenging question or problem.
[Source - http://www.essentialschools.org/cs/resources/view/ces_res/124]
Validity
The
test measures the desired performance and
appropriate inferences can be drawn from the
results. The assessment accurately reflects
the learning it was designed to measure.
[Source - http://www.newhorizons.org/strategies/assess/terminology.htm]
The
extent to which an assessment measures what
it is supposed to measure and the extent to
which inferences and actions made on the basis
of test scores are appropriate and accurate.
For example, if a student performs well on
a reading test, how confident are we that
that student is a good reader?A
valid standards-based assessment is aligned
with the standards intended to be measured,
provides an accurate and reliable estimate
of students' performance relative to the standard,
and is fair. An assessment cannot be valid
if it is not reliable.
[Source - http://www.sabes.org/assessment/glossary.htm]


