Research Paper Doctorate 3,787 words

Tests and measurements in educational assessment

Last reviewed: July 11, 2004 ~19 min read

Tests and Measurements

The activities of the classrooms are revealed to the learners, instructors and the institutions through administering normalized tests. They encompass gathered knowledge in reading, mathematics, and English for learners starting from kindergarten up to the high school level. Several instructors detest from organizing and evaluating tests and a majority of the students are scared of appearing in them. But tests are potent didactic instruments which provide at the minimum four tasks. First of all, tests assist in judging pupils and evaluating if they are receiving their education as per our reckoning. Secondly, properly planned tests provide inspiration and assist pupils to systematize their educational pursuits. (Davis, 1993, p.19)

Learners undertake their lessons in manners that show how they imagine their tests will be like. When they anticipate that a test is going to ascertain their knowledge about facts, they will get by heart the particulars; when they are hopeful that a test will need problem solving or assimilating knowledge, the students will stress upon comprehending and using information. Thirdly, tests can assist you in appraising how effectively you are portraying the content. Lastly, tests can strengthen learning by giving pupils with a guide of the subjects or aptitudes they have not so far attained expertise and must pay attention to. In spite of all these advantages, examination is also psychologically stimulating and concerning. Each of the tests endeavors to present correct grading of students and useful data on performances of the students of common educational goals. (Davis, 1993, p.20)

Body:

Among the regular tests that were held during the initial years are the TerraNova CAT, the TerraNova CTBS, the SAT-9, the MAT-8, and the ITBS and these are prevailing ones even today. These tests like the CAT or the Iowa test of Basic Skills (ITBS) are everlasting, created by the business houses are firmly based on psychometric theory. (Burley, 2002, p.44) On the validity aspect of psychometrics, concept maps will possess extremely high construct validity, meaning they can actually gauge an individual's intensity of knowledge in every sphere. A higher level of inter-relational validity are present in them while someone is able to get a different yardstick for correct comprehension, like appreciation of the person's proficiency by several specialists. A majority of the evaluation ranks higher on consistency and efficiency on getting marks, however they accord lower importance on construct validity.

Regular performance tests are normalized with the logic that they are required to be executed, allotted marks, and construed in a usual, prearranged method. In order that the normalized performance examinations present minute judgments, they should be capable to generate enough assortments of scores that a test usually gives. Those test matters that offer the most excellent to the making of the score-stretch are the ones solved rightly by nearly fifty percent of the candidates appearing of the test. To prevent mishandling of the normalized examinations, school districts ought to: Administer tests first of all to analyze learning requirements of every student and reform the programs; Hold examinations on a yearly basis during the autumn semester hence results assist in directing learning plan for pupils; Assign instructors and administrators to assessing the capability and specialized values set up by the professional bodies; Shun the Lake Wobegon Effect, wherein every state asserts that its pupils are more compared to the national average. (Popham, 2002, p. 23)

The Guidance Framework, relevant to the 50's and 60's, stressed the recognition and cultivating the aptitude of the student by way of analysis and segregation. The tests endeavored to gauge the performance and competence. The learning theory inbuilt in this milieu was that the capability of the students establishes the amount and extent of the learning content of students. Schools are held to be slightly influencing the outcome of the students. (Grant, 2001, p.401) Common sense coupled with an idea of two distinctive features of every quality tests, gives the outline that instructors employ to arrive at the most excellent selection while making student evaluation.

An evaluation system, on the basis of the commonsense principles regarding how pupils progress and are induced to better their performances can be taken up by the instructors. A better receptive arrangement built on cooperative response to enhance learning; a system which renders regional work and instructor decision more vital to affirm responsibility; a system put in place to give enticement for school restoration and innate professional development every year for the entire teachers; and an arrangement that will enthuse greater inventive teaching and not greater apprehensive submissive conduct can be grown. (Marsh; Roche, 1997, p.1194)

More than a third of the instructor's professional time period is devoted to the classroom evaluation which is a vital component of education. Teaching and learning are only enhanced through classroom evaluation and beyond this it serves no other objective; the teacher works out the approach and gains from the information she collects. The purpose of classroom evaluation is aimed at collecting information from the pupils regarding their knowledge and what is their feeling about the process of learning. Of the two vital causes for which pupils are judged are to apprise and enhance the outcomes of every student and to inspect or estimate performance of system. The review in the classroom is implicated with the appraisal and enhancement of every student performance and is extremely contextualized to the experiences of the students during learning in particular classrooms. (Shea, 1990, p.36)

The outcome of this type of evaluation normally is not typecast outside the framework of the classroom in issue. But teachers and students get instant and exhaustive response regarding the efficacy of the students in comprehending the theme of the subject. As per Angelo and Cross, evaluation in the classroom functions at the optimum level when the pupils are keenly engrossed in the system of making response policies and when teachers discuss the outcomes with them and answer to their apprehension. In this manner, classroom review can give opinion in an on-going and iterative manner in the gamut of the learning atmosphere. Several diverse classroom evaluation techniques are existent for use of the instructors, every one offering distinct learning chance and response for students. (McGilly, 1994, p.72)

While the instructors choose within these alternatives, they judge which appraisal is ideally suitable for their function and the impact that evaluation might put on student learning. For instance, a science laboratory can be utilized to give the students practical learning skill, besides evaluating the knowledge of the pupils regarding the scientific guidelines associated with the tests. Likewise, a writing effort may be utilized to assess the technical competence of a student in writing an essay or thesis paper, and also her precise awareness of an episode, individual, or matter. (Jacobs; Chase, 1992, p.80)

Evaluation procedures in the classroom can be exercised in each class, every week, or during specific points all through the semester. Whereas terminal assessments bring out student responses which might impact the preference of an instructor in the subsequent occasion when she administers a course, classroom evaluation gives the teacher with the responses she can take action at once. Majority of the classroom evaluation entails tests, which have been made by the teachers. These tests designed by the teachers can even be separated by when they are administered and the manner of using the results. Examination involving writing, and further evaluations which the teachers give continually in their classrooms are the evaluative tests that are preeminently appropriate in directing enhancements in student learning. (McGilly, 1994, p.76)

The instructors rely on the outcome from these evaluations due to their direct relationship to instructional objectives imparted in the classroom. Add to this, the outcomes are instantaneous and simple to examine at the own student level. Instructors grow to be further systematic regarding their conjecture and purpose in respect to student awareness or comprehension. They confirm frequently the extent of the working of their plans for learning of the students. The advantages of classroom evaluation are that instructors turn into extra perceptive spectators of pupils and grow to be more enlightened regarding student learning; students happen to be more involved in the learning procedure and more insightful regarding themselves as learners-they visualize themselves as furnishing to the learning atmosphere; classroom evaluation enhances collaboration among teachers and students and speaks to pupils that teachers are dedicated to their scholastic accomplishment. (Sanders, 2001, p.62)

In order to employ classroom evaluation to create progress, nevertheless, teachers ought to transform their outlook regarding evaluation and their understanding of the outcomes. Particularly, they are required to notice their evaluation as an essential component of the teaching procedure and as vital for assisting students' study. (Shea, 1990, p.41) Tests administered at the end of a chapter or semester or following the commencement of the learning are known as summative tests. The rationale is to evaluate learning and performance and normally influences the rating assigned to the student. The tests that are administered concurrently with the learning are known as formative tests. The objective is to give opinion; hence pupils will be able to regulate the manner of their learning or how it will enable the instructors to adjust with the teaching. Normally these tests do not impact the ratings of the students.

It is expected that 54 tests designed by the teachers is administered annually in a normal year that produces possibly billions of exclusive evaluations, on a yearly basis globally. Irrespective of the correct incidence, the tests made by the teachers themselves are employed often. Besides, instructors attach more importance on the tests made by them in finding out ratings and improvement of students than devised by others, or on separate data sources. Nearly every teacher thinks that they require sound competence in calibration. Whereas several state that they are positive of their capability to deliver convincing and trustworthy tests, others give an account of the intensity of uneasiness with the value of their own tests or consider that their instruction was not sufficient. (Marsh; Roche, 1997, p.1189) superior classroom test is convincing and trustworthy. Legality is the worth of a test that determines what it is supposed to determine. It is the extent to which proof, common sense, or hypothesis favors any explanation or inferences regarding a pupil on the basis of his/her performance in the tests. Clearly stated, it is the manner how someone understands that the student's competence of mathematics is judged through a test in mathematics and never their reading capability. Another feature of the legality of the test, of specific significance for classroom teachers is legality as regards the subject matter. Are the objects of a test largely stands for the items, which could be on the test? Realistic sources for objects which ought to be on the test are objectives given in the class, important conception studied in the class lectures, central ideas, and the like. The classroom instructors those who desire to ensure that they have a suitable test from the perspective of content often make a chart of stipulation which particularly lists what was imparted and the number of items in the test that will encompass these subjects. The table could yet be shared with the pupils to lead them in reading for a test as a summarization of what was most vital in a chapter or subject. (Murray, 1990, p.62)

Dependability is the attribute of a test that generates scores which are not influenced to a greater extent by probability. Pupils at times arbitrarily overlook a question the answer to which they in fact knew, or in several occasion arrive at the correct reply by sheer guesswork; instructors could at times commit mistakes or score erratically in case of tests of subjective nature scores. These are the difficulties of less dependability. The predicament of less dependability can be resolved in certain easy method. First of all, a test containing a lot of items shall generally be more dependable compared to a short duration test, since any arbitrary fluctuations in performance happening during the period of the test will nullify each other throughout a lot of items. In the same vein, assigning a class grade will be much dependable in case it reproduces a lot of diverse tasks or parts. Secondly, greater the objectivity, tests will have lesser random errors while scoring, hence teachers apprehensive on the issue of dependability often support objectively scored tests. Also while using the subjective layout, like deliverables; teachers frequently employ an elaborate heading to show the scoring as objective, and, hence as dependable as probable. (Jacobs; Chase, 1992, p.86)

Classroom tests can also be classified depending on what they propose to assess. Conventional written tests in the classroom tests are optimally employed to assess knowledge. They are characteristically objectively attained. Achievement-based test, also known as reliable or substitute tests are ideally employed to evaluate the proficiency or capability of the students. These are usually subjectively scored. Tests devised to assess knowledge are generally prepared as a collection of independent questions. There are two types of questions: choice items that let the students to choose a right answer from a listing of correct answers. Scoring multiple choice items is normally fast and objective. Scoring supply matters are liable to consume more time and is generally more subjective. At times, instructors choose to employ multiple choice items while they are concerned in assessing elementary, minor stages of comprehension and employ supply items when they are concerned with the enhanced stages of comprehension. (Marsh; Roche, 1997, p.1190)

Tests made by the instructor could even be segregated by the time when they are given and how the outcomes are utilized. The tests which are given after the completion of a chapter or semester or post-learning has started are known as summative tests. The objective of these tests is to evaluate learning and performance and generally impacts the class ratings of a student. Tests can also be administered simultaneously with the learning which is known as formative tests. The objectives of these tests are to give opinions, hence pupils can regulate the pace of their learning and instructors can in turn regulate their teaching procedure. Normally these tests do not influence ratings of the students. (Sanders, 2001, p.62)

The recognized evaluating training teachers often get attention on extensive test administration and normalized test score understanding, somewhat than on the test creation policies or item-writing regulations that the teachers requires. A better teacher prepared test must adhere to suitable item-writing regulations, but observed learning setting up the soundness of item-writing regulation are few in number and often questionable, and, item-writing regulations are based mainly on common sense and the traditional knowledge of test specialists. Although psychometric and research have been in existence since the last 50 years, Cronbach lamented in 1985 that total deficiency of academic interest given to achievement test items. (Shea, 1990, p.54)

It was Haladyna and Downing, who twenty years after Cronbach's caveat reaffirmed this statement that the idea regarding choice selection item writing, for instance, was even reasonably restricted and later on after a decade, went on to add that item writing is still mainly an ingenious act. The present experimental study for item-writing principles concentrates on studies that look at the correlation between a specified item design and either test performance or psychometric principles of the test linked to the choice of the format. Certain procedures are provided by experimental or semi-experimental designs, but the basis of ideal practices in these sphere remnants, basically, only suggestions of the specialists. (Shea, 1990, p.57)

In spite of the significance of the evaluation education at the present moment, a handful of teachers get sufficient formal training in evaluation plan or scrutiny. A current review illustrated. For instance, that less than 50% states need proficiency in evaluation for holding a teacher's license. In the absence of definite training, instructors depend a great deal on the evaluation presented by the publisher of their textbooks or study materials. In case no appropriate evaluations are obtainable, teachers create their personal makeshift evaluations in a disorganized manner with questions and essay add-ons identical to those prepared by their instructors. They consider assessments as instruments of evaluation to oversee while instructional procedures are ended and to utilize mainly for giving ratings to the students. It is a fact that majority of the state certification systems and 50% of every teacher education possess no evaluation course necessity or just an express statement that teachers have been imparted training in assessment (Sanders, 2001, p.63)

Apart from that, instructors have in the past got a meager or did not receive any training or assistance after certification. To utilize evaluations to enhance education and learning of the students, teachers are required to modify their methodologies to evaluation. Evaluations going on in the classrooms which provide significant basis of information do not astonish students. But, these evaluations reveal the ideas and competencies which the teachers underlined in the class, coupled with the instructor's obvious norms for measuring the performance of the students. These ideas, competencies and norms go in tune along with the teacher's instructional actions and, perfectly, with state or district norms. Students visualize these evaluations as reasonable procedures of vital learning objectives. Teachers assist learning by giving students with significant advice on their learning growth and through way of assisting; they discover problems in learning (Sanders, 2001, p.63)

The finest classroom evaluation also provide significant information sources for teachers, assisting them find out the areas they taught in a better manner and the areas they have to concentrate more. Collecting this data does not need an important statistical analysis of evaluation results. Instructors are required to prepare a plain computation of the number of students not attending each of the assessment items or were unsuccessful to satisfy a particular norm. While reviewing these results, the instructor ought to primarily think about the quality of the item or norm. In case teachers locate no real difficulties with the item or norms, in that case they ought to shift their mind to their teaching. Examining evaluation outcomes in this manner implies keeping away a few prevailing ego concerns. In fact, teachers and students are jointly accountable in the process of learning. In spite with the most intrepid teaching exercises, we cannot assure every student will learn all the things superbly. (Murray, 1990, p.12)

You’re 84% through this paper. Sign up to read the full paper.

Sign Up Now — Instant Access Already a member? Log in
130,000+ paper examples AI writing assistant Citation generator Cancel anytime
Cite This Paper
PaperDue. (2004). Tests and measurements in educational assessment. PaperDue. https://www.paperdue.com/essay/tests-and-measurements-174907

Always verify citation format against your institution’s current style guide requirements.