Reliability of Test
Reliability is defined by Joppe (2002,p.1) as the level of consistency of the obtained results over a period of time as well as an accurate representation of the population under study. If the outcome of the study can be reproduced using a similar methodology then the instrument used in the research are said to be reliable.
It is worth noticing that there is an element of replicability as well as repeatability ff the observations or results. The work of Kirk and Miller (1986,41-42) indicated that there exists three different types of validity in any given quantitative research. These however, all relate to; the extent to which the give measure if repeated, remains constant, the stability of the given measure over a period of time as well as the similarity of the given measurements in a given time period. The work of Charles (1995) focuses on the idea of consistency with which a given test item is answered. The test-retest method is a type of reliability test. The attribute of a given instrument that is tested for reliability is called stability. A stable measure would produce similar results. A high level of stability is indicative of a very high level of instrument reliability. This indicates that the results are measurable. There is a problem with the test-retest method as pointed out by Joppe (2000).The problem would ultimately make the test to become unreliable to a certain degree. Joppe (2000) explained that the test-retest technique may lead to the sensitization of the respondents on the specific subject matter and thereby influencing their responses. Reliability therefore refers to the level of consistency of a given measure. In psychology for example, if a test is aimed at testing a trait such as introversion, then every time it is administered to a given subject then the results obtained should be approximately similar. The downside is that it is never easy to calculate reliability precisely. Ways of approximating it however, are in existence (Cherry, n.d)
Types of reliability
There are several types of reliability (PTI,2006, Cherry, n.d).They are as follows;
Test-Retest reliability
In this type of a reliability test, the r est is administered twice at two distinct points in time (Cherry, n.d).This reliability test assumes that there is never going to be a change in the construct or quality being measured. It is generally employed for things that are usually stable over a period of time like intelligence.
Inter-rater Reliability
This form of reliability is undertaken by having two judges who are both independent, score the test. The scores that are obtained are then compared critically in order to determine the level of consistency of the rates' estimates. A technique of testing the inter-rater reliability is to score items based on a 1-10 scale. The next process is the calculation of the correlation that exists between the two scores so as to determine the degree of inter-rater reliability.
Parallel-Forms Reliability
This form of reliability is determined by comparing the various (different tests) that were originally created using similar content. This is achieved by creating a large set of test items that are aimed at measuring a similar quality and then dividing (randomly) these items into two tests that are separate.
Internal Consistency Reliability
This type of reliability is employed in the judgment of the consistency of results that are obtained across items that are based on the same test. It basically involves the comparison of the test items that are integral in the measurement of the same construct so as to determine the internal consistency of the tests.
II. Validity (Test Validity)
The work of Joppe (2000) provided an explanation of validity as a determination of whether the given research really measures whatever it is intended to measure as well as the level of truthfulness of the results. Wainer and Braun (1998) on the other hand referred to validity as "construct validity."
Validity in the context of psychology has been extensively been discussed by Tebes (2000).The work of Cook and Campbell (1979) identified four major types of validity; internal validity, statistical conclusion validity, external validity and construct validity. The internal validity makes inference regarding causal relationships in cases involving two articles. The statistical conclusion validity makes inferences regarding the covariations that exist between two variables. External validity involves the generalization to settings, times and other persons. Construct validity involves generalization on theoretical relationship between cause and effect.
Forms of validity
Face validity
Cherry (n, d) defined...
Our semester plans gives you unlimited, unrestricted access to our entire library of resources —writing tools, guides, example essays, tutorials, class notes, and more.
Get Started Now