SlideShare uma empresa Scribd logo
1 de 21
VALIDITY, RELIABILITY & PRACTICALITY Prof. Jonathan Magdalena
QUALITIES OF MEASUREMENT DEVICES Validity Does it measure what it is supposed to measure? Reliability How representative is the measurement? Objectivity Do independent scorers agree? Practicality Is it easy to construct, administer, score and interpret?
VALIDITY Validity refers to whether or not a test measures what it intends to measure.  A test with high validity has items closely linked to the test’s intended focus. A test with poor validity does not measure the content and competencies it ought to.
VALIDITY - Kinds of Validity “Content”: related to  objectives and their sampling. “Construct”: referring to the theory underlying the target. “Criterion”: related to concrete criteria in the real world. It can be concurrent or predictive. “Concurrent”: correlating high with another measure  already validated.  “Predictive”: Capable of anticipating some later measure.  “Face”: related to the test overall appearance.
1. CONTENT VALIDITY Content validity refers to the connections between the test items and the subject-related tasks. The test should evaluate only the content related to the field of study in a manner sufficiently representative, relevant,  and comprehensible.
2. CONSTRUCT VALIDITY It implies using the construct (concepts, ideas, notions) in accordance to the state of the art in the field. Construct validity seeks agreement between updated subject-matter theories and the specific measuring components of the test.   For example, a test of intelligence nowadays must include measures of multiple intelligences, rather than just logical-mathematical and linguistic ability measures.
3. CRITERION-RELATED VALIDITY Also referred to as instrumental validity, it is used to demonstrate the accuracy of a measure or procedure by comparing it with another process or method which has been demonstrated to be valid.  For example, imagine a hands-on driving test has been  proved to be an accurate test of driving skills.  A written test can be validated by using a criterion related strategy in which the hands-on driving test is compared to it.
4. CONCURRENT VALIDITY Concurrent validity uses statistical methods of correlation to  other measures.  Examinees who are known to be either masters or non-masters on the content measured are identified before the test is administered. Once the tests have been scored, the relationship between the examinees’ status as either masters or non-masters and their performance (i.e., pass or fail) is estimated based on the test.
5. PREDICTIVE VALIDITY Predictive validity estimates the relationship of test scores to an examinee's future performance as a master or non-master. Predictive validity considers the question, "How well does the test predict examinees' future status as masters or non-masters?"  For this type of validity, the correlation that is computed is based on the test results and the examinee’s  later performance. This type of validity is especially useful for test purposes such as selection or admissions.
6. FACE VALIDITY Face validity is determined by a review of the items and not through the use of statistical analyses. Unlike content validity, face validity is not investigated through formal procedures. Instead, anyone who looks over the test, including examinees, may develop an informal opinion as to whether or not the test is measuring what it is supposed to measure.
QUALITIES OF MEASUREMENT DEVICES Validity Does it measure what it is supposed to measure? Reliability How representative is the measurement? Objectivity Do independent scorers agree? Practicality Is it easy to construct, administer, score and interpret?
RELIABILITY Reliability is the extent to which an experiment, test, or any measuring procedure shows the same result on repeated trials.  For researchers, four key types of reliability are:
RELIABILITY “Equivalency”: related to the co-occurrence of two items. “Stability”: related to time consistency. “Internal”: related to the instruments. “Interrater”: related to the examiners’ criterion.
1. EQUIVALENCY RELIABILITY  Equivalency reliability is the extent to which two items measure identical concepts at an identical level of difficulty. Equivalency reliability is determined by relating two sets of test scores to one another to highlight the degree of relationship or association.
2. STABILITY RELIABILITY  Stability reliability (sometimes called test, re-test reliability) is the agreement of measuring instruments over time. To determine stability, a measure or test is repeated on the same subjects at a future date. Results are compared and correlated with the initial test to give a measure of stability. Instruments with a high stability reliability are thermometers, compasses, measuring cups, etc.
3. INTERNAL CONSISTENCY  Internal consistency is the extent to which tests or procedures assess the same characteristic, skill or quality. It is a measure of the precision between the measuring instruments used in a study. This type of reliability often helps researchers interpret data and predict the value of scores and the limits of the relationship among variables.
4. INTERRATER RELIABILITY  Interraterreliability is the extent to which two or more individuals (coders or raters) agree.  For example, when two or more teachers use a rating scale with which they are rating the students’ oral responses in an  interview (1 being most negative, 5 being most positive). If one researcher gives a "1" to a student response, while another researcher gives a "5," obviously the interrater reliability would be inconsistent.
SOURCES OF ERROR Examinee (is a human being) Examiner (is a human being) Examination (is designed by and for human beings)
RELATIONSHIP BETWEEN VALIDITY & RELIABILITY Validity and reliability are closely related. A test cannot be considered valid unless the measurements resulting from it are reliable. Likewise, results from a test can be reliable and not necessarily valid.
BACKWASH EFFECT Backwash (also known as washback) effect is related to the potentially positive and negative effects of test design and content on the form and content of English language training courseware.
THANKS

Mais conteúdo relacionado

Mais procurados (20)

Validity and Reliability
Validity and Reliability Validity and Reliability
Validity and Reliability
 
01 validity and its type
01 validity and its type01 validity and its type
01 validity and its type
 
Validity and Reliability
Validity and ReliabilityValidity and Reliability
Validity and Reliability
 
Types of test
Types of testTypes of test
Types of test
 
How to improve test reliability
How to improve test reliabilityHow to improve test reliability
How to improve test reliability
 
Content validity
Content validityContent validity
Content validity
 
Validity & reliability
Validity & reliabilityValidity & reliability
Validity & reliability
 
Construct Validity
Construct ValidityConstruct Validity
Construct Validity
 
Practicality of a test
Practicality of a testPracticality of a test
Practicality of a test
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good test
 
Test validity
Test validityTest validity
Test validity
 
Reliability types
Reliability typesReliability types
Reliability types
 
Validity, reliability and feasibility
Validity, reliability and feasibilityValidity, reliability and feasibility
Validity, reliability and feasibility
 
Validity of test
Validity of testValidity of test
Validity of test
 
Presentation validity
Presentation validityPresentation validity
Presentation validity
 
Test methods in Language Testing
Test methods in Language TestingTest methods in Language Testing
Test methods in Language Testing
 
Test development
Test developmentTest development
Test development
 
validity and reliability
validity and reliabilityvalidity and reliability
validity and reliability
 
Reliability bachman 1990 chapter 6
Reliability bachman 1990 chapter 6Reliability bachman 1990 chapter 6
Reliability bachman 1990 chapter 6
 
Monitoring and assessment
Monitoring and assessmentMonitoring and assessment
Monitoring and assessment
 

Destaque

Validity & reliability an interesting powerpoint slide i created
Validity & reliability  an interesting powerpoint slide i createdValidity & reliability  an interesting powerpoint slide i created
Validity & reliability an interesting powerpoint slide i createdSze Kai
 
Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Maheen Iftikhar
 
1 Reliability and Validity in Physical Therapy Tests
1  Reliability and Validity in Physical Therapy Tests1  Reliability and Validity in Physical Therapy Tests
1 Reliability and Validity in Physical Therapy Testsaebrahim123
 
Scaling and Measurement techniques
Scaling and Measurement techniquesScaling and Measurement techniques
Scaling and Measurement techniquesJignesh Kariya
 
eeMba ii rm unit-3.1 measurement & scaling a
eeMba ii rm unit-3.1 measurement & scaling aeeMba ii rm unit-3.1 measurement & scaling a
eeMba ii rm unit-3.1 measurement & scaling aRai University
 
Questionnaire design and validation
Questionnaire design and validationQuestionnaire design and validation
Questionnaire design and validationKarim Elghanam
 
Measurement in Marketing Research
Measurement in Marketing ResearchMeasurement in Marketing Research
Measurement in Marketing ResearchShameem Ali
 
A" Research Methods Reliability and validity
A" Research Methods Reliability and validityA" Research Methods Reliability and validity
A" Research Methods Reliability and validityJill Jan
 
validity its types and importance
validity its types and importancevalidity its types and importance
validity its types and importanceIerine Joy Caserial
 
Research methodology & Biostatistics
Research methodology & Biostatistics  Research methodology & Biostatistics
Research methodology & Biostatistics Kusum Gaur
 

Destaque (11)

Validity & reliability an interesting powerpoint slide i created
Validity & reliability  an interesting powerpoint slide i createdValidity & reliability  an interesting powerpoint slide i created
Validity & reliability an interesting powerpoint slide i created
 
Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Validity, its types, measurement & factors.
Validity, its types, measurement & factors.
 
1 Reliability and Validity in Physical Therapy Tests
1  Reliability and Validity in Physical Therapy Tests1  Reliability and Validity in Physical Therapy Tests
1 Reliability and Validity in Physical Therapy Tests
 
Scaling and Measurement techniques
Scaling and Measurement techniquesScaling and Measurement techniques
Scaling and Measurement techniques
 
eeMba ii rm unit-3.1 measurement & scaling a
eeMba ii rm unit-3.1 measurement & scaling aeeMba ii rm unit-3.1 measurement & scaling a
eeMba ii rm unit-3.1 measurement & scaling a
 
Questionnaire design and validation
Questionnaire design and validationQuestionnaire design and validation
Questionnaire design and validation
 
Measurement in Marketing Research
Measurement in Marketing ResearchMeasurement in Marketing Research
Measurement in Marketing Research
 
Reliability and validity
Reliability and validityReliability and validity
Reliability and validity
 
A" Research Methods Reliability and validity
A" Research Methods Reliability and validityA" Research Methods Reliability and validity
A" Research Methods Reliability and validity
 
validity its types and importance
validity its types and importancevalidity its types and importance
validity its types and importance
 
Research methodology & Biostatistics
Research methodology & Biostatistics  Research methodology & Biostatistics
Research methodology & Biostatistics
 

Semelhante a Validity, reliability & practicality

Test characteristics
Test characteristicsTest characteristics
Test characteristicsSamcruz5
 
Module-14-1-Characterstics of a good test-Reliability,Validity....pdf
Module-14-1-Characterstics of a good test-Reliability,Validity....pdfModule-14-1-Characterstics of a good test-Reliability,Validity....pdf
Module-14-1-Characterstics of a good test-Reliability,Validity....pdfVikramjit Singh
 
Validity and objectivity of tests
Validity and objectivity of testsValidity and objectivity of tests
Validity and objectivity of testsbushra mushtaq
 
Validity & reliability seminar
Validity & reliability seminarValidity & reliability seminar
Validity & reliability seminarmrikara185
 
Validity of a Research Tool
Validity of a Research ToolValidity of a Research Tool
Validity of a Research TooljobyVarghese22
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYJoydeep Singh
 
VALIDITY
VALIDITYVALIDITY
VALIDITYANCYBS
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Vadher Ankita
 
Validity in psychological testing
Validity in psychological testingValidity in psychological testing
Validity in psychological testingMilen Ramos
 
Research Methodology 7
Research Methodology   7Research Methodology   7
Research Methodology 7ayat_ismail
 
Educ 243 final report pepito
Educ 243 final report pepitoEduc 243 final report pepito
Educ 243 final report pepitodeped
 
Characteristics of Good Evaluation Instrument
Characteristics of Good Evaluation InstrumentCharacteristics of Good Evaluation Instrument
Characteristics of Good Evaluation InstrumentSuresh Babu
 
UNIT6.pptx PowerPoint slide of chemostrt
UNIT6.pptx PowerPoint slide of chemostrtUNIT6.pptx PowerPoint slide of chemostrt
UNIT6.pptx PowerPoint slide of chemostrtjannattar14
 
Areen Ashraf.Validity and its types university of education faisalabad
Areen Ashraf.Validity and its types university of education faisalabadAreen Ashraf.Validity and its types university of education faisalabad
Areen Ashraf.Validity and its types university of education faisalabadaliceella25970
 
Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Linejan
 

Semelhante a Validity, reliability & practicality (20)

Test characteristics
Test characteristicsTest characteristics
Test characteristics
 
Module-14-1-Characterstics of a good test-Reliability,Validity....pdf
Module-14-1-Characterstics of a good test-Reliability,Validity....pdfModule-14-1-Characterstics of a good test-Reliability,Validity....pdf
Module-14-1-Characterstics of a good test-Reliability,Validity....pdf
 
Rep
RepRep
Rep
 
Validity and objectivity of tests
Validity and objectivity of testsValidity and objectivity of tests
Validity and objectivity of tests
 
Validity & reliability seminar
Validity & reliability seminarValidity & reliability seminar
Validity & reliability seminar
 
Validity of a Research Tool
Validity of a Research ToolValidity of a Research Tool
Validity of a Research Tool
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITY
 
VALIDITY
VALIDITYVALIDITY
VALIDITY
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.
 
Validity in psychological testing
Validity in psychological testingValidity in psychological testing
Validity in psychological testing
 
Qualities of good evaluation tool (1)
Qualities of good evaluation  tool (1)Qualities of good evaluation  tool (1)
Qualities of good evaluation tool (1)
 
Validity
ValidityValidity
Validity
 
Research Methodology 7
Research Methodology   7Research Methodology   7
Research Methodology 7
 
EM&E.pptx
EM&E.pptxEM&E.pptx
EM&E.pptx
 
Educ 243 final report pepito
Educ 243 final report pepitoEduc 243 final report pepito
Educ 243 final report pepito
 
Characteristics of Good Evaluation Instrument
Characteristics of Good Evaluation InstrumentCharacteristics of Good Evaluation Instrument
Characteristics of Good Evaluation Instrument
 
UNIT6.pptx PowerPoint slide of chemostrt
UNIT6.pptx PowerPoint slide of chemostrtUNIT6.pptx PowerPoint slide of chemostrt
UNIT6.pptx PowerPoint slide of chemostrt
 
Areen Ashraf.Validity and its types university of education faisalabad
Areen Ashraf.Validity and its types university of education faisalabadAreen Ashraf.Validity and its types university of education faisalabad
Areen Ashraf.Validity and its types university of education faisalabad
 
Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Report - Reliability & validity
Louzel Report - Reliability & validity
 
Validity and reliablity
Validity and reliablityValidity and reliablity
Validity and reliablity
 

Mais de Samcruz5

On the Thesis Statement and the Five-Paragraph Essay
On the Thesis Statement and the Five-Paragraph EssayOn the Thesis Statement and the Five-Paragraph Essay
On the Thesis Statement and the Five-Paragraph EssaySamcruz5
 
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...Samcruz5
 
Needs analysis stare-dad
Needs analysis stare-dadNeeds analysis stare-dad
Needs analysis stare-dadSamcruz5
 
Needs Analysis: Where ESP courses start
Needs Analysis: Where ESP courses startNeeds Analysis: Where ESP courses start
Needs Analysis: Where ESP courses startSamcruz5
 
Natural approach (1)
Natural approach (1)Natural approach (1)
Natural approach (1)Samcruz5
 
Communicative language teaching_(clt)2
Communicative language teaching_(clt)2Communicative language teaching_(clt)2
Communicative language teaching_(clt)2Samcruz5
 
Presentation didactic 2
Presentation didactic 2Presentation didactic 2
Presentation didactic 2Samcruz5
 
Presentation Didactics
Presentation DidacticsPresentation Didactics
Presentation DidacticsSamcruz5
 
Lesson Planning
Lesson PlanningLesson Planning
Lesson PlanningSamcruz5
 
Reading as a Skill
Reading as a SkillReading as a Skill
Reading as a SkillSamcruz5
 
Testing Listening and Reading
Testing Listening and ReadingTesting Listening and Reading
Testing Listening and ReadingSamcruz5
 
Testing Speaking and Writing
Testing Speaking and WritingTesting Speaking and Writing
Testing Speaking and WritingSamcruz5
 
Testing Pronunciation
Testing PronunciationTesting Pronunciation
Testing PronunciationSamcruz5
 
Testing Vocabulary
Testing VocabularyTesting Vocabulary
Testing VocabularySamcruz5
 
Testing Grammar
Testing GrammarTesting Grammar
Testing GrammarSamcruz5
 
Testing pronunciation
Testing pronunciationTesting pronunciation
Testing pronunciationSamcruz5
 
Communicative testing 1
Communicative testing 1Communicative testing 1
Communicative testing 1Samcruz5
 
Communicative Testing
Communicative TestingCommunicative Testing
Communicative TestingSamcruz5
 
Communicative Testing
Communicative TestingCommunicative Testing
Communicative TestingSamcruz5
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammarSamcruz5
 

Mais de Samcruz5 (20)

On the Thesis Statement and the Five-Paragraph Essay
On the Thesis Statement and the Five-Paragraph EssayOn the Thesis Statement and the Five-Paragraph Essay
On the Thesis Statement and the Five-Paragraph Essay
 
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...
A Pedagogical Approach to Wars: The Study of India and Pakistan as EFL Teachi...
 
Needs analysis stare-dad
Needs analysis stare-dadNeeds analysis stare-dad
Needs analysis stare-dad
 
Needs Analysis: Where ESP courses start
Needs Analysis: Where ESP courses startNeeds Analysis: Where ESP courses start
Needs Analysis: Where ESP courses start
 
Natural approach (1)
Natural approach (1)Natural approach (1)
Natural approach (1)
 
Communicative language teaching_(clt)2
Communicative language teaching_(clt)2Communicative language teaching_(clt)2
Communicative language teaching_(clt)2
 
Presentation didactic 2
Presentation didactic 2Presentation didactic 2
Presentation didactic 2
 
Presentation Didactics
Presentation DidacticsPresentation Didactics
Presentation Didactics
 
Lesson Planning
Lesson PlanningLesson Planning
Lesson Planning
 
Reading as a Skill
Reading as a SkillReading as a Skill
Reading as a Skill
 
Testing Listening and Reading
Testing Listening and ReadingTesting Listening and Reading
Testing Listening and Reading
 
Testing Speaking and Writing
Testing Speaking and WritingTesting Speaking and Writing
Testing Speaking and Writing
 
Testing Pronunciation
Testing PronunciationTesting Pronunciation
Testing Pronunciation
 
Testing Vocabulary
Testing VocabularyTesting Vocabulary
Testing Vocabulary
 
Testing Grammar
Testing GrammarTesting Grammar
Testing Grammar
 
Testing pronunciation
Testing pronunciationTesting pronunciation
Testing pronunciation
 
Communicative testing 1
Communicative testing 1Communicative testing 1
Communicative testing 1
 
Communicative Testing
Communicative TestingCommunicative Testing
Communicative Testing
 
Communicative Testing
Communicative TestingCommunicative Testing
Communicative Testing
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammar
 

Último

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Último (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

Validity, reliability & practicality

  • 1. VALIDITY, RELIABILITY & PRACTICALITY Prof. Jonathan Magdalena
  • 2. QUALITIES OF MEASUREMENT DEVICES Validity Does it measure what it is supposed to measure? Reliability How representative is the measurement? Objectivity Do independent scorers agree? Practicality Is it easy to construct, administer, score and interpret?
  • 3. VALIDITY Validity refers to whether or not a test measures what it intends to measure. A test with high validity has items closely linked to the test’s intended focus. A test with poor validity does not measure the content and competencies it ought to.
  • 4. VALIDITY - Kinds of Validity “Content”: related to objectives and their sampling. “Construct”: referring to the theory underlying the target. “Criterion”: related to concrete criteria in the real world. It can be concurrent or predictive. “Concurrent”: correlating high with another measure already validated. “Predictive”: Capable of anticipating some later measure. “Face”: related to the test overall appearance.
  • 5. 1. CONTENT VALIDITY Content validity refers to the connections between the test items and the subject-related tasks. The test should evaluate only the content related to the field of study in a manner sufficiently representative, relevant, and comprehensible.
  • 6. 2. CONSTRUCT VALIDITY It implies using the construct (concepts, ideas, notions) in accordance to the state of the art in the field. Construct validity seeks agreement between updated subject-matter theories and the specific measuring components of the test. For example, a test of intelligence nowadays must include measures of multiple intelligences, rather than just logical-mathematical and linguistic ability measures.
  • 7. 3. CRITERION-RELATED VALIDITY Also referred to as instrumental validity, it is used to demonstrate the accuracy of a measure or procedure by comparing it with another process or method which has been demonstrated to be valid. For example, imagine a hands-on driving test has been proved to be an accurate test of driving skills. A written test can be validated by using a criterion related strategy in which the hands-on driving test is compared to it.
  • 8. 4. CONCURRENT VALIDITY Concurrent validity uses statistical methods of correlation to other measures. Examinees who are known to be either masters or non-masters on the content measured are identified before the test is administered. Once the tests have been scored, the relationship between the examinees’ status as either masters or non-masters and their performance (i.e., pass or fail) is estimated based on the test.
  • 9. 5. PREDICTIVE VALIDITY Predictive validity estimates the relationship of test scores to an examinee's future performance as a master or non-master. Predictive validity considers the question, "How well does the test predict examinees' future status as masters or non-masters?" For this type of validity, the correlation that is computed is based on the test results and the examinee’s later performance. This type of validity is especially useful for test purposes such as selection or admissions.
  • 10. 6. FACE VALIDITY Face validity is determined by a review of the items and not through the use of statistical analyses. Unlike content validity, face validity is not investigated through formal procedures. Instead, anyone who looks over the test, including examinees, may develop an informal opinion as to whether or not the test is measuring what it is supposed to measure.
  • 11. QUALITIES OF MEASUREMENT DEVICES Validity Does it measure what it is supposed to measure? Reliability How representative is the measurement? Objectivity Do independent scorers agree? Practicality Is it easy to construct, administer, score and interpret?
  • 12. RELIABILITY Reliability is the extent to which an experiment, test, or any measuring procedure shows the same result on repeated trials. For researchers, four key types of reliability are:
  • 13. RELIABILITY “Equivalency”: related to the co-occurrence of two items. “Stability”: related to time consistency. “Internal”: related to the instruments. “Interrater”: related to the examiners’ criterion.
  • 14. 1. EQUIVALENCY RELIABILITY Equivalency reliability is the extent to which two items measure identical concepts at an identical level of difficulty. Equivalency reliability is determined by relating two sets of test scores to one another to highlight the degree of relationship or association.
  • 15. 2. STABILITY RELIABILITY Stability reliability (sometimes called test, re-test reliability) is the agreement of measuring instruments over time. To determine stability, a measure or test is repeated on the same subjects at a future date. Results are compared and correlated with the initial test to give a measure of stability. Instruments with a high stability reliability are thermometers, compasses, measuring cups, etc.
  • 16. 3. INTERNAL CONSISTENCY Internal consistency is the extent to which tests or procedures assess the same characteristic, skill or quality. It is a measure of the precision between the measuring instruments used in a study. This type of reliability often helps researchers interpret data and predict the value of scores and the limits of the relationship among variables.
  • 17. 4. INTERRATER RELIABILITY Interraterreliability is the extent to which two or more individuals (coders or raters) agree. For example, when two or more teachers use a rating scale with which they are rating the students’ oral responses in an interview (1 being most negative, 5 being most positive). If one researcher gives a "1" to a student response, while another researcher gives a "5," obviously the interrater reliability would be inconsistent.
  • 18. SOURCES OF ERROR Examinee (is a human being) Examiner (is a human being) Examination (is designed by and for human beings)
  • 19. RELATIONSHIP BETWEEN VALIDITY & RELIABILITY Validity and reliability are closely related. A test cannot be considered valid unless the measurements resulting from it are reliable. Likewise, results from a test can be reliable and not necessarily valid.
  • 20. BACKWASH EFFECT Backwash (also known as washback) effect is related to the potentially positive and negative effects of test design and content on the form and content of English language training courseware.