AŞAMALI TEPKİ MODELİ VE KLASİK TEST KURAMI ALTINDA ELDE EDİLEN TEST VE MADDE PARAMETRELERİNİN KARŞILAŞTIRILMASI

İbrahim Alper Köse

doi:10.17240/aibuefd.2015.15.2-5000161319

İnceleme Makalesi

COMPARISON OF TEST AND ITEM PARAMETERS UNDER GRADED RESPONSE MODEL (IRT) AND CLASSICAL TEST THEORY

Yıl 2015, Cilt: 15 Sayı: 2, 184 - 197, 21.12.2015

İbrahim Alper Köse

https://doi.org/10.17240/aibuefd.2015.15.2-5000161319

Cited By: 6

Öz

The primary objective of the present research was to investigate the test and item parameters of Constructivist Learnin Envirement Scale (CLES) under graded response model (GRM) and classical test theory (CTT) and submit the best model which was fitted to real data. The results from a traditionally reliability analysis and a eigenvalue plot indicated that CLES scale satisfied the unidimensionality assumption and reliable. The mean response of CLES scale items was 3,39 which reflected the fact that the items tended to have fairly moderate difficulty values overall. Regarding the item discrimination, there were 10 items with low discrimination values for CTT because their item dicsrimination values are less than 0,60 and .there was 3 items with a very low level for GRM analysis. Item discrimination parameters for each item ranged between 0,68 and 1,81. It should be noticed that item discriminations can be considered as generally adequate and efective at distinquishing among respondents based on their estimated trait levels. The discriminations from the graded response model and classical test theory analysis correlated highly (r=.96) with each other. As a result, it can be colcluded that item and test parameters were highly correlated and researchers could use two methods if their assumpions were met and limitations were considerd.

Anahtar Kelimeler

Graded Response Model, Item Response Theory, Classical test Theory, Item Parameter Estimation

Kaynakça

Arkün, S. & Aşkar, P. (2010). The development of a scale on assessing constructivist learning environments. HU Journal of Education, 39, 32-43
Baker, F.B. (1985). The basics of item response theory. Portsmouth, NH:Heineman
Bobcock, B.G.E.(2009).Estimating a Noncompensatory IRT Model Using a modified Metropolis algorithm. Unpublished Doctoral Dissertation.The University of Minesota.
Bock, R. D.,Thissen, D. & Zimowski, M.F. (1997). IRT estimation of domain scores. Journal of Educational Measurement, 34, 197-211
Chow, P. & Winzer, M.M. (1992). Reliability and validity of a scale measuring attitudes toward mainstreaming. Educational and Psychological Measurement, 52, 223-228.
De Ayala, R.J., Dodd, B.G. & Koch, W.R. (1989). Acomparison of the graded response and partial credit models for assessing writing ability. Paper Presented at the Anuual Meeting of the National Council on Measurement in Education, San Francisco, CA.
Demars, C. (2010). Item response theory. Understanding statistics measurement. Oxford University Press.
Embretson, S.E. & Reise, S.P. (2000). Item Response Theory for Psychologists. Mahwah, NJ:Erlbaum.
Gray-Little, B., Williams, V.S.L. & Hancock, T.D. (1997). An item response theory analysis of the Rosenberg Self Esteem Scale. Personality and Social Psychology Bulletin, 23, 443-451.
Hambleton, R.K. & Swaminathan, H. (1985). Item Response Theory. Kluwer-Nijhoff Publishing. Boston-USA.
Hambleton, R.K. & Swaminathan, H. (1989). Item Response Theory. Principles And Applications. Kluwer-Nijhoff Publishing. Boston-USA.
Hambleton, R.K., Swaminathan, H.& Rogers, H.J. (1991). Fundamentals of Item Response Theory. Sage Publications, London.
LaHuis, D.M., Clark, P. & O’Brien, E. (2011). An examination of item response theory item fit indices fort he graded response model. Organizational Reseach Methods, 14(1), 10-23.
Lane, S., Stone, C.A., Ankenmann, R.D. & Liu, M. (1995). Examination of the assumptions and properties of the graded item response model: An example using a mathematics perfromance assessment. Applied Measurement in Education, 8(4), 313-340.
Lee, K.H. (1995). Application of the graded response model to the revised Tennessee self-concept scale: Unidimensionality, parameter invariance, and differential item functioning. Unpublished Doctoral Dissertation. University of Southern California.
Keith, P.R. (1983). Application of a graded response model to the assessment of job satisfaction. Unpublished Doctoral Dissertation. University of Illinois.
Koch, W.R. (1983). Likert scaling using the graded response latent trait model. Applied Psychological Measurement, 7(1), 15-32.
Madera, E.K. (2003). Application of the graded response model to the assessment of student attitudes. Unpublished Doctoral Dissertation. University of Toronto.
Marie, L.A. (1997). The application of item response theory to employee attitude survey. Unpublished Doctoral Dissertation. University of Connecticut.
Mielenz, T.J., Edwards, M.C. & Callahan, L.F. (2010). Item response theory analysis of two questionnaire measures of arthritis-related self efficacy beliefs from community based US samples. Hindawi Publishing Corporation Arthritis.
Muraki, E. (1990). Fitting a polytomous item response model to Likert type data. Applied Psychological Measurement, 14(1), 59-71.
Ostini, R. & Nering, M.L. (2006). Polytomous Item Response Theory Models. Sage Publications, Inc. California, USA.
Reise, S.P. & Yu, J. (1990). Parameter recovery in the graded response model using MULTİLOG. Journal of Educational Measurement, 27 (2), 133-144.
Roberts, J.S. & Laughlin, J.E. (1996). The graded unfolding model: A unidimensional item response model for unfolding graded responses. Research Report, Educational Testing Service, Princeton.
Rubio, V.J., Aguado, D., Hontangas, P.M. & Hernandez, J.M. (2007). Psychometric properties of an emotional adjustment measure. European Journal of Psychological Assessment, 23 (1), 39-46.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric monograph No. 17). Richmond, VA: Psychometric society. Retrieved from http://www.psychometrika.org/journal/online/MN17.pdf
Seungho Yang, M. A. (2007). A Comparison of Unidimensional and Multidimensional RaschModels Using Parametrer Estimates and Fit İndices When Assumption of Unidimensionality is Violated. Unpublished Doctoral Dissertation. The Ohio State University
Sijstma, K. &Hemker, B.T. (2000). A taxonomyof IRT models for ordering persons anditems using simple sum scores. Journal of Educational and Behavioral Statistics, 25 (4), 391-415.
Sijstma, K. & Junker, B.W. (2006). Item response theory: Past performance, present developments and future expectatitons. Behaviormetrika, 33 (1), 75-102.
Stone, C.A., Akenmann, R.D. & Liu, M. (1995). Examination of the assumptions and properties of the graded item response model: An example using a mathematics perfromance assessment. Applied Measurement in Education, 8(4), 313-340.
Sukirno, A. & Sununta, S. (2010). The comparison of graded response model and classical test theory in human resource research: A model fitness test. Research and Practice in Human Resource Management, 18(2), 77-86.
Tezbaşaran, A. & Kelecioğlu, H. (2004). Madde-ölçek korelasyonlarına, alt-üst grup ortalamalarına ve aşamalı tepki modeline göre geliştirilen sigaraya ilişkin tutum ölçeğinin madde ve ölçek özelliklerinin incelenmesi. XIII. Ulusal Eğitim Bilimleri Kurultayı, İnönü Üniversitesi, Malatya.
Thissen, D. (1988). MULTILOG (Computer program). Mooresville, IN: Scientific Software.
Uttaro, T. & Lehman, A. (1999). Graded response modeling of the Quality of Life Interview. Evaluation and Program Planning, 22(1999), 41-52
Wu, M. & Adams, R. (2006). Modelling mathematics problem solving item responses using a multidimensional IRT model. Mathematics Education Research Journal, 18(2), 93-113.

AŞAMALI TEPKİ MODELİ VE KLASİK TEST KURAMI ALTINDA ELDE EDİLEN TEST VE MADDE PARAMETRELERİNİN KARŞILAŞTIRILMASI

Yıl 2015, Cilt: 15 Sayı: 2, 184 - 197, 21.12.2015

İbrahim Alper Köse

https://doi.org/10.17240/aibuefd.2015.15.2-5000161319

Cited By: 6

Öz

Bu çalışmanın temel amacı klasik test kuramı (KTK) ve aşamalı tepki modeli (ATM) altında Yapılandırmacı Öğrenme Ortamları Ölçeğinin kestirilen madde ve test parametrelerinin karşılaştırılarak, araştırmacılara hangi kuram altında kestirilen parametrelerin daha keskin ve güvenilir olduğunu sunmaktır. Geleneksel güvenirlik analizi ve özdeğer grafiği sonuçları veri grubunun tek boyutlu olduğunu ve ölçekten elde edilen sonuçların güvenilir olduğunu göstermektedir. Yapılandırmacı öğrenme ortamları ölçeği ortaokul formunun madde ortalaması 3,39 olarak hesaplanmış ve bu da orta düzeyde madde güçlüğüne karşılık gelmektedir. Madde ayırıcılık parametreleri incelendiğinde KTK altında 10 maddenin düşük ayırıcılığa sahip olduğu, ATM altında ise 3 maddenin düşük ayırıcılığa sahip olduğu ve ayırıcılık değerlerinin 0,61 ile 1,81 arasında değiştiği ortaya konmuştur. Her iki kuram altında madde ayırıcılık parametreleri arasında r=0,96 düzeyinde korelasyon bulunmuştur. Sonuç olarak her iki kuram altında kestirilen madde ve test parametrelerinin yüksek düzeyde ilişkili olduğu, model varsayımların karşılandığı ve sınırlılıkların göz önüne alındığı durumlarda her iki kuramında kullanılabileceği sonucuna ulaşılmıştır.

Anahtar Kelimeler

Aşamalı tepki modeli, madde tepki kuramı, klasik test kuramı, Madde parametre kestirimi

Kaynakça

Arkün, S. & Aşkar, P. (2010). The development of a scale on assessing constructivist learning environments. HU Journal of Education, 39, 32-43
Baker, F.B. (1985). The basics of item response theory. Portsmouth, NH:Heineman
Bobcock, B.G.E.(2009).Estimating a Noncompensatory IRT Model Using a modified Metropolis algorithm. Unpublished Doctoral Dissertation.The University of Minesota.
Bock, R. D.,Thissen, D. & Zimowski, M.F. (1997). IRT estimation of domain scores. Journal of Educational Measurement, 34, 197-211
Chow, P. & Winzer, M.M. (1992). Reliability and validity of a scale measuring attitudes toward mainstreaming. Educational and Psychological Measurement, 52, 223-228.
De Ayala, R.J., Dodd, B.G. & Koch, W.R. (1989). Acomparison of the graded response and partial credit models for assessing writing ability. Paper Presented at the Anuual Meeting of the National Council on Measurement in Education, San Francisco, CA.
Demars, C. (2010). Item response theory. Understanding statistics measurement. Oxford University Press.
Embretson, S.E. & Reise, S.P. (2000). Item Response Theory for Psychologists. Mahwah, NJ:Erlbaum.
Gray-Little, B., Williams, V.S.L. & Hancock, T.D. (1997). An item response theory analysis of the Rosenberg Self Esteem Scale. Personality and Social Psychology Bulletin, 23, 443-451.
Hambleton, R.K. & Swaminathan, H. (1985). Item Response Theory. Kluwer-Nijhoff Publishing. Boston-USA.
Hambleton, R.K. & Swaminathan, H. (1989). Item Response Theory. Principles And Applications. Kluwer-Nijhoff Publishing. Boston-USA.
Hambleton, R.K., Swaminathan, H.& Rogers, H.J. (1991). Fundamentals of Item Response Theory. Sage Publications, London.
LaHuis, D.M., Clark, P. & O’Brien, E. (2011). An examination of item response theory item fit indices fort he graded response model. Organizational Reseach Methods, 14(1), 10-23.
Lane, S., Stone, C.A., Ankenmann, R.D. & Liu, M. (1995). Examination of the assumptions and properties of the graded item response model: An example using a mathematics perfromance assessment. Applied Measurement in Education, 8(4), 313-340.
Lee, K.H. (1995). Application of the graded response model to the revised Tennessee self-concept scale: Unidimensionality, parameter invariance, and differential item functioning. Unpublished Doctoral Dissertation. University of Southern California.
Keith, P.R. (1983). Application of a graded response model to the assessment of job satisfaction. Unpublished Doctoral Dissertation. University of Illinois.
Koch, W.R. (1983). Likert scaling using the graded response latent trait model. Applied Psychological Measurement, 7(1), 15-32.
Madera, E.K. (2003). Application of the graded response model to the assessment of student attitudes. Unpublished Doctoral Dissertation. University of Toronto.
Marie, L.A. (1997). The application of item response theory to employee attitude survey. Unpublished Doctoral Dissertation. University of Connecticut.
Mielenz, T.J., Edwards, M.C. & Callahan, L.F. (2010). Item response theory analysis of two questionnaire measures of arthritis-related self efficacy beliefs from community based US samples. Hindawi Publishing Corporation Arthritis.
Muraki, E. (1990). Fitting a polytomous item response model to Likert type data. Applied Psychological Measurement, 14(1), 59-71.
Ostini, R. & Nering, M.L. (2006). Polytomous Item Response Theory Models. Sage Publications, Inc. California, USA.
Reise, S.P. & Yu, J. (1990). Parameter recovery in the graded response model using MULTİLOG. Journal of Educational Measurement, 27 (2), 133-144.
Roberts, J.S. & Laughlin, J.E. (1996). The graded unfolding model: A unidimensional item response model for unfolding graded responses. Research Report, Educational Testing Service, Princeton.
Rubio, V.J., Aguado, D., Hontangas, P.M. & Hernandez, J.M. (2007). Psychometric properties of an emotional adjustment measure. European Journal of Psychological Assessment, 23 (1), 39-46.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric monograph No. 17). Richmond, VA: Psychometric society. Retrieved from http://www.psychometrika.org/journal/online/MN17.pdf
Seungho Yang, M. A. (2007). A Comparison of Unidimensional and Multidimensional RaschModels Using Parametrer Estimates and Fit İndices When Assumption of Unidimensionality is Violated. Unpublished Doctoral Dissertation. The Ohio State University
Sijstma, K. &Hemker, B.T. (2000). A taxonomyof IRT models for ordering persons anditems using simple sum scores. Journal of Educational and Behavioral Statistics, 25 (4), 391-415.
Sijstma, K. & Junker, B.W. (2006). Item response theory: Past performance, present developments and future expectatitons. Behaviormetrika, 33 (1), 75-102.
Stone, C.A., Akenmann, R.D. & Liu, M. (1995). Examination of the assumptions and properties of the graded item response model: An example using a mathematics perfromance assessment. Applied Measurement in Education, 8(4), 313-340.
Sukirno, A. & Sununta, S. (2010). The comparison of graded response model and classical test theory in human resource research: A model fitness test. Research and Practice in Human Resource Management, 18(2), 77-86.
Tezbaşaran, A. & Kelecioğlu, H. (2004). Madde-ölçek korelasyonlarına, alt-üst grup ortalamalarına ve aşamalı tepki modeline göre geliştirilen sigaraya ilişkin tutum ölçeğinin madde ve ölçek özelliklerinin incelenmesi. XIII. Ulusal Eğitim Bilimleri Kurultayı, İnönü Üniversitesi, Malatya.
Thissen, D. (1988). MULTILOG (Computer program). Mooresville, IN: Scientific Software.
Uttaro, T. & Lehman, A. (1999). Graded response modeling of the Quality of Life Interview. Evaluation and Program Planning, 22(1999), 41-52
Wu, M. & Adams, R. (2006). Modelling mathematics problem solving item responses using a multidimensional IRT model. Mathematics Education Research Journal, 18(2), 93-113.

Toplam 35 adet kaynakça vardır.

Ayrıntılar

Birincil Dil	Türkçe
Bölüm	Makaleler
Yazarlar	İbrahim Alper Köse
Yayımlanma Tarihi	21 Aralık 2015
Gönderilme Tarihi	21 Aralık 2015
Yayımlandığı Sayı	Yıl 2015 Cilt: 15 Sayı: 2

Kaynak Göster

APA	Köse, İ. A. (2015). AŞAMALI TEPKİ MODELİ VE KLASİK TEST KURAMI ALTINDA ELDE EDİLEN TEST VE MADDE PARAMETRELERİNİN KARŞILAŞTIRILMASI. Abant İzzet Baysal Üniversitesi Eğitim Fakültesi Dergisi, 15(2), 184-197. https://doi.org/10.17240/aibuefd.2015.15.2-5000161319

Cited By

Investigation of education value perception scale's psychometric properties according to CTT and IRT

International Journal of Assessment Tools in Education

https://doi.org/10.21449/ijate.986530

STUDENT DIFFICULTIES IN SOLVING PROBLEMS CONCERNING SPECIAL RELATIVITY AND POSSIBLE REASONS FOR THESE DIFFICULTIES

Journal of Baltic Science Education

Zafer Tanel

https://doi.org/10.33225/jbse/14.13.573

Aynı Dereceli Puanlama Kategorilerini Kullanan Maddelerin Psikometrik Özelliklerini Değerlendirmede Çok Kategorili Madde Tepki Kuramı Modellerinin Karşılaştırılması

Abant İzzet Baysal Üniversitesi Eğitim Fakültesi Dergisi

COMPARISON OF TEST AND ITEM PARAMETERS UNDER GRADED RESPONSE MODEL (IRT) AND CLASSICAL TEST THEORY

Öz

Anahtar Kelimeler

Kaynakça

AŞAMALI TEPKİ MODELİ VE KLASİK TEST KURAMI ALTINDA ELDE EDİLEN TEST VE MADDE PARAMETRELERİNİN KARŞILAŞTIRILMASI

Öz

Anahtar Kelimeler

Kaynakça

Ayrıntılar

Kaynak Göster

Cited By

Investigation of education value perception scale's psychometric properties according to CTT and IRT

International Journal of Assessment Tools in Education

https://doi.org/10.21449/ijate.986530

STUDENT DIFFICULTIES IN SOLVING PROBLEMS CONCERNING SPECIAL RELATIVITY AND POSSIBLE REASONS FOR THESE DIFFICULTIES

Journal of Baltic Science Education

Zafer Tanel

https://doi.org/10.33225/jbse/14.13.573

Aynı Dereceli Puanlama Kategorilerini Kullanan Maddelerin Psikometrik Özelliklerini Değerlendirmede Çok Kategorili Madde Tepki Kuramı Modellerinin Karşılaştırılması

Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi

Esra SÖZER

https://doi.org/10.21031/epod.819927

Examination of the Reliability of the Measurements Regarding the Written Expression Skills According to Different Test Theories

Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi

Merve Yıldırım Seheryeli

https://doi.org/10.21031/epod.559470

Development of a “Perceived Stress Scale" Based on Classical Test Theory and Graded Response Model

International Journal of Assessment Tools in Education

Metin Yaşar

https://doi.org/10.21449/ijate.626053

Sendikaya Bağlılık Ölçeği Test ve Madde Puanlarının Madde Tepki Kuramına Dayalı Aşamalı Tepki Modeli ile İncelenmesi

Anadolu Üniversitesi Sosyal Bilimler Dergisi

Dr. Öğr. Üyesi Erol Karaca

https://doi.org/10.18037/ausbd.550234