•  
  •  
 

Jurnal Penelitian dan Evaluasi Pendidikan

Keywords

content validity; empirical validity; performance assessment; scientific literacy; STEM

Document Type

Article

Abstract

Penelitian ini merupakan bagian dari penelitian pengembangan asesmen kinerja literasi sains berbasis STEM pada pembelajaran fisika. Tujuan dari penelitian ini adalah untuk mengungkapkan validitas isi, validitas empiris, dan reliabilitas instrumen asesmen kinerja literasi sains berbasis STEM yang sebelumnya telah disusun. Instrumen yang dikembangkan berupa lembar pengamatan dan tes pilihan ganda. Analisis validitas isi dari lembar pengamatan menggunakan Koefisien V oleh Aiken sedangkan validitas isi instrumen tes dianalisis dengan menggunakan CVI (Content Validity Index) oleh Lawshe. Validitas empiris reliabilitas instrumen tes diestimasi dengan IRT (Item Response Theory). Reliabilitas lembar pengamatan ditentukan dengan ICC (Item Correlation Coefficient). Hasil dari penelitian ini menunjukan bahwa (1) Lembar pengamatan berupa rubrik penskoran dan penilaian diri terbuktivalid dengan koefisien V Aiken 0,75 dan reliabel dengan koefisien Reliabilitas Alfa > 0,8 dan ICC yang Excellent. (2) Instrumen tes terbukti realiabel untuk digunakan pada peserta didik dengan kategori sedang sampai dengan tinggi (-0,7 sampai dengan 6,7 ) dengan CVI=1 dan INFIT MNSQ sesuai model Rasch. Berdasarkan hasil penelitian tersebut maka asesmen kinerja Literasi Sains berbasis STEMlayak digunakan.

Kata kunci:validitas isi, validitas empiris, asesmen kinerja, literasi sains, STEM

VALIDITY AND RELIABILITY INSTRUMENT OF SCIENTIFIC LITERACY PERFORMANCE ASSESSMENT IN PHYSICS TEACHING BASED ON STEM

Abstract

This research is part of the development of scientific literacy performance assessment based on STEM in teaching physics. The aim of this research is to reveal the validity (content and also empiric) and reliability of scientific literacy performance assessment instrument based on STEM. The kind of instruments were developed are observational sheet and multiple choice test. The content validity of observational sheet was revealed by used the Aiken's V Coefficient. The content validity of multiple choice tests was revealed by used Content Validity Index (CVI) which proposed by Lawshe. The empirical validity and reliability of multiple choice tests was revealed by used Item Response Theory Analysis. The reliability of observational sheet was revealed by used ICC (Item Correlation Coefficient) Analysis. The results of this study are the validity from the contents and empirical trials from the developed instruments. The observation sheet from scoring rubric and self-assessment has been valid with Aiken's V value that exceeds the standard of 0,75. The reliability of the scoring rubric has Alfa Reliability> 0.8 and Excellent of ICC. Validity values from The written test is shown with CVI of 1 and the MNSQ INFIT value which match to the Rasch model. Based on the TIC and SEM graphs, the written test is stated to be reliable for use in students with moderate to high categories (-0.7 to 6.7). STEM-based Science Literacy performance assessment with caloric material is appropriate to use.

Keywords: content validity, empirical validity, performance assessment, scientific literacy, STEM

First Page

219

Last Page

230

Issue

2

Volume

22

Digital Object Identifier (DOI)

10.21831/pep.v22i2.19590

References

Aiken, L. R. (1985). Three Coefficients for Analyzing the Reliability and Validity of Ratings. Educational and Psychological Measurement, 45(1), 131–142. https://doi.org/10.1177/0013164485451012

Azwar, S. (2012). Reliabiltas dan validitas(4th ed.). Yogyakarta: Pustaka Pelajar.

Azwar, S. (2015). Metode penelitian. Yogyakarta: Pustaka Pelajar.

Badan Standar Nasional Pendidikan. (2006). Panduan penyusunan kurikulum tingkat satuan pendidikan jenjang pendidikan dasar dan menengah. Jakarta: BSNP. Retrieved from http://bsnp-indonesia.org/id/wp-content/uploads/kompetensi/Panduan_Umum_KTSP.pdf

Bashooir, K., & Supahar, S. (2016). Analisis aspek kinerja literasi sainspada materi kalor Fisika. UPEJ Unnes Physics Education Journal, 5(1). Retrieved from https://journal.unnes.ac.id/sju/index.php/upej/article/view/12711

Breiner, J. M., Harkness, S. S., Johnson, C. C., & Koehler, C. M. (2012). What is STEM? A discussion about conceptions of STEM in education and partnerships. School Science and Mathematics, 112(1), 3–11. https://doi.org/10.1111/j.1949-8594.2011.00109.x

Chiappetta, E. L., & Koballa, T. R. (2010). Science instruction in the middle and secondary schools: developing fundamental knowledge and skills (7th ed.). USA: Pearson Education, Inc.

Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological Assessment, 6(4), 284–290. https://doi.org/10.1037/1040-3590.6.4.284

Departemen Pendidikan dan Kebudayaan. (2001). Kamus besar bahasa Indonesia (3rd ed.). Jakarta: Balai Pustaka.

Department of Education. (2009). Report of the STEM review. Retrieved from https://www.education-ni.gov.uk/sites/default/files/publications/de/Report of the STEM Review 2009_1.PDF

Gonzalez, H. B., & Kuenzi, J. J. (2012). Science, technology, engineering, and mathematics (STEM) education: a primer. Retrieved from https://fas.org/sgp/crs/misc/R42642.pdf

Hernandez, P. R., Bodin, R., Elliott, J. W., Ibrahim, B., Rambo-Hernandez, K. E., Chen, T. W., & de Miranda, M. A. (2014). Connecting the STEM dots: measuring the effect of an integrated engineering design intervention. International Journal of Technology and Design Education, 24(1), 107–120. https://doi.org/10.1007/s10798-013-9241-0

Ismail, I., Permanasari, A., & Setiawan, W. (2016). Efektivitas virtual lab berbasis STEM dalam meningkatkan literasi sains siswa dengan perbedaan gender. Jurnal Inovasi Pendidikan IPA, 2(2), 190. https://doi.org/10.21831/jipi.v2i2.8570

Kartowagiran, B., & Jaedun, A. (2016). Model asesmen autentik untuk menilai hasil belajar siswa sekolah menengah pertama (SMP): implementasi asesmen autentik di SMP. Jurnal Penelitian Dan Evaluasi Pendidikan, 20(2), 131. https://doi.org/10.21831/pep.v20i2.10063

Lawshe, C. H. (1975). A quantitative approach to content validity. Personnel Psychology, 28(4), 563–575. https://doi.org/10.1111/j.1744-6570.1975.tb01393.x

Mardapi, D. (2012). Pengukuran, penilaian dan evaluasi pendidikan. Yogyakarta: Nuha Medika.OECD. (2014). PISA 2012 results: what students know and can do student performance in mathematics, reading and science volume I. Paris: OECD Publishing.

P21. (2009). 21st century skills map.Retrieved from http://www.p21.org/storage/documents/21st_century_skills_english_map.pdf

Presiden Republik Indonesia. Undang-Undang Republik Indonesia nomor 20 tahun 2003 tentang Sistem Pendidikan Nasional (2003). Indonesia.Reeve, E. M. (2013). Implementing science, technology, mathematics, and engineering (STEM) education in Thailand and in ASEAN. Retrieved from http://dpst-apply.ipst.ac.th/specialproject/images/IPST_Global/document/Implementing STEM in ASEAN -IPST May 7 2013 -Final.pdf

Retnawati, H. (2016). Validitas reliabilitas dan karakteristik butir. Yogyakarta: Parama Publishing.

Shrout, P. E., & Fleiss, J. L. (1979). Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin, 86(2), 420–428. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/18839484

Streiner, D. L. (2003). Starting at the beginning: an introduction to coefficient alpha and internal consistency. Journal of Personality Assessment, 80(1), 99–103. https://doi.org/10.1207/S15327752JPA8001_18

Subali, B., & Suyata, P. (2012). Pengembangan item tes konvergen dan divergen: penyelidikan validitasnya Secara empiris. Yogyakarta: Diandra Pustaka Indonesia.

Sumintono, B., & Widhiarso, W. (2015). Aplikasi pemodelan RASCH pada assessment pendidikan. Cimahi: Tim Komunikata Publishing House.

Supahar, & Prasetyo, Z. K. (2015). Pengembangan instrumen penilaian kinerja kemampuan inkuiri peserta didik pada mata pelajaran fisika SMA. Jurnal Penelitian Dan Evaluasi Pendidikan, 19(1), 96–108. Retrieved from https://journal.uny.ac.id/index.php/jpep/article/view/4560

Supahar, S. (2014). The estimation of inquiry performance test items of high school physics subject with quest program. In International Conference on Research, Implementation And Education of Mathematics And Sciences. Yogyakarta: Yogyakarta State University.

Supahar, S. (2015). Applying content validity ratios (CVR) to the quantitative content validity of physics learning achievement tests. In International Conference on Research, Implementation And Education of Mathematics And Sciences. Yogyakarta: Yogyakarta State University.

Wagner, T. (2008). The global achievement gap. New York: Basic Book.

Yore, L. D., & Treagust, D. F. (2006). Current realities and future possibilities: language and science literacy—empowering research and informing instruction. International Journal of Science Education, 28(2–3), 291–314. https://doi.org/10.1080/09500690500336973

Share

COinS