Clustering senior high school students’ numeracy score in Java using the K-means algorithm

Authors

DOI:

https://doi.org/10.24071/icre.v1i1.27

Keywords:

clustering, Davies-Bouldin Index, K-Means algorithm, national assessment, numeracy

Abstract

Numeracy skill is a key ability that supports critical thinking and is assessed in Indonesia through the Minimum Competency Assessment. This study aims to (1) analyze high school students' numeracy skills in Java based on four numeracy domains and three cognitive levels; and (2) cluster students’ numeracy skills using the K-Means algorithm with the Davies-Bouldin Index (DBI) as the evaluation metric. The data were obtained from the education data portal for the year 2021–2023. The process went through several stages: data cleaning, normalization, feature selection, clustering, and validation using the Davies–Bouldin Index (DBI). The results indicate a steady improvement in students’ numeracy performance, with average scores increasing from 51.18 in 2021, 54.08 in 2022, and 59.19 in 2023. The clustering process resulted in four clusters in 2021 and 2022 and two clusters in 2023. The clustering results revealed four distinct groups in 2021 and 2022, representing high, moderate, fluctuating, and low numeracy performance across regions. In 2023, these merged into two clusters, indicating reduced disparity and greater uniformity in students’ numeracy achievement across Java. Furthermore, it provides insights for educators and policymakers to design targeted interventions to improve students' numeracy skills and reduce learning gaps across regions.

Downloads

Download data is not yet available.

References

Aditya, A., Jovian, I., & Sari, B. N. (2020). Implementasi K-Means clustering ujian nasional sekolah menengah pertama di Indonesia tahun 2018/2019. Jurnal Media Informatika Budidarma, 4(1), 51-58. https://doi.org/10.30865/mib.v4i1.1784

Ahmed, M., Seraj, R., & Islam, S. M. S. (2020). The K-Means algorithm: A comprehensive survey and performance evaluation. Electronics, 9(8), 1295-1316. https://doi.org/10.3390/electronics9081295

Anderha, R. R., & Maskar, S. (2021). Pengaruh kemampuan numerasi dalam menyelesaikan masalah matematika terhadap prestasi belajar mahasiswa pendidikan matematika. Jurnal Ilmiah Matematika Realistik (JI-MR), 2(1), 1-10. https://doi.org/10.33365/ji-mr.v2i1.774

Ashari, I. F., Banjarnahor, R., Farida, D. R., Aisyah, S. P., Dewi, A. P., & Humaya, N. (2022). Application of data mining with the K-Means clustering method and davies bouldin index for grouping IMDB movies. Journal of Applied Informatics and Computing (JAIC), 6(1), 7-15. https://doi.org/10.30871/jaic.v6i1.3485

Faizah, N. M., Surohman, Fabrianto, L., Hendra, & Prasetyo, R. (2020). Unbalanced data clustering with K-Means and euclidean distance algorithm approach case study population and refugee data. Journal of Physics: Conference Series, 1477(2), Article 022005. https://doi.org/10.1088/1742-6596/1477/2/022005

Fajriyah, E. (2022). Kemampuan Literasi Numerasi Siswa pada Pembelajaran Matematika di Abad 21. Prosiding Seminar Nasional Pendidikan, 4, 403-409. https://prosiding.unma.ac.id/index.php/semnasfkip/article/view/824

Farissa, R. A., Mayasari, R., & Umaidah, Y. (2021). Perbandingan algoritma K-Means dan K-Medoids untuk pengelompokkan data obat dengan silhouette coefficient di Puskesmas Karangsambung. Journal of Applied Informatics and Computing (JAIC), 5(2), 109-116. https://doi.org/10.30871/jaic.v5i1.3237

Gustriansyah, R., Suhandi, N., & Antony, F. (2020). Clustering optimization in RFM analysis based on K-Means. Indonesian Journal of Electrical Engineering and Computer Science, 18(1), 470~477. http://doi.org/10.11591/ijeecs.v18.i1.pp470-477

Harahap, B., & Rambe, A. (2021). Implementasi K-Means clustering terhadap mahasiswa yang menerima beasiswa Yayasan Pendidikan Battuta di Universitas Battuta tahun 2020/2021 studi kasus Prodi Informatika. Informatika, 9(3), 90-97. https://doi.org/10.36987/informatika.v9i3.2185

Hassan, I. H., Abdullahi, M., Ahmad, B. I., & Ali, Y. S. (2021). Analysis of techniques for selecting appropriate number of clusters in K-means clustering algorithm. In International Conference on Computing and Advances in Information Technology (ICCAIT), (pp. 90-96). https://www.researchgate.net/profile/Ibrahim-Hayatu-Hassan/publication/369747243

Jollyta, D., Siddik, M., Johan, & Gustientiedina. (2023). Pengoptimalan pengukuran Bregman divergences menggunakan Davies Bouldin Index. JOISIE (Journal of Information Systems and Informatics Engineering), 7(1), 174-180. https://ejournal.pelitaindonesia.ac.id/ojs32/index.php/JOISIE/article/view/3178

Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi Republik Indonesia. (2021). Peraturan Menteri Pendidikan, Kebudayaan, Riset, dan Teknologi Republik Indonesia nomor 17 tahun 2021 tentang asesmen nasional. https://peraturan.bpk.go.id/Details/175175/permendikbud-no-17-tahun-2021

Maori, N. A. & Evanita. (2023). Metode Elbow dalam Optimasi Jumlah Cluster pada K-Means Clustering. SIMETRIS, 14(2), 277-287. https://doi.org/10.24176/simet.v14i2.9630

Mariamah, Suciyati, & Hendrawan. (2021). Kemampuan numerasi siswa sekolah dasar ditinjau dari jenis kelamin. TUNAS: Jurnal Penelitian Pendidikan Dasar, 1(2), 17-19. https://jurnal.fkip.unmul.ac.id/index.php/tunas/article/view/818

Muttaqin, Widiyanto, W. W., Munsarif, M., Mandias, G. F., Pungus, S. R., Widarman, A., Hapsari, W. K., Hardiyanti, S. A., Fatkhudin, A., Pasnur, Bisono, E. F., Anshori, M., Suryani, Saputra, N. (2023). Pengenalan data mining. Yayasan Kita Menulis.

Nugraha, A. S., & Rudhito, M. A. (2025). Unveiling numeracy competency domains of high school students in Indonesia: A clustering analysis approach. JMPM: Jurnal Matematika dan Pendidikan Matematika, 10(2), 161–179. https://doi.org/10.26594/jmpm.v10i2.5972

Nurfadillah, D., Aufa, F. N., & Rachman, I. F. (2024). Membangun kualitas pendidikan melalui kemampuan literasi dan numerisasi dalam implementasi Kurikulum Merdeka. ALFIHRIS: Jurnal Inspirasi Pendidikan, 2(3), 128-140. https://doi.org/10.59246/alfihris.v2i3.876

Purnamasari, E. N. (2024). Analisis pengaruh jumlah penduduk, jumlah pengangguran dan tingkat pendidikan terhadap pertumbuhan ekonomi di pulau Jawa. Jurnal Ilmu Ekonomi (JIE), 8(1), 123-133. https://doi.org/10.22219/jie.v8i01.31768

Putri, S. A., Komariyah, & Dianawanti, T. (2023). Pro: Ketimpangan pembangunan pendidikan di Jawa - luar Jawa masih tinggi. In K. S. Nugroho (Ed.), Pro kontra isu pembangunan (pp. 128-132). CV Agung Ilmu.

Rahayu, N. D., Anshor, A. H., & Afriantoro, I. (2024). Penerapan data mining untuk pemetaan siswa berprestasi menggunakan metode clustering K-Means. JUKI: Jurnal Komputer dan Informatika, 6(1), 71-83. https://doi.org/10.53842/juki.v6i1.474

Rosida, W., & Wijaya, Y. A. (2023). Klasterisasi penyakit HIV/AIDS di Jawa Barat menggunakan algoritma K-Means clustering. Blend Sains Jurnal Teknik, 1(4), 306-315. https://doi.org/10.56211/blendsains.v1i4.235

Setyaningtyas, S., Nugroho, B. I., & Arif, Z. (2022). Tinjauan pustaka sistematis pada data mining: Studi kasus algoritma K-Means clustering. Jurnal Teknoif Teknik Informatika Institut Teknologi Padang, 10(2), 52-61. https://doi.org/10.21063/jtif.2022.V10.2.52-61

Stiadi, M. J., & Sundani, D. (2021). Performance measurement of K-Means clustering algorithm using Davies Bouldin Index method. International Research Journal of Advanced Engineering and Science, 6(4), 72-75. https://irjaes.com/wp-content/uploads/2021/10/IRJAES-V6N4P76Y21.pdf

Sukaryo, A. F., & Sari, R. M. M. (2024). Systematic literature review: Kemampuan numerasi siswa dalam pembelajaran matematika. Jurnal Theorems (The Original Reasearch of Mathematics), 8(2), 461-473. https://doi.org/10.31949/th.v8i2.8212

Suputra, I. W. A., Candiasa, I. M., & Suryawan, I. P. P. (2021). Klasterisasi hasil ujian nasional SMA/MA dengan algoritma K-Means. Wahana Matematika dan Sains: Jurnal Matematika, Sains, dan, Pembelajarannya, 15(1), 22-30. https://ejournal.undiksha.ac.id/index.php/JPM/article/view/25380

Downloads

Published

26-11-2025

How to Cite

Putri, D. W. M., & Nugraha, A. S. (2025). Clustering senior high school students’ numeracy score in Java using the K-means algorithm. Proceedings of International Conference on Research in Education, 1(1), 277-292. https://doi.org/10.24071/icre.v1i1.27

Similar Articles

You may also start an advanced similarity search for this article.