Comparison of K-Means and K-Medoids Algorithms for Clustering Poverty Data in South Sumatra Using DBI Evaluation
DOI:
https://doi.org/10.31849/digitalzone.v15i2.23624Keywords:
K-Means, K-Medoids, Poverty Data, Clustering, Davies Bouldin Index (DBI)Abstract
This research focuses on the implementation and comparison of the K-Means and K-Medoids algorithms that function as poverty data clustering in South Sumatra Province, the poverty data is taken from the Central Statistics Agency of Indonesia (BPS Indonesia). This research also aims to analyze the poverty level in South Sumatra Province by including additional variables such as average years of schooling and per capita expenditure in the community in each regency or city in South Sumatra Province. Data clustering is done by both algorithms and then the performance value is Evaluated using Davies Bouldin Index DBI shows that K-Means gives better results, with a lower DBI value (0.204 at K=5) while K-Medoids has a DBI value of 0.239 at K=5, which indicates more compact and separated clusters. The superiority of K-Means is due to the homogeneous and minimal outlier characteristics of the dataset, which makes the centroid approach more optimal than medoids in K-Medoids. With these results, K-Means was chosen as the best algorithm for clustering poverty data in the region. The use of the K-Means algorithm produces a pattern in clusters related to education, economic inequality, and poverty distribution in various regions in South Sumatra. This implementation provides insight into how data clustering techniques can be applied to socio-economic data to provide policy makers in a region with information about the region, especially information about poverty-stricken areas.
References
N. Nursini, “Micro, small, and medium enterprises (MSMEs) and poverty reduction: empirical evidence from Indonesia,” Development Studies Research, vol. 7, no. 1, pp. 153–166, Jan. 2020, https://doi.org/10.1080/21665095.2020.1823238.
L. Sugiharti, M. A. Esquivias, M. S. Shaari, A. D. Jayanti, and A. R. Ridzuan, “Indonesia’s poverty puzzle: Chronic vs. transient poverty dynamics,” Cogent Economics and Finance, vol. 11, no. 2, 2023, https://doi.org/10.1080/23322039.2023.2267927.
W. Liu, J. Li, and R. Zhao, “The effects of rural education on poverty in China: a spatial econometric perspective,” J Asia Pac Econ, vol. 28, no. 1, pp. 176–198, Jan. 2023, https://doi.org/10.1080/13547860.2021.1877240.
H. Aggarwal, V. Kumar, and H. D. Arora, “Data mining algorithm based on Renyi fuzzy association rule: an application for the selection of suitable course,” Research in Statistics, vol. 1, no. 1, Oct. 2023, https://doi.org/10.1080/27684520.2023.2271902.
H. S. Kim, S. K. Kim, and L. S. Kang, “BIM performance assessment system using a K-means clustering algorithm,” Journal of Asian Architecture and Building Engineering, vol. 20, no. 1, pp. 78–87, 2021, https://doi.org/10.1080/13467581.2020.1800471.
P. Wang, H. Shi, X. Yang, and J. Mi, “Three-way k-means: integrating k-means and three-way decision,” International Journal of Machine Learning and Cybernetics, vol. 10, no. 10, pp. 2767–2777, 2019 https://doi.org/10.1007/s13042-018-0901-y.
A. Sobrinho Campolina Martins, L. Ramos de Araujo, and D. Rosana Ribeiro Penido, “K-Medoids clustering applications for high-dimensionality multiphase probabilistic power flow,” International Journal of Electrical Power and Energy Systems, vol. 157, Jun. 2024, https://doi.org/10.1016/j.ijepes.2024.109861.
N. H. M. M. Shrifan, M. F. Akbar, and N. A. M. Isa, “An adaptive outlier removal aided k-means clustering algorithm,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 8, pp. 6365–6376, Sep. 2022, https://doi.org/10.1016/j.jksuci.2021.07.003.
S. Mousavi, F. Zamani Boroujeni, and S. Aryanmehr, “Improving Customer Clustering By Optimal Selection Of Cluster Centroids In K-Means And K-Medoids Algorithms,” J Theor Appl Inf Technol, vol. 98, no. 18, 2020, [Online]. Available: www.jatit.org
N. H. M. M. Shrifan, M. F. Akbar, and N. A. M. Isa, “An adaptive outlier removal aided k-means clustering algorithm,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 8, pp. 6365–6376, Sep. 2022, https://doi.org/10.1016/j.jksuci.2021.07.003.
E. D. Madyatmadja, D. J. M. Sembiring, S. M. Br Perangin Angin, D. Ferdy, and J. F. Andry, “Big Data in Educational Institutions using RapidMiner to Predict Learning Effectiveness,” Journal of Computer Science, vol. 17, no. 4, pp. 403–413, 2021, https://doi.org/ 10.3844/jcssp.2021.403.413.
J. Heidari, N. Daneshpour, and A. Zangeneh, “A novel K-means and K-medoids algorithms for clustering non-spherical-shape clusters non-sensitive to outliers,” Pattern Recognit, vol. 155, Nov. 2024, https://doi.org/10.1016/j.patcog.2024.110639.
S. Ghaida Muthmainah and A. Id Hadiana, “Comparative Analysis of K-Means and K-Medoids Clustering in Retail Store Product Grouping,” International Journal of Quantitative Research and Modeling, vol. 5, no. 3, pp. 280–294, 2024.
M. P. Repollo, R. Aurelius, and C. Robielos, “Applying Clustering Algorithm on Poverty Analysis in a Community in the Philippines.”
P. I. Dalatu and Midi, “MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES New Approaches to Normalization Techniques to Enhance K-Means Clustering Algorithm,” 2020.
S. Sinsomboonthong, “Performance Comparison of New Adjusted Min-Max with Decimal Scaling and Statistical Column Normalization Methods for Artificial Neural Network Classification,” Int J Math Math Sci, vol. 2022, 2022, https://doi.org/10.1155/2022/3584406.
Y. A. Wijaya, D. A. Kurniady, E. Setyanto, W. S. Tarihoran, D. Rusmana, and R. Rahim, “Davies Bouldin Index Algorithm for Optimizing Clustering Case Studies Mapping School Facilities,” TEM Journal, vol. 10, no. 3, pp. 1099–1103, Aug. 2021, https://doi.org/10.18421/TEM103-13.
M. Li, D. Xu, D. Zhang, and J. Zou, “The seeding algorithms for spherical k-means clustering,” Journal of Global Optimization, vol. 76, no. 4, pp. 695–708, 2020, https:doi.org/10.1007/s10898-019-00779-w.
Z. Feng, W. Niu, R. Zhang, S. Wang, and C. Cheng, “Operation rule derivation of hydropower reservoir by k-means clustering method and extreme learning machine based on particle swarm optimization,” J Hydrol (Amst), vol. 576, pp. 229–238, 2019, https://doi.org/10.1016/j.jhydrol.2019.06.045.






