The Clustering YouTube Videos of SMK Negeri 1 Percut Sei Tuan Based on Views and Likes Using the K-Means Algorithm

Authors

  • Nathania Asyifaa Magister Teknologi Informasi, Universitas Pembangunan Panca Budi, Jl. Gatot Subroto, Kec. Medan Sunggal, Kota Medan, Sumatera Utara 20122, Indonesia
  • Muhammad Iqbal Magister Teknologi Informasi, Universitas Pembangunan Panca Budi, Jl. Gatot Subroto, Kec. Medan Sunggal, Kota Medan, Sumatera Utara 20122, Indonesia

DOI:

https://doi.org/10.65126/jocosir.v2i2.59

Keywords:

YouTube analytics, K-Means clustering, audience engagement, educational content, data mining

Abstract

The increasing use of YouTube as a digital learning and promotional platform has encouraged educational institutions to optimize their content strategies to enhance audience engagement. This study aims to analyze and categorize YouTube videos from SMK N 1 Percut Sei Tuan based on views and likes using the K-Means clustering algorithm. A total of 50 videos were collected and preprocessed using normalization techniques to ensure consistent data scaling. The optimal number of clusters was determined using the Elbow Method, resulting in three distinct engagement groups: high, medium, and low. The clustering process was implemented using Python with the support of the pandas, numpy, scikit-learn, and matplotlib libraries. The results show that videos categorized under high engagement typically consist of school achievements and major institutional events, while low-engagement videos are related to administrative or routine activities with limited public appeal. The clustering outcomes provide valuable insights into audience preferences, allowing educational institutions to improve future content strategies by focusing on video types that generate higher engagement. This research demonstrates that the K-Means algorithm is effective in identifying content patterns and can be used as a decision-support tool for optimizing YouTube channel growth in the educational sector.

References

E. K. Ratnasari, R. V. H. Ginardi dan C. Fatichah, “Pengenalan penyakit noda pada citra daun tebu berdasarkan ciri tekstur fractal dimension co-occurrence matrix dan L*a*b* color moments,” JUTI, vol. 12, no. 2, p. 27– 36, 2014.

J. Liu, Z. Chang, C. K. S. Leun, R. C. W. Wong, Y. Xu and R. Zha, "Efficient mining of extraordinary patterns by pruning and predicting," Expert Systems with Applications, vol. 125, no. July, pp. 55-68, 2019.

M. Masinde and k. Mkhonto, "The Critical Success Factors for e-Government Implementation in South Africa’s Local government: Factoring in Apartheid Digital Divide," in 2019 IEEE 2nd International Conference on Information and Computer Technologies (ICICT), Kahului, HI, USA, 2019.

J. R. Varma, "Blockchain in Finance," Vikalpa: The Journal for Decision Makers, vol. 44, no. 1, pp. 1-11, 2019.

A. Orsdemir, G. Tilki and F. Altinay, "Evaluation by Teachers of “Use of Influence in Agile Management” by School Administration," International Journal of Disability, Development and Education, pp. 1-13, 2019.

Zhang, J., & Liu, P. “Analyzing YouTube Video Popularity Metrics for Educational Content,” IEEE Access, vol. 8, pp. 120945–120953, 2021.

Khan, M. L. “Social Media Engagement: What Motivates Users to Interact with Educational Content,” Journal of Information Systems Education, vol. 32, no. 1, pp. 45–56, 2021.

Sun, Y., et al. “Clustering Techniques for Big Data Analysis: A Review,” ACM Computing Surveys, vol. 54, no. 3, pp. 1–36, 2022.

Li, X., & Wang, H. “Application of K-Means Algorithm in Social Media Content Classification,” Procedia Computer Science, vol. 199, pp. 456–463, 2022.

Rahman, A. “Educational Video Analytics Using Machine Learning Techniques,” International Journal of Emerging Technologies in Learning (iJET), vol. 17, no. 4, pp. 112–124, 2023.

Jagtap, P., & Singh, R. “An Efficient Data Preprocessing Technique for Social Media Analytics,” International Journal of Data Science, 2021.

MacQueen, J. “Some Methods for Classification and Analysis of Multivariate Observations,” Berkeley Symposium, 1967.

Kodinariya, T. M., & Makwana, P. R. “Review on Determining Number of Cluster in K-Means Clustering,” IJARCSMS, 2013.

Gandomi, A., & Haider, M. “Beyond the Hype: Big Data Concepts, Methods, and Analytics,” International Journal of Information Management, 2015.

Zhang, J., & Liu, P. “Analyzing YouTube Video Popularity Metrics for Educational Content,” IEEE Access, 2021.

Sun, Y., et al. “Clustering Techniques for Big Data Analysis: A Review,” ACM Computing Surveys, 2022.

Jagtap, P., & Singh, R. “An Efficient Data Preprocessing Technique for Social Media Analytics,” International Journal of Data Science, 2021.

MacQueen, J. “Some Methods for Classification and Analysis of Multivariate Observations,” Berkeley Symposium, 1967.

Kodinariya, T. M., & Makwana, P. R. “Review on Determining Number of Cluster in K-Means Clustering,” IJARCSMS, 2013.

Gandomi, A., & Haider, M. “Beyond the Hype: Big Data Concepts, Methods, and Analytics,” International Journal of Information Management, 2015.

Zhang, J., & Liu, P. “Analyzing YouTube Video Popularity Metrics for Educational Content,” IEEE Access, 2021.

Sun, Y., et al. “Clustering Techniques for Big Data Analysis: A Review,” ACM Computing Surveys, 2022.

Figueiredo, F., et al. “Social Media Engagement Analysis Using K-Means Clustering,” Journal of Educational Technology & Society, 2020.

Chen, H., et al. “Big Data Analytics for Social Media: A Review of Methods and Tools,” Information Processing & Management, 2019.

Downloads

Published

2025-10-31

How to Cite

Asyifaa, N., & Iqbal , M. (2025). The Clustering YouTube Videos of SMK Negeri 1 Percut Sei Tuan Based on Views and Likes Using the K-Means Algorithm. Journal of Computer Science and Research (JoCoSiR), 2(2), 16–20. https://doi.org/10.65126/jocosir.v2i2.59