A Hybrid Movie Recommendation System to Address Data Sparsity Using Genre-Based K-Means and Neural Collaborative Filtering
Herdianti Darwis(1); Firdaus Abrazawaiz Syahrir(2*); Lilis Nur Hayati(3);
(1) Universitas Muslim Indonesia
(2) Universitas Muslim Indonesia
(3) Universitas Muslim Indonesia
(*) Corresponding Author
AbstractRecommendation systems play a crucial role in helping users navigate the overwhelming volume of information on digital platforms. However, conventional Collaborative Filtering (CF) methods often suffer from data sparsity, leading to reduced prediction accuracy and limited recommendation diversity. To address this challenge, this study proposes a hybrid recommendation model that integrates K-Means clustering based on genre, release year, and rating statistics into the Neural Collaborative Filtering (NCF) framework. Unlike previous works that rely on a single dimension like genre or demographics for clustering, our model uniquely combines multiple content-based features. Furthermore, we explicitly integrate the cluster labels as additional embedding features within the NCF framework, enabling more nuanced and context-aware representation learning. Using the MovieLens Latest-Small dataset, our hybrid model significantly outperforms the baseline NCF across all metrics, achieving a Mean Absolute Error (MAE) of 0.6097, a Root Mean Square Error (RMSE) of 0.7946, and improvements in Precision@10 (0.6065) and Recall@10 (0.7063). These findings highlight the effectiveness of our novel, content-aware clustering approach in deep learning recommenders, resulting in more accurate, diverse, and contextually relevant movie suggestions. KeywordsCollaborative Filtering, Data Sparsity, K-Means, Neural Collaborative Filtering, Recommendation System
|
Full Text:PDF |
Article MetricsAbstract view: 83 timesPDF view: 27 times |
Digital Object Identifier![]() |
Cite |
References
H. Zhou, F. Xiong, and H. Chen, “A Comprehensive Survey of Recommender Systems Based on Deep Learning,” Oct. 01, 2023, Multidisciplinary Digital Publishing Institute (MDPI). doi: 10.3390/app132011378.
J. Joy, N. S. Raj, and V. G. Renumol, “Ontology-based E-learning Content Recommender System for Addressing the Pure Cold-start Problem,” Journal of Data and Information Quality, vol. 13, no. 3, 2021, doi: 10.1145/3429251.
X. Zhang et al., “A Survey on Side Information-driven Session-based Recommendation: From a Data-centric Perspective,” May 2025, doi: 10.48550/arXiv.2505.12279.
X. Zhang et al., “Dynamic intent-aware iterative denoising network for session-based recommendation,” Inf Process Manag, vol. 59, no. 3, May 2022, doi: 10.1016/j.ipm.2022.102936.
L. Li, Z. Zhang, and S. Zhang, “Hybrid Algorithm Based on Content and Collaborative Filtering in Recommendation System Optimization and Simulation,” Sci Program, vol. 2021, 2021, doi: 10.1155/2021/7427409.
Purnawansyah, A. Adnan, H. Darwis, A. P. Wibawa, T. Widyaningtyas, and Haviluddin, “Ensemble semi-supervised learning in facial expression recognition,” International Journal of Advances in Intelligent Informatics, vol. 11, no. 1, pp. 1–24, Feb. 2025, doi: 10.26555/ijain.v11i1.1880.
L. Wu, X. He, X. Wang, K. Zhang, and M. Wang, “A Survey on Accuracy-oriented Neural Recommendation: From Collaborative Filtering to Information-rich Recommendation,” Dec. 2021, doi: 10.1109/TKDE.2022.3145690.
A. Mustaqeem, S. M. Anwar, and M. Majid, “A modular cluster based collaborative recommender system for cardiac patients,” Artif Intell Med, vol. 102, Jan. 2020, doi: 10.1016/j.artmed.2019.101761.
L. Chen, Y. Luo, X. Liu, W. Wang, and M. Ni, “Improved collaborative filtering recommendation algorithm based on user attributes and K-means clustering algorithm,” in Journal of Physics: Conference Series, IOP Publishing Ltd, May 2021. doi: 10.1088/1742-6596/1903/1/012036.
A. Althbiti, R. Alshamrani, T. Alghamdi, S. Lee, and X. Ma, “Addressing Data Sparsity in Collaborative Filtering Based Recommender Systems Using Clustering and Artificial Neural Network,” in 2021 IEEE 11th Annual Computing and Communication Workshop and Conference, CCWC 2021, Institute of Electrical and Electronics Engineers Inc., Jan. 2021, pp. 218–227. doi: 10.1109/CCWC51732.2021.9376008.
Purnawansyah, A. P. Wibawa, T. Widiyaningtyas, Haviluddin, H. Darwis, and H. Azis, “An in-depth exploration of supervised and semi-supervised learning on face recognition,” Open Computer Science, vol. 15, no. 1, Jan. 2025, doi: 10.1515/comp-2025-0029.
S. Siet, S. Peng, S. Ilkhomjon, M. Kang, and D. S. Park, “Enhancing Sequence Movie Recommendation System Using Deep Learning and KMeans,” Applied Sciences (Switzerland), vol. 14, no. 6, Mar. 2024, doi: 10.3390/app14062505.
H. Azis, Nirmala, L. Syafie, Herman, F. Fattah, and T. Hasanuddin, “Unveiling Algorithm Classification Excellence: Exploring Calendula and Coreopsis Flower Datasets with Varied Segmentation Techniques,” in 2024 18th International Conference on Ubiquitous Information Management and Communication (IMCOM), 2024, pp. 1–7. doi: 10.1109/IMCOM60618.2024.10418246.
M. Balfaqih, “A Hybrid Movies Recommendation System Based on Demographics and Facial Expression Analysis using Machine Learning.” doi: 10.14569/IJACSA.2023.0141177.
N. K. Ayyiyah, R. Kusumaningrum, and R. Rismiyati, “Film Recommender System Menggunakan Metode Neural Collaborative Filtering,” Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 10, no. 3, pp. 699–708, Jul. 2023, doi: 10.25126/jtiik.2023106616.
X. Zhang, B. Xu, Z. Ren, X. Wang, H. Lin, and F. Ma, “Disentangling ID and Modality Effects for Session-based Recommendation,” in SIGIR 2024 - Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc, Jul. 2024, pp. 1883–1892. doi: 10.1145/3626772.3657748.
X. Zhang, B. Xu, F. Ma, C. Li, L. Yang, and H. Lin, “Beyond Co-occurrence: Multi-modal Session-based Recommendation,” Sep. 2023, doi: 10.48550/arXiv.2309.17037.
P. A. Sedyo Mukti and Z. K. A. Baizal, “Enhancing Neural Collaborative Filtering with Metadata for Book Recommender System,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 19, no. 1, p. 61, Jan. 2025, doi: 10.22146/ijccs.103611.
X. Zhang et al., “Price DOES Matter!: Modeling Price and Interest Preferences in Session-based Recommendation,” in SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc, Jul. 2022, pp. 1684–1693. doi: 10.1145/3477495.3532043.
I. Anggriawan and W. Gunawan, “Implementation of Data Mining Using K-Means Algorithm for Bicycle Sales Prediction,” ILKOM Jurnal Ilmiah, vol. 14, no. 3, pp. 284–293, Dec. 2022, doi: 10.33096/ilkom.v14i3.1291.284-293.
Z. Y. Taha and S. A. Aminifar, “High Accurate Multicriteria Cluster-Based Collaborative Filtering Recommender System,” Journal of Computer Science, vol. 18, no. 12, pp. 1189–1200, 2022, doi: 10.3844/jcssp.2022.1189.1200.
B. N. Yulisasih, H. Herman, S. Sunardi, and H. Yuliansyah, “Evaluation of K-Means Clustering Using Silhouette Score Method on Customer Segmentation,” ILKOM Jurnal Ilmiah, vol. 16, no. 3, pp. 330–342, Dec. 2024, doi: 10.33096/ilkom.v16i3.2325.330-342.
Purnawansyah et al., “Congestion Predictive Modelling on Network Dataset Using Ensemble Deep Learning,” Journal of Applied Data Sciences, vol. 5, no. 4, pp. 1597–1613, Dec. 2024, doi: 10.47738/jads.v5i4.333.
M. Ibrahim, I. S. Bajwa, N. Sarwar, F. Hajjej, and H. A. Sakr, “An Intelligent Hybrid Neural Collaborative Filtering Approach for True Recommendations,” IEEE Access, vol. 11, pp. 64831–64849, 2023, doi: 10.1109/ACCESS.2023.3289751.
A. R. Manga, Nirmala, H. Azis, F. Fattah, Y. Salim, and H. Darwis, “ResNet-50 for Flower Image Classification: A Comparative Study of Segmentation and Non-Segmentation Approaches,” in 2025 19th International Conference on Ubiquitous Information Management and Communication (IMCOM), 2025, pp. 1–6. doi: 10.1109/IMCOM64595.2025.10857520.
H. Darwis, R. Puspitasari, Purnawansyah, W. Astuti, D. Atmajaya, and M. Hasnawi, “A Deep Learning Approach for Improving Waste Classification Accuracy with ResNet50 Feature Extraction,” in 2025 19th International Conference on Ubiquitous Information Management and Communication (IMCOM), 2025, pp. 1–8. doi: 10.1109/IMCOM64595.2025.10857536.
X. Zhang, B. Xu, Y. Wu, Y. Zhong, H. Lin, and F. Ma, “FineRec: Exploring Fine-grained Sequential Recommendation,” in SIGIR 2024 - Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc, Jul. 2024, pp. 1599–1608. doi: 10.1145/3626772.3657761.
S. Rendle, W. Krichene, L. Zhang, and J. Anderson, “Neural Collaborative Filtering vs. Matrix Factorization Revisited,” in RecSys 2020 - 14th ACM Conference on Recommender Systems, Association for Computing Machinery, Inc, Sep. 2020, pp. 240–248. doi: 10.1145/3383313.3412488.
Y. Guo and Z. Yan, “Recommended System: Attentive Neural Collaborative Filtering,” IEEE Access, vol. 8, pp. 125953–125960, 2020, doi: 10.1109/ACCESS.2020.3006141.
Y. Afoudi, M. Lazaar, and M. Al Achhab, “Hybrid recommendation system combined content-based filtering and collaborative prediction using artificial neural network,” Simul Model Pract Theory, vol. 113, Dec. 2021, doi: 10.1016/j.simpat.2021.102375.
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Herdianti Darwis, Firdaus Abrazawaiz Syahrir, Lilis Nur hayati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.