Smart Journal Finder: A Web-Based Scientific Article Categorization Using Jaccard Similarity
Rodiah Rodiah(1*);
(1) Universitas Gunadarma
(*) Corresponding Author
AbstractThe rapid growth of scientific publications presents challenges for researchers in identifying appropriate journals for manuscript submission. With an overwhelming number of journals across diverse disciplines, manually matching a manuscript to a suitable journal becomes inefficient and prone to misclassification. This study proposes the Smart Journal Finder, a web-based system designed to recommend relevant scientific journals by analyzing textual similarities between user-submitted manuscripts and indexed journal articles. The system processes input data including the title, abstract, keywords, and field of study through several stages: preprocessing, stop word removal, stemming using the Nazief-Adriani algorithm, and duplicate term elimination. Similarity scoring is performed using the Jaccard Similarity algorithm, followed by ranking the results and displaying journal metadata such as subject, publisher, and citation metrics. Results show that the system accurately transforms and filters input text, effectively calculates similarity scores, and successfully matches manuscripts to appropriate journals. By automating this process, the Smart Journal Finder enhances the efficiency of journal selection, improves the relevance of publication targets, and supports researchers in increasing the visibility and impact of their work. However, the current implementation is limited to Indonesian-language journals and does not yet incorporate semantic similarity or multilingual processing. Future work will focus on expanding coverage across disciplines and integrating more advanced similarity models. KeywordsDocument Classification; Jaccard Similarity; Journal; Recommendation System; Publication;
|
Full Text:PDF |
Article MetricsAbstract view: 100 timesPDF view: 43 times |
Digital Object Identifier https://doi.org/10.33096/ilkom.v18i1.2814.17-29
|
Cite |
References
M. Pagliaro, “Publishing scientific articles in the digital era,” Open Science Journal, vol. 5, no. 3, pp. 1–12, 2020, doi: 10.23954/osj.v5i3.2617.
J. Paul, “Publishing in premier journals with high impact factor and Q1 journals: Dos and Don’ts,” International Journal of Consumer Studies, vol. 48, no. 3, pp. 1–9, 2024, doi: 10.1111/ijcs.13049.
W. Bouaguel, N. Benyounes, C. Eddine, and B. Ncir, “Recommender systems and their effectiveness,” International Journal of Advanced and Applied Sciences, vol. 11, no. 5, pp. 217–229, 2024.
N. Kang, M. Doornenbal, and B. Schijvenaars, “Elsevier Journal Finder: Recommending journals for your paper,” in Proceedings of the 9th ACM Conference on Recommender Systems (RecSys 2015), pp. 261–264, 2015, doi: 10.1145/2792838.2799663.
F. Malo and A. Al-Zebari, “Intelligent semantic search for academic journals using AI and NLP techniques,” International Journal of Intelligent Systems and Applications, vol. 10, pp. 404–420, 2025.
A. M. Vieriu and G. Petrea, “The impact of Artificial Intelligence (AI) on students’ academic development,” Education Sciences, vol. 15, no. 3, pp. 1–12, 2025, doi: 10.3390/educsci15030343.
K. Madatov and S. Sattarova, “Using the Jaccard similarity method for recommendation system of books,” Society and Innovation Journal, vol. 5, no. 1, pp. 59–69, 2024, doi: 10.47689/2181-1415-vol5-iss1-pp59-69.
S. T. Gupta and U. Thakar, “A comprehensive survey on techniques for numerical similarity measurement,” Expert Systems with Applications, vol. 277, 2025, doi: 10.1016/j.eswa.2025.127235.
S. Bag, S. K. Kumar, and M. K. Tiwari, “An efficient recommendation generation using relevant Jaccard similarity,” Information Sciences, vol. 483, pp. 53–64, 2019, doi: 10.1016/j.ins.2019.01.023.
M. Blazevic, L. B. Sina, C. A. Secco, and K. Nazemi, “Recommendation of scientific publications—A real-time text analysis and publication recommendation system,” Electronics, vol. 12, no. 7, 2023, doi: 10.3390/electronics12071699.
O. Ismael, “The standard deviation score: A novel similarity metric for data analysis,” Journal of Big Data, vol. 12, no. 1, 2025, doi: 10.1186/s40537-025-01091-z.
M. Ostendorff, T. Blume, T. Ruas, B. Gipp, and G. Rehm, “Specialized document embeddings for aspect-based similarity of research papers,” Proceedings of the ACM Web Conference, vol. 1, no. 1, 2022, doi: 10.1145/3529372.3530912.
M. Hasan, A. T. Islam, and N. Islam, “Utilizing collaborative filtering in a personalized research-paper recommendation system,” arXiv preprint, 2024. [Online]. Available: http://arxiv.org/abs/2409.19267
T. P. Rinjeni, A. Indriawan, and N. A. Rakhmawati, “Matching scientific article titles using cosine similarity and Jaccard similarity algorithm,” Procedia Computer Science, vol. 234, pp. 553–560, 2024, doi: 10.1016/j.procs.2024.03.039.
K. K. Jena et al., “E-learning course recommender system using collaborative filtering models,” Electronics, vol. 12, no. 1, 2023, doi: 10.3390/electronics12010157.
S. Zhang, K. Liu, Z. Yu, B. Feng, and Z. Ou, “Hybrid recommendation system combining collaborative filtering and content-based recommendation with keyword extraction,” Applied Computing and Engineering, vol. 2, no. 1, pp. 927–939, 2023, doi: 10.54254/2755-2721/2/20220579.
M. A. A. Y. Afoudi and M. Lazaar, “Hybrid recommendation system combining content-based filtering and collaborative prediction using artificial neural network,” Simulation Modelling Practice and Theory, vol. 113, no. 102375, 2021, doi: 10.1016/j.simpat.2021.102375.
R. Riyanto, “Implementation of the Jaccard similarity algorithm on answer type description,” International Journal of Informatics and Information Systems (IJIIS), vol. 5, no. 2, pp. 76–83, 2022, doi: 10.47738/ijiis.v5i2.130.
A. Rahmatulloh, N. I. Kurniati, I. Darmawan, A. Z. Asyikin, and D. Witarsyah, “Comparison of the effects of stemmer Porter and Nazief-Adriani on the performance of Winnowing algorithms for measuring plagiarism,” Journal of Digital Information Management, vol. 18, no. 2, pp. 49–56, 2020, doi: 10.6025/jdim/2020/18/2/49-56.
A. R. Munappy, J. Bosch, H. H. Olsson, A. Arpteg, and B. Brinne, “Data management for production quality deep learning models: Challenges and solutions,” Journal of Systems and Software, vol. 191, p. 111359, 2022, doi: 10.1016/j.jss.2022.111359.
Huawei Technologies Co., Ltd., Database Design Fundamentals. Singapore: Springer, 2023, doi: 10.1007/978-981-19-3032-4_7.
S. M. Abdullah, S. M. Ali, and M. A. Makttof, “Modifying Jaccard coefficient for text similarity,” Nuevos Sistemas Especiales Journal, vol. 35, pp. 2899–2921, 2019.
M. Siino, I. Tinnirello, and M. La Cascia, “Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers,” Information Systems, vol. 121, p. 102342, 2024, doi: 10.1016/j.is.2023.102342.
Z. Abidin, A. Junaidi, and Wamiliana, “Text stemming and lemmatization of regional languages in Indonesia: A systematic literature review,” Journal of Information Systems Engineering and Business Intelligence, vol. 10, no. 2, pp. 217–231, 2024, doi: 10.20473/jisebi.10.2.217-231.
R. Pramana, Debora, J. J. Subroto, A. A. S. Gunawan, and Anderies, “Systematic literature review of stemming and lemmatization performance for sentence similarity,” in Proceedings of the 7th IEEE International Conference on Information Technology and Digital Applications (ICITDA 2022), 2022, doi: 10.1109/ICITDA55840.2022.9971451.
A. Rahmatulloh, N. I. Kurniati, I. Darmawan, A. Z. Asyikin, and J. D. Witarsyah, “Comparison between the stemmer Porter effect and Nazief-Adriani on the performance of Winnowing algorithms for measuring plagiarism,” International Journal of Advanced Science, Engineering and Information Technology, vol. 9, no. 4, pp. 1124–1128, 2019, doi: 10.18517/ijaseit.9.4.8844.
F. Rumaisa, “Evaluation of Indonesian language stemmer algorithms: A comparative analysis,” Brilliance – Journal of Artificial Intelligence, vol. 5, no. 1, pp. 21–25, 2025.
D. Mustikasari, I. Widaningrum, R. Arifin, and W. H. E. Putri, “Comparison of effectiveness of stemming algorithms in Indonesian documents,” in Proceedings of the 2nd Borobudur International Symposium on Science and Technology (BIS-STE 2020), vol. 203, pp. 154–158, 2021, doi: 10.2991/aer.k.210810.025.
L. Zahrotun, “Comparison of Jaccard similarity, cosine similarity and the combination of both for data clustering using shared nearest neighbor method,” Computer Engineering and Applications Journal, vol. 5, no. 1, pp. 11–18, 2016, doi: 10.18495/comengapp.v5i1.160.
M. H. Ahmed, S. Tiun, N. Omar, and N. S. Sani, “Short text clustering algorithms, application and challenges: A survey,” Applied Sciences, vol. 13, no. 1, 2023, doi: 10.3390/app13010342.
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Rodiah

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.






