Analisis Faktor Penentu Harga Mobil Bekas Menggunakan Model Random Forest Regressor serta Perbandingan Linear
Sabrina Khoirunnisa(1*); Nining Rahaninsih(2); Irfan Ali(3); Willy Prihartono(4);
(1) STMIK IKMI
(2) STMIK IKMI
(3) STMIK IKMI
(4) STMIK IKMI
(*) Corresponding Author
AbstractPenentuan harga mobil bekas merupakan proses yang kompleks karena dipengaruhi oleh berbagai faktor seperti jarak tempuh, usia kendaraan, merek, jenis bahan bakar, dan riwayat kerusakan. Metode konvensional sering kali menghasilkan penilaian yang subjektif dan kurang akurat, sehingga diperlukan pendekatan berbasis data untuk meningkatkan objektivitas dan konsistensi estimasi harga. Penelitian ini bertujuan untuk membangun model prediksi harga mobil bekas menggunakan algoritma Random Forest Regressor serta membandingkan performanya dengan Multiple Linear Regression sebagai baseline. Dataset yang digunakan berasal dari platform Kaggle dan mencakup 4.009 data kendaraan yang telah melalui proses data cleaning, rekayasa fitur, dan penghapusan outlier. Metode penelitian meliputi preprocessing data, pelatihan model, hyperparameter tuning, serta evaluasi menggunakan metrik R², MAE, MSE, dan RMSE. Hipotesis penelitian menyatakan bahwa Random Forest memiliki performa prediktif yang lebih baik dibandingkan model linier serta mampu mengidentifikasi fitur yang paling berpengaruh terhadap harga kendaraan. Hasil eksperimen menunjukkan bahwa Random Forest R² = 0.6827, lebih tinggi dibandingkan Multiple Linear Regression dengan R² = 0.5673. Analisis feature importance mengungkapkan bahwa mileage dan usia kendaraan merupakan faktor dominan dalam pembentukan harga. Dengan demikian, penelitian ini menyimpulkan bahwa Random Forest merupakan pendekatan yang lebih akurat dan stabil untuk prediksi harga mobil bekas serta berpotensi diimplementasikan dalam sistem valuasi otomotif berbasis data. Keywordsprediksi harga mobil; machine learning; random forest; regresi linier; feature importance
|
Full Text:PDF |
Article MetricsAbstract view: 223 timesPDF view: 149 times |
Digital Object Identifier https://doi.org/10.33096/linier.v2i4.3346
|
Cite |
References
Y. Chen, “Research on Machine Learning-based Prediction of Coffee Futures Prices,” 2024.
C. Li, “Machine Learning-Based Models for Accurate Car Prices Prediction,” 2024.
J. Gao, “Second-hand car price prediction based on multiple linear regression and random forest,” Theoretical and Natural Science, vol. 52, no. 1, pp. 31–40, 2024, doi: 10.54254/2753-8818/52/2024ch0105.
J. Umapathi and R. Latha, “Automated pricing predictions for pre-owned vehicles using random forest,” IJERST, vol. 2, no. 2, pp. 21–26, 2024, doi: 10.63458/ijerst.v2i2.80.
L. Bukvić, J. Pašagić Škrinjar, T. Fratrović, and B. Abramović, “Price prediction and classification of used-vehicles using supervised machine learning,” Sustainability, vol. 14, no. 24, p. 17034, 2022, doi: 10.3390/su142417034.
A. Alhakamy, A. Alhowaity, A. A. Alatawi, and H. Alsaadi, “Are used cars more sustainable? Price prediction based on linear regression,” Sustainability, vol. 15, no. 2, p. 911, 2023, doi: 10.3390/su15020911.
B. Cui, Z. Ye, H. Zhao, Z. Renqing, L. Meng, and Y. Yang, “Used car price prediction based on the iterative framework of XGBoost+LightGBM,” Electronics (Basel)., vol. 11, no. 18, p. 2932, 2022, doi: 10.3390/electronics11182932.
M. Aruna, M. Anjana, H. Chauhan, and R. Deepa, “Optimized hyperparameter tuned random forest regressor algorithm in predicting resale car value based on grid search method,” International Journal of Advanced Research in Science Communication and Technology, pp. 106–113, 2021, doi: 10.48175/ijarsct-1217.
E. Fitri, “Analisis Perbandingan Metode Regresi Linier, Random Forest Regression dan Gradient Boosted Trees Regression Method untuk Prediksi Harga Rumah,” JOURNAL OF APPLIED COMPUTER SCIENCE AND TECHNOLOGY (JACOST), vol. 4, no. 1, pp. 2723–1453, 2023, doi: 10.52158/jacost.491.
L. Breiman, “Random Forests,” 2001.
M. Gollapalli, T. Alqahtani, D. Alhamed, M. Alnassar, A. Alajmi, and Y. Alali, “Intelligent modelling techniques for predicting used cars prices in Saudi Arabia,” Mathematical Modelling and Engineering Problems, vol. 10, no. 1, pp. 139–148, 2023, doi: 10.18280/mmep.100115.
S. Bergmann and S. Feuerriegel, “Machine learning for predicting used car resale prices using granular vehicle equipment information,” Expert Syst. Appl., vol. 263, p. 125640, Mar. 2025, doi: 10.1016/j.eswa.2024.125640.
N. O. Idris and F. Pontoiyo, “Evaluasi Model Machine Learning untuk Prediksi Harga Mobil dengan Perbandingan Ensemble dan Regresi Linear,” Jurnal Ilmu Komputer dan Sistem Informasi, vol. 4, no. 1, pp. 129–143, Jan. 2025, doi: 10.70340/jirsi.v4i1.181.
Md. A. Kadir, B. Nassar, S. M. Jahangir Alam, S. Ahmed, and Md. K. Syfullah, “Car Price Prediction: A Comparative Study of Machine Learning and Deep Learning Approaches,” in 2025 International Conference on Electrical, Computer and Communication Engineering (ECCE), IEEE, Feb. 2025, pp. 1–6. doi: 10.1109/ECCE64574.2025.11013535.
H. Lahuddin, Muh. R. A. Muliawan, K. Takemoto, H. Darwis, S. R. Jabir, and R. Adawiyah, “Cryptocurrency Prices Forecasting Using LSTM, CNN, Transformer, TCN, and Hybrid Model: A Deep Learning Approach,” in 2025 9th International Conference On Electrical, Electronics And Information Engineering (ICEEIE), IEEE, Sep. 2025, pp. 1–6. doi: 10.1109/ICEEIE66203.2025.11252474.
A. Eklund, A. Berndt, and S. Sandberg, “Orchestrating an experiential value proposition: The case of a Scandinavian automotive manufacturer,” European Business Review, vol. 34, no. 5, pp. 624–641, 2022, doi: 10.1108/ebr-07-2021-0149.
Y. Danylenko, “Analysis of the used passenger car market in Ukraine,” Economics Finances Law, pp. 24–36, 2023, doi: 10.37634/efp.2023.9.6.
V. K. Gohil, “Car Price Prediction by Machine Learning Approach,” INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT, vol. 09, no. 09, pp. 1–9, Sep. 2025, doi: 10.55041/IJSREM52586.
R. Cheng, P. Haste, E. Levens, and J. Bergmann, “Feature importance for estimating rating of perceived exertion from cardiorespiratory signals using machine learning,” Front. Sports Act. Living, vol. 6, 2024, doi: 10.3389/fspor.2024.1448243.
Y. Gerstorfer, M. Hahn-Klimroth, and L. Krieg, “A Notion of Feature Importance by Decorrelation and Detection of Trends by Random Forest Regression,” Regression. Data Science Journal, vol. 22, pp. 1–14, doi: 10.5334/dsj.
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Sabrina Khoirunnisa, Nining Rahaninsih, Irfan Ali, Willy Prihartono

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Linier: Literatur Informatika dan Komputer indexed by
ISSN 3063-2218
Published by Prodi Teknik Informatika FIK Universitas Muslim Indonesia
Website : https://jurnal.fikom.umi.ac.id/index.php/LINIER/
Email : linier@umi.ac.id
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0

