A Comparative Study of Ensemble Learning Model Performance in Classifying Airline Passenger Satisfaction

Fransiska Elly Renni Susanti; Heni Ermewaningsih; Donatus Leo; Kristian Tengker

doi:10.52436/1.jpti.1504

Authors

Fransiska Elly Renni Susanti, Computer Engineering, Institut Teknologi Keling Kumang
Heni Ermewaningsih, Computer Engineering, Institut Teknologi Keling Kumang
Donatus Leo, Computer Engineering, Institut Teknologi Keling Kumang
Kristian Tengker, Computer Engineering, Institut Teknologi Keling Kumang

DOI:

https://doi.org/10.52436/1.jpti.1504

Keywords:

ensemble learning, passenger satisfaction, classification, airline, XGBoost

Abstract

This study evaluates and compares the performance of several ensemble learning models for classifying airline passenger satisfaction, using a public dataset from Kaggle. The dataset comprises 25,893 observations and 23 predictor variables covering customer characteristics, service quality, and flight experience. Three algorithms are compared: Random Forest, Gradient Boosting, and XGBoost. The models were built using an 80:20 train-test split together with cross-validation, and were evaluated with accuracy, precision, recall, F1-score, and ROC-AUC. The results show that XGBoost performed best on every evaluation metric, with an accuracy of 0.963 and a ROC-AUC of 0.995. Feature importance analysis indicates that online boarding, business travel type, and in-flight service quality are the dominant factors influencing passenger satisfaction. The main contribution of this study is a comparative evaluation of ensemble learning models integrated with interpretability analysis to identify the key determinants of customer satisfaction in tabular data. These findings imply that improving digital services and the quality of the cabin experience should be strategic priorities for raising airline customer satisfaction.
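The five evaluation metrics named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code: the function names (`confusion_counts`, `roc_auc`, `evaluate`), the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and no values from the study's dataset are used.

```python
from __future__ import annotations

from typing import Sequence


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, with 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC-AUC as the share of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair. This pairwise form is
    quadratic in the sample size, which is fine for a small illustration.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC-AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def evaluate(y_true: Sequence[int], scores: Sequence[float], threshold: float = 0.5) -> dict[str, float]:
    """Compute accuracy, precision, recall, F1, and ROC-AUC from predicted scores."""
    # The threshold is an assumed default, not a value taken from the study.
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "roc_auc": roc_auc(y_true, scores),
    }


# Hypothetical toy data: 1 = satisfied, 0 = not satisfied.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.7, 0.4, 0.6, 0.1, 0.8, 0.3]
metrics = evaluate(y_true, scores)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, whereas ROC-AUC is computed from the score ranking alone, which is one reason a comparison of classifiers typically reports both kinds of metric.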
