Preventing recession through GDP growth prediction: A classical and machine learning classification approach
DOI:
https://doi.org/10.32734/jocai.v7.i2-10507Keywords:
Accuracy; COVID-19; Data Classification; Machine Learning; Regional GDPAbstract
Classification methods are a popular method applied in many various fields of science. To represent the effect of predictor factors on categorical response variables, different machine learning classification algorithms are used, namely logistic regression, neural network (NN), random forest, support vector machine (SVM), and bayesian model averaging (BMA). Every classifier has its unique characteristic, performing well in certain datasets but not in others. Hence, it is always a quest to find the best classifier to use for a certain dataset. Economic growth, most commonly using a gross regional domestic product, is experiencing a recession or acceleration, especially before and during the COVID-19 pandemic. This research proposed a comparison of classification methods using regional GDP data for 2019-2020, before and during the COVID-19 pandemic, by predictor variables; percentage of workers, foreign direct investment (PMA), regional revenue (PAD), general allocation fund (DAU), revenue sharing fund (DBH), and the dummy of COVID-19. The results are that all selected machine learning models can classify the regional GDP growth perfectly for the training data, but, NN model outperforms the other methods with an accuracy of 100% in training and testing data. COVID-19 and the PMA are the most significant variables predicting regional GDP growth for all models. Further research relating to interpretable machine learning, such as feature interaction, global surrogate, and Shapley values, is also necessary to predict regional GDP growth using machine learning methods.
Downloads
References
J. Jelina and R. Sasikala, “Multi Class Classification Methods on Data Analysis Using Data Mining Techniques,” IOSRJEN, pp. 1– 4, 2018.
R. M. Stefan, “A Comparison of Data Classification Methods,” Procedia Economics and Finance, 3, pp. 420–425. Dec. 2012, DOI: 10.1016/S2212-5671(12)00174-8.
A. Tharwat, “Classification Assessment Methods,” Applied Computing and Informatics, vol. 17, no. 1, pp. 168–192, Jan. 2021, DOI: 10.1016/j.aci.2018.08.003
G. Rohit, 7 Types of Classification Algorithms, India, 2018, Accessed on: Dec. 16, 2021. [Online]. Available: https://analyticsindiamag.com/7-types-classification-algorithms/
D. Varghese, “Comparative Study on Classic Machine learning Algorithms,” Towards Data Science, 2018, Accessed on: Dec. 16, 2021. [Online]. Available: https://towardsdatascience.com/comparative-study-on-classic-machine-learning-algorithms-24f9ff6ab222
A. B. Abel and B. S. Bernake, Macroeconomics, NY, USA: Addison Wesley Longman. Inc., vol. 5, pp. 266–269, 2005.
N. Oktaviana and N. Amalia, “Gross Regional Domestic Product Forecasts Using Trend Analysis: Case Study of Bangka Belitung Province,” JESP, vol. 19, no. 2, pp. 142–151, Oct. 2018, DOI. 10.18196/jesp.19.2.5005.
R. Nuraini, Kasus COVID-19 Pertama, Masyarakat Jangan Panik, Indonesia, 2020, Accessed on: Dec. 17, 2021. [Online]. Available: https://indonesia.go.id/narasi/indonesia-dalam-angka/ekonomi/kasus-COVID-19-pertama-masyarakat-jangan-panik
R. D. A. Saptoyo, Update Corona Dunia 5 Juli: 184 Juta Kasus COVID-19 | Angka Kematian akibat Tak Vaksinasi, Kompas, Indonesia, 2021, Accessed on: Dec. 16, 2020. [Online]. Available: https://www.kompas.com/tren/read/2021/07/05/112600765/update-corona-dunia -5-juli--184-juta-kasus-COVID-19-angka-kematian-akibat?page=all
D. Wuryandani, “Dampak Pandemi COVID-19 Terhadap Pertumbuhan Ekonomi Indonesia 2020 dan Solusinya,” Info Singkat, Economics and Public Policy, Research Center of People's Representative Council of Indonesia, vol. XII, no, 15, Aug. 2020.
H. F. Nasution and Z. Matondang, “The Role of Leading Sector Labor on The GRDP of North Sumatera Province,” JIEK, vol. 9, no. 1, pp. 76–92, June 2021.
Y. N. Malau and L. Loren, “Pengaruh Investasi, Tenaga Kerja, dan Ekspor Terhadap PDB di Provinsi Sumatera Utara Periode 2017-2019,” JIMEA, vol. 4, no. 3, pp. 1711–1721, 2020.
N. D. Muchisha, N. Tamara, Andriansyah, and A. M. Soleh, “Nowcasting Indonesia’s GDP Growth Using Machine Learning Algorithms,” IJSA, vol. 5, no. 2, pp. 355–368, June 2021, DOI. 10.29244/ijsa.v5i2p355-368.
Istiqomah, A. A. Wibowo, E. Yunita, and D. S. Gunawan, “Determinants of Gross Regional Domestic Product in Eastern Indonesia Region,” Trikonomika, vol. 18, no. 1, pp. 18, July 2019, DOI. 10.23969/trikonomika.v18i1.1233.
L. Sri and U. M. Suyana, “The Effect of Local Government Own Revenue and Revenue Sharing Funds on Econoomic Performance and Community Welfare Throught Capital Expenditure of Regency/City in Bali Province, Indonesia,” RJOAS, vol. 7, no. 91, pp. 67–87, July 2019, DOI. 10.18551/rjoas.2019-07.08.
A. S. Wahyuni, “Effect of Original Revenue Income and Funds General Allocation of Growth Economy of Surakarta City,” JSTEIE, vol. XX, no. XX, pp. 1–21, July 2020.
W. Susanto and C. Sugiyanto, “The Structure of Regional Original Revenue and Its Effect on Economic Growth: Facts from Regencies and Cities in Central Jawa,” IJDP, vol. III, no. 1, pp. 68–102, Apr. 2019.
N. Anisah, “Investment Development Before and During the COVID-19 Pandemic and Impact on Regional Economy in West Java,” JEP, vol. 19, no. 2, pp. 81–96. Nov. 2021.
Sirusa, Metadata Indikator, Central Bureau of Statistics (BPS), Indonesia, Accessed on: Dec. 16, 2020. [Online]. Available: https://sirusa.bps.go.id/sirusa/index.php/indikator/index
Berkas DPR, Dana Bagi Hasil, People's Representative Council of Indonesia, Accessed on: Dec. 16, 2020. [Online]. Available: https%3A%2F%2Fberkas.dpr.go.id%2Fpuskajianggaran%2Fformula%2Ffile%2Fformula-9.pdf&clen=523443&chunk=true
A. Agresti, “Logistic Regression,” in An Introduction to Categorical Data Analysis, 2nd ed. Hoboken, NJ, USA: John Wiley & Sons, pp. 99–136, 2007.
T. Abedin, Z. Chowdhury, A. R. Afzal, F. Yeasmin, and T. C. Turin, “Application of Binary Logistic Regression in Clinical Research,” JNHFB, 5, pp. 8–11, Jan. 2016.
Suhartono et al., “Model Selection in Feedforward Neural Networks for Forecasting Inflow and Outflow in Indonesia,” in Mohamed A., Berry M., Yap B. (eds) Soft Computing in Data Science. SCDS 2017. Communications in Computer and Information Science, vol. 788, Springer, Singapore, 2017.
M. Kuhn, and K. Johnson, Applied Predictive Modeling, New York: Springer, 2013.
V. N. Vapnik, The Nature of Statistical Learning Theory 2nd Edition, New York: Springer, 2000.
T. Nogueira, G. Palacio, F. Braga, P. Maia, E. Moura, C. Andrade, and P. Rocha, “Imbalance classification in a scaled-down wind turbine using radial basis function kernel and support vector machines,” Energy, pp. 238, 2021.
S. Abe, Support Vector Machines for Pattern Classification, 2nd ed., London: Springer, 2010.
M. Hinne, Q. Gronau, D. Bergh, and E. J. Wagenmakers, “A Conceptual Introduction to Bayesian Model Averaging,” Advances in Methods and Practices in Psychological Science, pp. 200–215, 2020. DOI. 10.1177/2515245919898657.
T. Fragoso, W. Bertoli, and F. Louzada, “Bayesian Model Averaging: A Systematic Review and Conceptual Classification,” International Statistical Review, pp. 1-28. 2017, DOI. 10.1111/insr.12243.
H. AL-Behadili, and K. Ku-Mahamud, “Hybrid K-Nearest Neighbour and Particle Swarm Optimization Technique for Divorce Classification,” International Journal on Advanced Science, Engineering and Information Technology, vol. 11, no. 4, pp. 1447-1454, 2021, DOI. 10.18517/ijaseit.11.4.14868.
Schober, et al, “Correlation Coefficients: Appropriate Use and Interpretation,” Anesthesia and Analgesia, vol. 126, no. 5, pp. 1763–1768, 2018.
A. Statnikov, L. Wang, and C. F. Aliferis, “A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification,” BMC Bioinformatics, vol. 9, pp. 319. 2008.
C. Molnar. "Interpretable machine learning” in A Guide for Making Black Box Models Explainable, 2021. [Online]. Available: https://christophm.github.io/interpretable-ml-book/
Published
How to Cite
Issue
Section
Copyright (c) 2023 Data Science: Journal of Computing and Applied Informatics
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Data Science: Journal of Informatics Technology and Computer Science (JoCAI) and Faculty of Computer Science and Information Technology as well as TALENTA Publisher Universitas Sumatera Utara as publisher of the journal.
Copyright encompasses exclusive rights to reproduce and deliver the article in all form and media. The reproduction of any part of this journal, its storage in databases and its transmission by any form or media, will be allowed only with a written permission fromData Science: Journal of Informatics Technology and Computer Science (JoCAI).
The Copyright Transfer Form can be downloaded here.
The copyright form should be signed originally and sent to the Editorial Office in the form of original mail or scanned document.