Preventing recession through GDP growth prediction: A classical and machine learning classification approach

Authors

  • Prilyandari Dina Saputri Institut Teknologi Sepuluh Nopember
  • Arin Berliana Angrenani Institut Teknologi Sepuluh Nopember
  • Ika Nur Laily Fitriana Institut Teknologi Sepuluh Nopember

DOI:

https://doi.org/10.32734/jocai.v7.i2-10507

Keywords:

Accuracy; COVID-19; Data Classification; Machine Learning; Regional GDP

Abstract

Classification methods are a popular method applied in many various fields of science. To represent the effect of predictor factors on categorical response variables, different machine learning classification algorithms are used, namely logistic regression, neural network (NN), random forest, support vector machine (SVM), and bayesian model averaging (BMA). Every classifier has its unique characteristic, performing well in certain datasets but not in others. Hence, it is always a quest to find the best classifier to use for a certain dataset. Economic growth, most commonly using a gross regional domestic product, is experiencing a recession or acceleration, especially before and during the COVID-19 pandemic. This research proposed a comparison of classification methods using regional GDP data for 2019-2020, before and during the COVID-19 pandemic, by predictor variables; percentage of workers, foreign direct investment (PMA), regional revenue (PAD), general allocation fund (DAU), revenue sharing fund (DBH), and the dummy of COVID-19.  The results are that all selected machine learning models can classify the regional GDP growth perfectly for the training data, but, NN model outperforms the other methods with an accuracy of 100% in training and testing data. COVID-19 and the PMA are the most significant variables predicting regional GDP growth for all models. Further research relating to interpretable machine learning, such as feature interaction, global surrogate, and Shapley values, is also necessary to predict regional GDP growth using machine learning methods.

Downloads

Download data is not yet available.

References

J. Jelina and R. Sasikala, “Multi Class Classification Methods on Data Analysis Using Data Mining Techniques,” IOSRJEN, pp. 1– 4, 2018.

R. M. Stefan, “A Comparison of Data Classification Methods,” Procedia Economics and Finance, 3, pp. 420–425. Dec. 2012, DOI: 10.1016/S2212-5671(12)00174-8.

A. Tharwat, “Classification Assessment Methods,” Applied Computing and Informatics, vol. 17, no. 1, pp. 168–192, Jan. 2021, DOI: 10.1016/j.aci.2018.08.003

G. Rohit, 7 Types of Classification Algorithms, India, 2018, Accessed on: Dec. 16, 2021. [Online]. Available: https://analyticsindiamag.com/7-types-classification-algorithms/

D. Varghese, “Comparative Study on Classic Machine learning Algorithms,” Towards Data Science, 2018, Accessed on: Dec. 16, 2021. [Online]. Available: https://towardsdatascience.com/comparative-study-on-classic-machine-learning-algorithms-24f9ff6ab222

A. B. Abel and B. S. Bernake, Macroeconomics, NY, USA: Addison Wesley Longman. Inc., vol. 5, pp. 266–269, 2005.

N. Oktaviana and N. Amalia, “Gross Regional Domestic Product Forecasts Using Trend Analysis: Case Study of Bangka Belitung Province,” JESP, vol. 19, no. 2, pp. 142–151, Oct. 2018, DOI. 10.18196/jesp.19.2.5005.

R. Nuraini, Kasus COVID-19 Pertama, Masyarakat Jangan Panik, Indonesia, 2020, Accessed on: Dec. 17, 2021. [Online]. Available: https://indonesia.go.id/narasi/indonesia-dalam-angka/ekonomi/kasus-COVID-19-pertama-masyarakat-jangan-panik

R. D. A. Saptoyo, Update Corona Dunia 5 Juli: 184 Juta Kasus COVID-19 | Angka Kematian akibat Tak Vaksinasi, Kompas, Indonesia, 2021, Accessed on: Dec. 16, 2020. [Online]. Available: https://www.kompas.com/tren/read/2021/07/05/112600765/update-corona-dunia -5-juli--184-juta-kasus-COVID-19-angka-kematian-akibat?page=all

D. Wuryandani, “Dampak Pandemi COVID-19 Terhadap Pertumbuhan Ekonomi Indonesia 2020 dan Solusinya,” Info Singkat, Economics and Public Policy, Research Center of People's Representative Council of Indonesia, vol. XII, no, 15, Aug. 2020.

H. F. Nasution and Z. Matondang, “The Role of Leading Sector Labor on The GRDP of North Sumatera Province,” JIEK, vol. 9, no. 1, pp. 76–92, June 2021.

Y. N. Malau and L. Loren, “Pengaruh Investasi, Tenaga Kerja, dan Ekspor Terhadap PDB di Provinsi Sumatera Utara Periode 2017-2019,” JIMEA, vol. 4, no. 3, pp. 1711–1721, 2020.

N. D. Muchisha, N. Tamara, Andriansyah, and A. M. Soleh, “Nowcasting Indonesia’s GDP Growth Using Machine Learning Algorithms,” IJSA, vol. 5, no. 2, pp. 355–368, June 2021, DOI. 10.29244/ijsa.v5i2p355-368.

Istiqomah, A. A. Wibowo, E. Yunita, and D. S. Gunawan, “Determinants of Gross Regional Domestic Product in Eastern Indonesia Region,” Trikonomika, vol. 18, no. 1, pp. 18, July 2019, DOI. 10.23969/trikonomika.v18i1.1233.

L. Sri and U. M. Suyana, “The Effect of Local Government Own Revenue and Revenue Sharing Funds on Econoomic Performance and Community Welfare Throught Capital Expenditure of Regency/City in Bali Province, Indonesia,” RJOAS, vol. 7, no. 91, pp. 67–87, July 2019, DOI. 10.18551/rjoas.2019-07.08.

A. S. Wahyuni, “Effect of Original Revenue Income and Funds General Allocation of Growth Economy of Surakarta City,” JSTEIE, vol. XX, no. XX, pp. 1–21, July 2020.

W. Susanto and C. Sugiyanto, “The Structure of Regional Original Revenue and Its Effect on Economic Growth: Facts from Regencies and Cities in Central Jawa,” IJDP, vol. III, no. 1, pp. 68–102, Apr. 2019.

N. Anisah, “Investment Development Before and During the COVID-19 Pandemic and Impact on Regional Economy in West Java,” JEP, vol. 19, no. 2, pp. 81–96. Nov. 2021.

Sirusa, Metadata Indikator, Central Bureau of Statistics (BPS), Indonesia, Accessed on: Dec. 16, 2020. [Online]. Available: https://sirusa.bps.go.id/sirusa/index.php/indikator/index

Berkas DPR, Dana Bagi Hasil, People's Representative Council of Indonesia, Accessed on: Dec. 16, 2020. [Online]. Available: https%3A%2F%2Fberkas.dpr.go.id%2Fpuskajianggaran%2Fformula%2Ffile%2Fformula-9.pdf&clen=523443&chunk=true

A. Agresti, “Logistic Regression,” in An Introduction to Categorical Data Analysis, 2nd ed. Hoboken, NJ, USA: John Wiley & Sons, pp. 99–136, 2007.

T. Abedin, Z. Chowdhury, A. R. Afzal, F. Yeasmin, and T. C. Turin, “Application of Binary Logistic Regression in Clinical Research,” JNHFB, 5, pp. 8–11, Jan. 2016.

Suhartono et al., “Model Selection in Feedforward Neural Networks for Forecasting Inflow and Outflow in Indonesia,” in Mohamed A., Berry M., Yap B. (eds) Soft Computing in Data Science. SCDS 2017. Communications in Computer and Information Science, vol. 788, Springer, Singapore, 2017.

M. Kuhn, and K. Johnson, Applied Predictive Modeling, New York: Springer, 2013.

V. N. Vapnik, The Nature of Statistical Learning Theory 2nd Edition, New York: Springer, 2000.

T. Nogueira, G. Palacio, F. Braga, P. Maia, E. Moura, C. Andrade, and P. Rocha, “Imbalance classification in a scaled-down wind turbine using radial basis function kernel and support vector machines,” Energy, pp. 238, 2021.

S. Abe, Support Vector Machines for Pattern Classification, 2nd ed., London: Springer, 2010.

M. Hinne, Q. Gronau, D. Bergh, and E. J. Wagenmakers, “A Conceptual Introduction to Bayesian Model Averaging,” Advances in Methods and Practices in Psychological Science, pp. 200–215, 2020. DOI. 10.1177/2515245919898657.

T. Fragoso, W. Bertoli, and F. Louzada, “Bayesian Model Averaging: A Systematic Review and Conceptual Classification,” International Statistical Review, pp. 1-28. 2017, DOI. 10.1111/insr.12243.

H. AL-Behadili, and K. Ku-Mahamud, “Hybrid K-Nearest Neighbour and Particle Swarm Optimization Technique for Divorce Classification,” International Journal on Advanced Science, Engineering and Information Technology, vol. 11, no. 4, pp. 1447-1454, 2021, DOI. 10.18517/ijaseit.11.4.14868.

Schober, et al, “Correlation Coefficients: Appropriate Use and Interpretation,” Anesthesia and Analgesia, vol. 126, no. 5, pp. 1763–1768, 2018.

A. Statnikov, L. Wang, and C. F. Aliferis, “A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification,” BMC Bioinformatics, vol. 9, pp. 319. 2008.

C. Molnar. "Interpretable machine learning” in A Guide for Making Black Box Models Explainable, 2021. [Online]. Available: https://christophm.github.io/interpretable-ml-book/

Published

2023-07-31

How to Cite

Saputri, P. D., Angrenani, A. B., & Fitriana, I. N. L. (2023). Preventing recession through GDP growth prediction: A classical and machine learning classification approach. Data Science: Journal of Computing and Applied Informatics, 7(2), 51-68. https://doi.org/10.32734/jocai.v7.i2-10507