Improving KNN by Gases Brownian Motion Optimization Algorithm to Breast Cancer Detection
Keywords:Breast Cancer Detection, Classification, K-Nearest-Neighbor Algorithm, Gases Brownian Motion Optimization
In the last decade, the application of information technology and artificial intelligence algorithms are widely developed in collecting information of cancer patients and detecting them based on proposing various detection algorithms. The K-Nearest-Neighbor classification algorithm (KNN) is one of the most popular of detection algorithms, which has two challenges in determining the value of k and the volume of computations proportional to the size of the data and sample selected for training. In this paper, the Gaussian Brownian Motion Optimization (GBMO) algorithm is utilized for improving the KNN performance to breast cancer detection. To achieve to this aim, each gas molecule contains the information such as a selected subset of features to apply the KNN and k value. The GBMO has lower time-complexity order than other algorithms and has also been observed to perform better than other optimization algorithms in other applications. The algorithm and three well-known meta-heuristic algorithms such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO) and Imperialist Competitive Algorithm (ICA) have been implemented on five benchmark functions and compared the obtained results. The GBMO+KNN performed on three benchmark datasets of breast cancer from UCI and the obtained results are compared with other existing cancer detection algorithms. These comparisons show significantly improves this classification accuracy with the proposed detection algorithm.
Araújo, T., et al., Classification of breast cancer histology images using convolutional neural networks. PloS one, 2017. 12(6): p. e0177544.
Devi, R.D.H. and M.I. Devi, Outlier detection algorithm combined with decision tree classifier for early diagnosis of breast cancer. Int J Adv Engg Tech/Vol. VII/Issue II/April-June, 2016. 93: p. 98.
Luo, Y., et al., Unraveling biophysical interactions of radiation pneumonitis in non-small-cell lung cancer via Bayesian network analysis. Radiotherapy and Oncology, 2017. 123(1): p. 85-92.
Karabatak, M., A new classifier for breast cancer detection based on Naïve Bayesian. Measurement, 2015. 72: p. 32-36.
Dikaios, N., et al., Logistic regression model for diagnosis of transition zone prostate cancer on multi-parametric MRI. European radiology, 2015. 25(2): p. 523-532.
Wang, H., et al., A support vector machine-based ensemble algorithm for breast cancer diagnosis. European Journal of Operational Research, 2018. 267(2): p. 687-699.
Jalayeri, S. and M. Abdolrazzagh-Nezhad, Chemical reaction optimization to disease diagnosis by optimizing hyper-planes classifiers. Soft Computing, 2019: p. 1-20.
Hossain, E., M.F. Hossain, and M.A. Rahaman. An Approach for the Detection and Classification of Tumor Cells from Bone MRI Using Wavelet Transform and KNN Classifier. in 2018 International Conference on Innovation in Engineering and Technology (ICIET). 2018. IEEE.
Hiesh, M.-H., et al. Classification of schizophrenia using genetic algorithm-support vector machine (ga-svm). in Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE. 2013. IEEE.
Majid, A. and R. Homa, Classification application with the help of cultural algorithm in predicting and diagnosing cancer. 3rd International Conference on Applied Research in Computer Engineering and Information Technology, 2014.
Wu, M., Z. Xu, and J. Watada, Memetic algorithm based support vector machine classification. International Journal of Innovative Management Information &Production, 2012. 3(3): p. 99-117.
Wang, K.-J., et al., A hybrid classifier combining Borderline-SMOTE with AIRS algorithm for estimating brain metastasis from lung cancer: A case study in Taiwan. Computer methods and programs in biomedicine, 2015. 119(2): p. 63-76.
Polat, K. and S. Güneş, An improved approach to medical data sets classification: artificial immune recognition system with fuzzy resource allocation mechanism. Expert Systems, 2007. 24(4): p. 252-270.
Ling Chen, H., et al., Towards an optimal support vector machine classifier using a parallel particle swarm optimization strategy. Applied Mathematics and Computation, 2014. 239: p. 180-197.
Saidi, M., M.A. Chikh, and N. Settouti. Automatic identification of diabetes diseases using a modified artificial immune recognition system2 (MAIRS2). in Proceedings of 3ème conference internationale sur l ‘informatique et ses applications. 2011.
Huang, C.-L. and J.-F. Dun, A distributed PSO–SVM hybrid system with feature selection and parameter optimization. Applied soft computing, 2008. 8(4): p. 1381-1391.
Ch, S., et al., A support vector machine-firefly algorithm based forecasting model to determine malaria transmission. Neurocomputing, 2014. 129: p. 279-288.
Liu, X. and H. Fu, PSO-based support vector machine with Cuckoo search technique for clinical disease diagnoses. The Scientific World Journal, 2014. 2014.
AlMuhaideb, S. and M.E.B. Menai, HColonies: a new hybrid metaheuristic for medical data classification. Applied intelligence, 2014. 41(1): p. 282-298.
Tomar, D. and S. Agarwal, A survey on Data Mining approaches for Healthcare. International Journal of Bio-Science and Bio-Technology, 2013. 5(5): p. 241-266.
Zheng, B., S.W. Yoon, and S.S. Lam, Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Systems with Applications, 2014. 41(4): p. 1476-1482.
Han, J., J. Pei, and M. Kamber, Data mining: concepts and techniques. 2011: Elsevier.
Jain, A.K., R.P.W. Duin, and J. Mao, Statistical pattern recognition: A review. IEEE Transactions on pattern analysis and machine intelligence, 2000. 22(1): p. 4-37.
Abdechiri, M., M.R. Meybodi, and H. Bahrami, Gases Brownian motion optimization: an algorithm for optimization (GBMO). Applied Soft Computing, 2013. 13(5): p. 2932-2946.
How to Cite
Copyright (c) 2020 Data Science: Journal of Computing and Applied Informatics
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Data Science: Journal of Informatics Technology and Computer Science (JoCAI) and Faculty of Computer Science and Information Technology as well as TALENTA Publisher Universitas Sumatera Utara as publisher of the journal.
Copyright encompasses exclusive rights to reproduce and deliver the article in all form and media. The reproduction of any part of this journal, its storage in databases and its transmission by any form or media, will be allowed only with a written permission fromData Science: Journal of Informatics Technology and Computer Science (JoCAI).