Improving the genetic bee colony optimization algorithm for efficient gene selection in microarray data

Abstract

Feature selection is a very critical component in the workflow of biomedical data mining applications. In particular, there is a need for feature selection methods that can find complex relationships among genes, yet computationally efficient. Within the scope of microarray data analysis, the genetic bee colony (GBC) algorithm is one of the best feature selection algorithms, which leverages the combination between genetic and ant colony optimization algorithms to search for the optimal solution. In this paper, we analyse in depth the fundamentals lying behind the GBC and propose some improvements in both efficiency and accuracy, so that researchers can even take more advantage of this excellent method. By (i) replacing the filtering phase of GBC with a more efficient technique, (ii) improving the population generation in the artificial colony algorithm used in GBC, and (iii) improving the exploitation method in GBC, our experiments in microarray data sets reveal that our new method GBC+ is not only significantly more accurate, but also around ten times faster on average than the original.

Publication
In Progress in Artificial Intelligence
Camilo Velázquez-Rodríguez
Camilo Velázquez-Rodríguez
Postdoctoral Researcher

My research interests include software engineering, artificial intelligence on code and mining software repositories.