MACHINE LEARNING APPROACH FOR BREAST CANCER CLASSIFICATION

dc.contributor.authorAimufua, Gilbert Imuetinyan Osaze
dc.contributor.authorMbanaso, Uche M.
dc.contributor.authorAbdullahi, M.U.
dc.date.accessioned2023-12-14T07:04:53Z
dc.date.available2023-12-14T07:04:53Z
dc.date.issued2022-06-20
dc.description.abstractBreast cancer is the most common cancer among women in Africa. These facts have led researchers to continue studying how to treat and detect breast cancer in women, especially older women, who are at higher risk. Achieving satisfactory cancer classification accuracy with the complete set of genes remains a great challenge (most especially with microarray datasets), due to the high dimensions, small sample size, and presence of noise in gene expression data. Feature reduction is critical and sensitive to the classification task. One of the major drawbacks of cancer studies is recognizing informative genes (features) among the thousands of others in the dataset. A large number of features (genes) against a small sample size and redundancy in expressed data are the main two reasons that lead to poor classification accuracy in machine learning and data mining processes. Therefore, dimensionality reduction is an exciting research area in the fields of pattern recognition, machine learning, data mining, and statistics. The purpose of dimensionality reduction is to improve classification performance through the removal of redundant or irrelevant features. Furthermore, feature selection is typically useful in reducing computation time and memory complexity, which have always been challenges in big data tasks. Besides, the high complexity of the memory space or time as a result of high dimension, noise effect, and outliers but it also has adverse impacts on the performance of the algorithms This paper tends to improve the low general accuracy and minimize memory space and execution time in classification models of machine learning algorithms; hence, the system will employ InfoGain for dimensional reduction and the Random Forest algorithm for classification.en_US
dc.identifier.citationAimufua, G.I.O. & et al. (2022) MACHINE LEARNING APPROACH FOR BREAST CANCER CLASSIFICATIONen_US
dc.identifier.urihttps://keffi.nsuk.edu.ng/handle/20.500.14448/5573
dc.language.isoenen_US
dc.publisherDepartment of Computer Science, Nasarawa State University Keffien_US
dc.subjectBreast cancer, random forest algorithm, machine learning, database, information gain algorithm, confusion matrix, feature selection, filtering technique.en_US
dc.titleMACHINE LEARNING APPROACH FOR BREAST CANCER CLASSIFICATIONen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
28. June (2022).pdf
Size:
2.08 MB
Format:
Adobe Portable Document Format
Description:
Article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description:

Collections