Search In this Thesis
   Search In this Thesis  
العنوان
Mining of biomedical big data /
المؤلف
Hassib, Eslam Mohsin Mahmoud.
هيئة الاعداد
باحث / إسلام محسن محمود حسيب
مشرف / علي إبراهيم الدسوقي
مشرف / لبيب محمد لبيب
مناقش / مفرح محمد سالم
مناقش / عبدالناصر حسين رياض
الموضوع
Computers Engineering. Data Mining. Health Policy. Legislation, Medical.
تاريخ النشر
2021.
عدد الصفحات
online resource (122 pages) :
اللغة
الإنجليزية
الدرجة
الدكتوراه
التخصص
هندسة النظم والتحكم
تاريخ الإجازة
1/1/2021
مكان الإجازة
جامعة المنصورة - كلية الهندسة - قسم هندسة الحاسبات والنظم
الفهرس
Only 14 pages are availabe for public view

from 118

from 118

Abstract

Big Data analysis play a very important role in many real-world applications, especially, biomedical data sets. Unfortunately, the majority of these data are imbalanced because, there is a small number of instances in one class (minority class) compared to the other class (majority class). The standard classifiers are not capable of handling such data sets. Hence, a new technique for dealing with such data sets is required. For solving local optima problem in high dimensional data sets, this thesis proposes an imbalanced biomedical big data deep learning framework (WOA+BRNN) that consists of three developed phases. The first phase is the feature selection phase, which uses the WOA (Whale optimization algorithm) for finding the best set of features. The second phase is the preprocessing phase, which uses the SMOTE algorithm and the LSH-SMOTE algorithm for solving the class imbalance problem. Lastly, the third phase is WOA+BRNN framework, which is using the Whale optimization algorithm for training a deep learning approach called Bidirectional Recurrent Neural Network (BRNN). The proposed framework WOA+BRNN has been tested against 9 highly imbalanced data sets one of them is big data set in terms of AUC (Area Under Curve) against four of the most common use machine-learning algorithms (Naïve Bayes, AdaBoostM1, Decision Table, Random Tree), in addition to GWO-MLP (Training Multi-Layer Perceptron using Greywolf optimizer), then we test our algorithm over 4 well-known data sets against GWO-MLP and Particle Swarm Optimization (PSO-MLP), Genetic Algorithm (GA-MLP), Ant Colony Optimization (ACO-MLP), Evolution Strategy (ES-MLP), and Population-based Incremental Learning (PBIL-MLP) in terms of classification accuracy. Experimental results proved that our proposed algorithm WOA+BRNN has achieve promising accuracy and high local optima avoidance, and outperformed four of the most common use machine-learning algorithms, and GWO-MLP in terms of AUC (Area Under Curve).