Catalogue en ligne

Deep Belief Networks Applied to Alzheimer’s Disease Detection and Classification using Neuroimaging Data / Yacine Deradra

Public

ISBD

Titre : Deep Belief Networks Applied to Alzheimer’s Disease Detection and Classification using Neuroimaging Data
Type de document : texte imprimé
Auteurs : Yacine Deradra ; Yahia Benameur ; Abdelouahab Moussaoui, Directeur de thèse
Editeur : Setif:UFA
Année de publication : 2023
Importance : 1 vol (83 f.)
Format : 29 cm
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Alzheimer’s disease deep learning convolutional neural networks CNN Restricted Boltzmann machines RBMs pretrained models Inception Xception MobileNet,
DenseNet VGG-16 VGG-19 EfficientNet B7 and ResNET magnetic resonance imaging (MRI) early detection classification Vision Transformer ViT maladie d’Alzheimer apprentissage profond réseaux neuronaux convolutionnels machines de Boltzmann restreintes modèles pré-entraînés MobileNet DenseNet imagerie par résonance magnétique (IRM) détection précoce
Index. décimale : 004 Informatique
Résumé : Accurately diagnosing early Alzheimer’s disease (AD) presents significant challenges
for neuroscientists and medical professionals. Current clinical, neuropsychological,
and imaging assessments have limitations, often resulting in inconclusive diagnoses for
patients with early dementia. Magnetic Resonance Imaging (MRI) combined with deep learning
techniques has emerged as a promising alternative for detecting and characterizing Alzheimer’s
disease.
Our approach involves fine-tuning and integrating deep learning models pretrained on large
datasets, including Restricted Boltzmann Machines (RBMs), convolutional neural network (CNN)
architectures both with and without attention mechanisms. Inception, Xception, MobileNet,
DenseNet, VGG-16, VGG-19, EfficientNet B7, and ResNET. Additionally, we explore the potential
of applying Vision Transformer (ViT) models powered by Keras (ViT B32).
By capitalizing on the strengths of these models and techniques, our proposed approach achieves
an accuracy of up to 94.06% for the detection and classification of Alzheimer’s disease = Le diagnostic précis de la maladie d’Alzheimer précoce présente d’importants défis pour
les neuroscientifiques et les professionnels de la santé. Les évaluations cliniques, neuropsychologiques
et d’imagerie actuelles ont des limites, conduisant souvent à des
diagnostics inconclusifs pour les patients atteints de démence précoce. L’imagerie par résonance
magnétique (IRM), combinée à des techniques d’apprentissage profond, s’est révélée être une
alternative prometteuse pour détecter et caractériser la maladie d’Alzheimer.
Notre approche consiste à affiner et à intégrer des modèles d’apprentissage profond pré-entraînés
sur de grands ensembles de données, comprenant des machines de Boltzmann restreintes (RBM),
des architectures de réseaux neuronaux convolutionnels (CNN) avec ou sans mécanismes d’attention.
Nous utilisons Inception, Xception, MobileNet, DenseNet, VGG-16, VGG-19, EfficientNet B7
et ResNet. De plus, nous explorons le potentiel d’application des modèles Vision Transformer
(ViT) alimentés par Keras (ViT B32).
En tirant parti des forces de ces modèles et techniques, notre approche proposée atteint une
précision impressionnante allant jusqu’à 94,06 % pour la détection et la classification de la
maladie d’Alzheimer précoce.
Côte titre : MAI/0758
En ligne : https://drive.google.com/file/d/1lf42lgwF-sGiN3V5qlxA5Tcl-7Y3L4Ap/view?usp=drive [...]
Format de la ressource électronique : pdf

Deep Belief Networks Applied to Alzheimer’s Disease Detection and Classification using Neuroimaging Data [texte imprimé] / Yacine Deradra ; Yahia Benameur ; Abdelouahab Moussaoui, Directeur de thèse . - [S.l.] : Setif:UFA, 2023 . - 1 vol (83 f.) ; 29 cm.
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Alzheimer’s disease deep learning convolutional neural networks CNN Restricted Boltzmann machines RBMs pretrained models Inception Xception MobileNet,
DenseNet VGG-16 VGG-19 EfficientNet B7 and ResNET magnetic resonance imaging (MRI) early detection classification Vision Transformer ViT maladie d’Alzheimer apprentissage profond réseaux neuronaux convolutionnels machines de Boltzmann restreintes modèles pré-entraînés MobileNet DenseNet imagerie par résonance magnétique (IRM) détection précoce
Index. décimale : 004 Informatique
Résumé : Accurately diagnosing early Alzheimer’s disease (AD) presents significant challenges
for neuroscientists and medical professionals. Current clinical, neuropsychological,
and imaging assessments have limitations, often resulting in inconclusive diagnoses for
patients with early dementia. Magnetic Resonance Imaging (MRI) combined with deep learning
techniques has emerged as a promising alternative for detecting and characterizing Alzheimer’s
disease.
Our approach involves fine-tuning and integrating deep learning models pretrained on large
datasets, including Restricted Boltzmann Machines (RBMs), convolutional neural network (CNN)
architectures both with and without attention mechanisms. Inception, Xception, MobileNet,
DenseNet, VGG-16, VGG-19, EfficientNet B7, and ResNET. Additionally, we explore the potential
of applying Vision Transformer (ViT) models powered by Keras (ViT B32).
By capitalizing on the strengths of these models and techniques, our proposed approach achieves
an accuracy of up to 94.06% for the detection and classification of Alzheimer’s disease = Le diagnostic précis de la maladie d’Alzheimer précoce présente d’importants défis pour
les neuroscientifiques et les professionnels de la santé. Les évaluations cliniques, neuropsychologiques
et d’imagerie actuelles ont des limites, conduisant souvent à des
diagnostics inconclusifs pour les patients atteints de démence précoce. L’imagerie par résonance
magnétique (IRM), combinée à des techniques d’apprentissage profond, s’est révélée être une
alternative prometteuse pour détecter et caractériser la maladie d’Alzheimer.
Notre approche consiste à affiner et à intégrer des modèles d’apprentissage profond pré-entraînés
sur de grands ensembles de données, comprenant des machines de Boltzmann restreintes (RBM),
des architectures de réseaux neuronaux convolutionnels (CNN) avec ou sans mécanismes d’attention.
Nous utilisons Inception, Xception, MobileNet, DenseNet, VGG-16, VGG-19, EfficientNet B7
et ResNet. De plus, nous explorons le potentiel d’application des modèles Vision Transformer
(ViT) alimentés par Keras (ViT B32).
En tirant parti des forces de ces modèles et techniques, notre approche proposée atteint une
précision impressionnante allant jusqu’à 94,06 % pour la détection et la classification de la
maladie d’Alzheimer précoce.
Côte titre : MAI/0758
En ligne : https://drive.google.com/file/d/1lf42lgwF-sGiN3V5qlxA5Tcl-7Y3L4Ap/view?usp=drive [...]
Format de la ressource électronique : pdf

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0758 MAI/0758 Mémoire Bibliothéque des sciences Anglais Disponible
Disponible

Deep-Convolution-Neural-Network-and-Autoencoders- Based-Unsupervised-Feature-Learning-of-EEG / Krachni,Rayane

Public

ISBD

Titre : Deep-Convolution-Neural-Network-and-Autoencoders- Based-Unsupervised-Feature-Learning-of-EEG
Type de document : texte imprimé
Auteurs : Krachni,Rayane, Auteur ; Moussaoui ,Abdelouahab, Directeur de thèse
Editeur : Setif:UFA
Année de publication : 2021
Importance : 1 vol (87 f .)
Format : 29 cm
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 - Informatique
Côte titre : MAI/0549
En ligne : https://drive.google.com/file/d/1CS0M3WHfnGjvOJIbi7LwitLIM-mgzuQs/view?usp=shari [...]
Format de la ressource électronique : pdf

Deep-Convolution-Neural-Network-and-Autoencoders- Based-Unsupervised-Feature-Learning-of-EEG [texte imprimé] / Krachni,Rayane, Auteur ; Moussaoui ,Abdelouahab, Directeur de thèse . - [S.l.] : Setif:UFA, 2021 . - 1 vol (87 f .) ; 29 cm.
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 - Informatique
Côte titre : MAI/0549
En ligne : https://drive.google.com/file/d/1CS0M3WHfnGjvOJIbi7LwitLIM-mgzuQs/view?usp=shari [...]
Format de la ressource électronique : pdf

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0549 MAI/0549 Mémoire Bibliothéque des sciences Français Disponible
Disponible

Deep Convolutional Network Model for Chest X-ray Abnormalities Detection / Zemmit, Maroua

Public

ISBD

Titre : Deep Convolutional Network Model for Chest X-ray Abnormalities Detection
Type de document : texte imprimé
Auteurs : Zemmit, Maroua, Auteur ; Nasri,Khaled, Directeur de thèse
Editeur : Setif:UFA
Année de publication : 2021
Importance : 1 vol (47 f .)
Format : 29 cm
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : CNN
VGG19
VGG16
Resnet50v2
Index. décimale : 004 - Informatique
Résumé :
Les radiographies thoraciques (CXR) sont largement utilisées pour le diagnostic
d’anomalies dans la région du coeur et des poumons. Automatiquement
détecter ces anomalies avec une grande précision pourrait améliorer considérablement
les processus de diagnostic et puis de bien prendre en charge ces cas.
Avec les progrès des algorithmes informatiques, la détection de ce type des anomalies
à un stade précoce est nécessaire de toute urgence pour le rétablissement
rapide des patients. L’ensemble de données utilisé dans cette étude COVID19
radiographys dataset a été collecté auprès de différentes sources et il est disponible
pour les chercheurs. Le nombre d’images dans l’ensemble de données collectées
est de 3616 cas positifs au COVID-19, 10 192 images normales ainsi que
6012 images d’opacité pulmonaire (infection pulmonaire non COVID) et 1345
images de pneumonie virale. Pour éprouver notre architecture very deep learning,
fondée sur 5 blocs, nous avons choisis trois architectures very deep qui
sont : vgg16,vgg19, resnet50v2. Ces modèles ont été sélectionnés parce qu’ils obtiennent
des résultats élevés dans la base de données imagenet.
Résultats : Dans l’ensemble de données COVID19 radiographys, notre modèle atteint
une réussite(accuracy) de 92,9% et une perte de 20%.
Conclusions : Notre CNN proposé obtient des résultats améliorés par rapport aux
architectures utilisées sur notre base de données ce qui prouve la qualité de notre
modèle.
Côte titre : MAI/0550
En ligne : https://drive.google.com/file/d/1ZT9kGy9j0-NhQTQmV0jwrNCrdFK5DG5T/view?usp=shari [...]
Format de la ressource électronique : pdf

Deep Convolutional Network Model for Chest X-ray Abnormalities Detection [texte imprimé] / Zemmit, Maroua, Auteur ; Nasri,Khaled, Directeur de thèse . - [S.l.] : Setif:UFA, 2021 . - 1 vol (47 f .) ; 29 cm.
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : CNN
VGG19
VGG16
Resnet50v2
Index. décimale : 004 - Informatique
Résumé :
Les radiographies thoraciques (CXR) sont largement utilisées pour le diagnostic
d’anomalies dans la région du coeur et des poumons. Automatiquement
détecter ces anomalies avec une grande précision pourrait améliorer considérablement
les processus de diagnostic et puis de bien prendre en charge ces cas.
Avec les progrès des algorithmes informatiques, la détection de ce type des anomalies
à un stade précoce est nécessaire de toute urgence pour le rétablissement
rapide des patients. L’ensemble de données utilisé dans cette étude COVID19
radiographys dataset a été collecté auprès de différentes sources et il est disponible
pour les chercheurs. Le nombre d’images dans l’ensemble de données collectées
est de 3616 cas positifs au COVID-19, 10 192 images normales ainsi que
6012 images d’opacité pulmonaire (infection pulmonaire non COVID) et 1345
images de pneumonie virale. Pour éprouver notre architecture very deep learning,
fondée sur 5 blocs, nous avons choisis trois architectures very deep qui
sont : vgg16,vgg19, resnet50v2. Ces modèles ont été sélectionnés parce qu’ils obtiennent
des résultats élevés dans la base de données imagenet.
Résultats : Dans l’ensemble de données COVID19 radiographys, notre modèle atteint
une réussite(accuracy) de 92,9% et une perte de 20%.
Conclusions : Notre CNN proposé obtient des résultats améliorés par rapport aux
architectures utilisées sur notre base de données ce qui prouve la qualité de notre
modèle.
Côte titre : MAI/0550
En ligne : https://drive.google.com/file/d/1ZT9kGy9j0-NhQTQmV0jwrNCrdFK5DG5T/view?usp=shari [...]
Format de la ressource électronique : pdf

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0550 MAI/0550 Mémoire Bibliothéque des sciences Anglais Disponible
Disponible

Deep Feature Learning (Extraction and Generation) Using a Bidirectional LSTM-CNN and Deep Generative Models Applied to Physiological Signals (EEG/ECG) Classification / Hichem Betiche

Public

ISBD

Titre : Deep Feature Learning (Extraction and Generation) Using a Bidirectional LSTM-CNN and Deep Generative Models Applied to Physiological Signals (EEG/ECG) Classification
Type de document : texte imprimé
Auteurs : Hichem Betiche, Auteur ; Youcef Oualid Zemame, Auteur ; Abdelouahab Moussaoui, Directeur de thèse
Année de publication : 2022
Importance : 1 vol (60 f .)
Format : 29cm
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 Informatique
Résumé :
In this thesis, we want to propose some classification models based on deep learning
that will be able to classify and extract features from EEG/ECG signals with high accuracy .
Electroencephalogram or EEG is related to the brain and electrocardiogram or ECG is related
to the heart. EEG is the equipment used for measuring electrical activities of the brain.
On the other hand, ECG is used for measuring activities of heart.
We have tried many models which are tested and verified using the public datasets : mitbih
arrhythmia and EEG in schizophrenia ,We have used Variational autoencoders to solve the
unbalance of MIT dataset then a comparative study was made between different deep learning
models and the result showed that the deep approach with Bi-lstm cnn gave the best test accuracy
where we obtained an accuracy up to 98% for ECG dataset and 84% for EEG dataset
Côte titre : MAI/0593
En ligne : https://drive.google.com/file/d/1CWN7ejvxw2JSsLBrnr-hBmkqu7OE2-Jj/view?usp=share [...]
Format de la ressource électronique : pdf

Deep Feature Learning (Extraction and Generation) Using a Bidirectional LSTM-CNN and Deep Generative Models Applied to Physiological Signals (EEG/ECG) Classification [texte imprimé] / Hichem Betiche, Auteur ; Youcef Oualid Zemame, Auteur ; Abdelouahab Moussaoui, Directeur de thèse . - 2022 . - 1 vol (60 f .) ; 29cm.
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 Informatique
Résumé :
In this thesis, we want to propose some classification models based on deep learning
that will be able to classify and extract features from EEG/ECG signals with high accuracy .
Electroencephalogram or EEG is related to the brain and electrocardiogram or ECG is related
to the heart. EEG is the equipment used for measuring electrical activities of the brain.
On the other hand, ECG is used for measuring activities of heart.
We have tried many models which are tested and verified using the public datasets : mitbih
arrhythmia and EEG in schizophrenia ,We have used Variational autoencoders to solve the
unbalance of MIT dataset then a comparative study was made between different deep learning
models and the result showed that the deep approach with Bi-lstm cnn gave the best test accuracy
where we obtained an accuracy up to 98% for ECG dataset and 84% for EEG dataset
Côte titre : MAI/0593
En ligne : https://drive.google.com/file/d/1CWN7ejvxw2JSsLBrnr-hBmkqu7OE2-Jj/view?usp=share [...]
Format de la ressource électronique : pdf

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0593 MAI/0593 Mémoire Bibliothéque des sciences Anglais Disponible
Sorti jusqu'au 06/04/2025

Deep Generative Models in Omics Data Augmentation for Disease Classification / Boutheyna Khenouche

Public

ISBD

Titre : Deep Generative Models in Omics Data Augmentation for Disease Classification
Type de document : texte imprimé
Auteurs : Boutheyna Khenouche, Auteur ; Imene Madoui ; Abderrahim Lakehal, Directeur de thèse
Editeur : Setif:UFA
Année de publication : 2024
Importance : 1 vol (115 f .)
Format : 29 cm
Langues : Anglais (eng)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 - Informatique
Résumé : The intersection of bioinformatics and machine learning has opened new possibilities
in cancer research, particularly in the analysis and classification of omics
data, which is often which are frequently limited and diverse. This thesis explores
the application of deep generative models, specifically Generative Adversarial Networks
(GANs), to augment omics data, including both gene and protein expression
data, for improved disease classification. Cancer, being one of the most prevalent
and complex diseases, presents significant challenges that require advanced analytical
approaches. Traditional machine learning methods often struggle with highdimensional
and limited datasets, whereas deep learning offers a promising alternative
due to its ability to learn intricate patterns. Our approach involves collecting
and preprocessing gene expression data from 11,070 samples and protein expression
data from 7,790 samples, generating synthetic data through GANs , and training
classifiers like Convolutional Neural Networks (CNNs) and Multi-Layer Perceptrons
(MLPs) on the augmented datasets for the prediction of overall survival (OS) and
progression-free interval (PFI). We validate the data generated by our GAN through
biological classification of genes and proteins, and by comprehensive data visualization
techniques.
Note de contenu : Sommaire
1 Bioinformatics , Machine Learning and Deep Learning 10
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.2 Bioinformatics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.1 Origin of Bioinformatics . . . . . . . . . . . . . . . . . . . . . 11
1.2.2 Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.2.3 Bioinformatics Subfields . . . . . . . . . . . . . . . . . . . . . 12
1.2.4 Bioinformatics data types and Databases . . . . . . . . . . . . 13
1.2.5 Bioinfromatics Application Domain . . . . . . . . . . . . . . . 15
1.2.6 Benefits of Bioinformatics in Healthcare . . . . . . . . . . . . 16
1.3 Machine Learning: . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3.1 Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3.2 Machine learning techniques . . . . . . . . . . . . . . . . . . 17
1.4 Deep Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.4.1 definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.4.2 Neural Network in Deep Learning . . . . . . . . . . . . . . . . 20
1.5 Generative Models in machine Learning and deep learning . . . . . . 22
1.5.1 Generative Adversarial Networks (GANs) . . . . . . . . . . . . 22
1.5.2 Variational AutoEncoders (VAEs) . . . . . . . . . . . . . . . . 25
1.6 Machine Learning Vs Deep Learning . . . . . . . . . . . . . . . . . . 27
1.7 Application Domain of ML and DL in Health Care . . . . . . . . . . 29
1.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2 Disease classification and data augmentation: Literature review 31
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.2 Omics data, Machine learning and medical research . . . . . . . . . . 32
2.2.1 Omics data: Definition and types . . . . . . . . . . . . . . . . 32
2.2.2 Overview on omics data and machine learning . . . . . . . . . 32
2.3 Related work in Generative model for data augmentation . . . . . . . 33
2.4 Generative models for spatial data augmentation . . . . . . . . . . . 34
2.5 Generative model for omics data augmentation . . . . . . . . . . . . . 41
2.6 Generative adversarial network for omics data augmentation in cancer
classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3 Deep Generative models for omics data augmentation: Cancer case
study 46
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
3.2 Analyzing Gene and Protein Data: From Collection to Insight . . . . 47
3.3 Proposed Architecture for Data Augmentation and Classification . . 47
3.4 Data Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.5 Data Preprocessing . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.5.1 Gene Expression data . . . . . . . . . . . . . . . . . . . . . . 50
3.5.2 Protein Expression Data Pre-processing . . . . . . . . . . . . . 53
3.6 Dimensionality Reduction for Gene Expression Data: . . . . . . . . . 54
3.7 Generative adversarial network for data augmentation . . . . . . . . 56
3.7.1 Data preparation . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.7.2 Model elaboration . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.7.3 Training and evaluation . . . . . . . . . . . . . . . . . . . . . 57
3.8 Convolutional Neural Networks. . . . . . . . . . . . . . . . . . . . . . 58
3.8.1 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.8.2 Model Definition and Building . . . . . . . . . . . . . . . . . 58
3.8.3 Model Training and Evaluation . . . . . . . . . . . . . . . . . 58
3.9 Biological Classification of Genes and Proteins . . . . . . . . . . . . . 59
3.9.1 Objective and Methodology . . . . . . . . . . . . . . . . . . . 59
3.10 Multi-Layer Perceptron Implementation . . . . . . . . . . . . . . . . 59
3.10.1 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.10.2 Data Splitting . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.10.3 Model Building and Predictions for OS . . . . . . . . . . . . . 59
3.10.4 Model Building and Predictions for PFI . . . . . . . . . . . . 59
3.10.5 Synthetic Data . . . . . . . . . . . . . . . . . . . . . . . . . . 60
3.11 conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
4 Implementation and evaluation 61
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
4.2 Experimental tools and Packages . . . . . . . . . . . . . . . . . . . . 62
4.2.1 PYTHON . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.3 Evaluation Metrics for Classification Models . . . . . . . . . . . . . . 64
4.3.1 Precision . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.2 Recall . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.3 Accuracy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.4 F1-Score . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.5 Matthews Correlation Coefficient (MCC) . . . . . . . . . . . 65
4.4 Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.5 Implementation and Dataset Evaluation . . . . . . . . . . . . . . . . 66
4.5.1 Pre-processing:Gene Expression . . . . . . . . . . . . . . 66
4.5.2 Pre-processing :Protein Expression . . . . . . . . . . . . 67
4.5.3 Dimensionality Reduction: Gene Expression . . . . . . 68
4.5.4 Implement Generative Adversarial Networks (GANs) . . . . . 70
4.5.5 Implement Convolutional Neural Networks . . . . . . . . . . . 75
4.6 Implement Multi-Layer Perceptron . . . . . . . . . . . . . . . . . . . 79
4.7 Biological Classification of Genes and Proteins . . . . . . . . . . . . . 82
4.7.1 Tools and Techniques . . . . . . . . . . . . . . . . . . . . . . . 82
4.8 MLP Model for Distinguishing Upregulated and Downregulated Genes/Proteins 83
4.9 Evaluating Prediction Accuracy and Updating Gene and Protein Expression
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
4.10 Data Visualisation and Interpretation . . . . . . . . . . . . . . . . . . 86
4.12 Comparison of CNN and MLP Performance on Augmented Gene Expression
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
4.12.1 Gene Expression Data . . . . . . . . . . . . . . . . . . . . . 90
4.12.2 Protein Expression Data . . . . . . . . . . . . . . . . . . . . . 91
4.13 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
.1 First Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
.1.1 Pre-processing:Gene Expression . . . . . . . . . . . . . . 95
.1.2 Pre-processing :Protein Expression . . . . . . . . . . . . 100
.2 Second Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
.2.1 Implement Generative Adversarial Networks (GANs) . . . . . 105
.3 Third Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
.3.1 Implementing Predictions and Labeling for Generated Data . 106
Côte titre : MAI/0926

Deep Generative Models in Omics Data Augmentation for Disease Classification [texte imprimé] / Boutheyna Khenouche, Auteur ; Imene Madoui ; Abderrahim Lakehal, Directeur de thèse . - [S.l.] : Setif:UFA, 2024 . - 1 vol (115 f .) ; 29 cm.
Langues : Anglais (eng)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : Informatique
Index. décimale : 004 - Informatique
Résumé : The intersection of bioinformatics and machine learning has opened new possibilities
in cancer research, particularly in the analysis and classification of omics
data, which is often which are frequently limited and diverse. This thesis explores
the application of deep generative models, specifically Generative Adversarial Networks
(GANs), to augment omics data, including both gene and protein expression
data, for improved disease classification. Cancer, being one of the most prevalent
and complex diseases, presents significant challenges that require advanced analytical
approaches. Traditional machine learning methods often struggle with highdimensional
and limited datasets, whereas deep learning offers a promising alternative
due to its ability to learn intricate patterns. Our approach involves collecting
and preprocessing gene expression data from 11,070 samples and protein expression
data from 7,790 samples, generating synthetic data through GANs , and training
classifiers like Convolutional Neural Networks (CNNs) and Multi-Layer Perceptrons
(MLPs) on the augmented datasets for the prediction of overall survival (OS) and
progression-free interval (PFI). We validate the data generated by our GAN through
biological classification of genes and proteins, and by comprehensive data visualization
techniques.
Note de contenu : Sommaire
1 Bioinformatics , Machine Learning and Deep Learning 10
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.2 Bioinformatics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.1 Origin of Bioinformatics . . . . . . . . . . . . . . . . . . . . . 11
1.2.2 Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.2.3 Bioinformatics Subfields . . . . . . . . . . . . . . . . . . . . . 12
1.2.4 Bioinformatics data types and Databases . . . . . . . . . . . . 13
1.2.5 Bioinfromatics Application Domain . . . . . . . . . . . . . . . 15
1.2.6 Benefits of Bioinformatics in Healthcare . . . . . . . . . . . . 16
1.3 Machine Learning: . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3.1 Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3.2 Machine learning techniques . . . . . . . . . . . . . . . . . . 17
1.4 Deep Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.4.1 definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.4.2 Neural Network in Deep Learning . . . . . . . . . . . . . . . . 20
1.5 Generative Models in machine Learning and deep learning . . . . . . 22
1.5.1 Generative Adversarial Networks (GANs) . . . . . . . . . . . . 22
1.5.2 Variational AutoEncoders (VAEs) . . . . . . . . . . . . . . . . 25
1.6 Machine Learning Vs Deep Learning . . . . . . . . . . . . . . . . . . 27
1.7 Application Domain of ML and DL in Health Care . . . . . . . . . . 29
1.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2 Disease classification and data augmentation: Literature review 31
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.2 Omics data, Machine learning and medical research . . . . . . . . . . 32
2.2.1 Omics data: Definition and types . . . . . . . . . . . . . . . . 32
2.2.2 Overview on omics data and machine learning . . . . . . . . . 32
2.3 Related work in Generative model for data augmentation . . . . . . . 33
2.4 Generative models for spatial data augmentation . . . . . . . . . . . 34
2.5 Generative model for omics data augmentation . . . . . . . . . . . . . 41
2.6 Generative adversarial network for omics data augmentation in cancer
classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3 Deep Generative models for omics data augmentation: Cancer case
study 46
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
3.2 Analyzing Gene and Protein Data: From Collection to Insight . . . . 47
3.3 Proposed Architecture for Data Augmentation and Classification . . 47
3.4 Data Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.5 Data Preprocessing . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.5.1 Gene Expression data . . . . . . . . . . . . . . . . . . . . . . 50
3.5.2 Protein Expression Data Pre-processing . . . . . . . . . . . . . 53
3.6 Dimensionality Reduction for Gene Expression Data: . . . . . . . . . 54
3.7 Generative adversarial network for data augmentation . . . . . . . . 56
3.7.1 Data preparation . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.7.2 Model elaboration . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.7.3 Training and evaluation . . . . . . . . . . . . . . . . . . . . . 57
3.8 Convolutional Neural Networks. . . . . . . . . . . . . . . . . . . . . . 58
3.8.1 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.8.2 Model Definition and Building . . . . . . . . . . . . . . . . . 58
3.8.3 Model Training and Evaluation . . . . . . . . . . . . . . . . . 58
3.9 Biological Classification of Genes and Proteins . . . . . . . . . . . . . 59
3.9.1 Objective and Methodology . . . . . . . . . . . . . . . . . . . 59
3.10 Multi-Layer Perceptron Implementation . . . . . . . . . . . . . . . . 59
3.10.1 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.10.2 Data Splitting . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.10.3 Model Building and Predictions for OS . . . . . . . . . . . . . 59
3.10.4 Model Building and Predictions for PFI . . . . . . . . . . . . 59
3.10.5 Synthetic Data . . . . . . . . . . . . . . . . . . . . . . . . . . 60
3.11 conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
4 Implementation and evaluation 61
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
4.2 Experimental tools and Packages . . . . . . . . . . . . . . . . . . . . 62
4.2.1 PYTHON . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.3 Evaluation Metrics for Classification Models . . . . . . . . . . . . . . 64
4.3.1 Precision . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.2 Recall . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.3 Accuracy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.4 F1-Score . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.3.5 Matthews Correlation Coefficient (MCC) . . . . . . . . . . . 65
4.4 Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.5 Implementation and Dataset Evaluation . . . . . . . . . . . . . . . . 66
4.5.1 Pre-processing:Gene Expression . . . . . . . . . . . . . . 66
4.5.2 Pre-processing :Protein Expression . . . . . . . . . . . . 67
4.5.3 Dimensionality Reduction: Gene Expression . . . . . . 68
4.5.4 Implement Generative Adversarial Networks (GANs) . . . . . 70
4.5.5 Implement Convolutional Neural Networks . . . . . . . . . . . 75
4.6 Implement Multi-Layer Perceptron . . . . . . . . . . . . . . . . . . . 79
4.7 Biological Classification of Genes and Proteins . . . . . . . . . . . . . 82
4.7.1 Tools and Techniques . . . . . . . . . . . . . . . . . . . . . . . 82
4.8 MLP Model for Distinguishing Upregulated and Downregulated Genes/Proteins 83
4.9 Evaluating Prediction Accuracy and Updating Gene and Protein Expression
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
4.10 Data Visualisation and Interpretation . . . . . . . . . . . . . . . . . . 86
4.12 Comparison of CNN and MLP Performance on Augmented Gene Expression
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
4.12.1 Gene Expression Data . . . . . . . . . . . . . . . . . . . . . 90
4.12.2 Protein Expression Data . . . . . . . . . . . . . . . . . . . . . 91
4.13 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
.1 First Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
.1.1 Pre-processing:Gene Expression . . . . . . . . . . . . . . 95
.1.2 Pre-processing :Protein Expression . . . . . . . . . . . . 100
.2 Second Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
.2.1 Implement Generative Adversarial Networks (GANs) . . . . . 105
.3 Third Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
.3.1 Implementing Predictions and Labeling for Generated Data . 106
Côte titre : MAI/0926

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0926 MAI/0926 Mémoire Bibliothéque des sciences Anglais Disponible
Disponible

Deep Learning Algorithms for Colon Cancer Detection : A Comparative Study with Traditional machine Learning Methods / Ilhem Nabti

Permalink
Deep Learning Approach for Handwritten Arabic Recognition Characters / Nasri,Khaled

Permalink
A Deep Learning Approach for recommender systems of pedagogical resources / GhedjatIi,Oussama

Permalink
A Deep Learning Approach to Detecting Offensive Tweets / feriel Lafi

Permalink
Deep Learning based method For features extraction of Gene Expression Data / Sarah Soudane

Permalink
Deep Learning based method for features selection of Gene Expression Data / Kerkour ,Ibtissem

Permalink
Deep Learning Based Protein Function Prediction / Almas Djabar

Permalink
Deep Learning based Recommendation Systems for Big Data / Boughazi ,Chaima

Permalink
Deep Learning Based Sentiment Analysis of Movie Reviews / Akram Djidel

Permalink
Deep Learning for Age, Gender and Race Identification based on Face Recognition / Hamdi ,Skander

Permalink