Catalogue en ligne

University Sétif 1 FERHAT ABBAS Faculty of Sciences

Nouvelle recherche

Détail de l'auteur

Auteur Assala Bennour

Documents disponibles écrits par cet auteur

Ajouter le résultat dans votre panier Affiner la recherche

A Deep Learning Approach to Detecting Offensive Tweets / feriel Lafi

Public

ISBD

Titre : A Deep Learning Approach to Detecting Offensive Tweets
Type de document : texte imprimé
Auteurs : feriel Lafi, Auteur ; Assala Bennour, Auteur ; Lakhfif, Abdelaziz, Directeur de thèse
Année de publication : 2023
Importance : 1 vol (63 f .)
Format : 29cm
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : detecting offensive language
Natural Language Processing
Index. décimale : 004 Informatique
Résumé : Social media platforms, such as Twitter, have become integral to our daily lives, providing a wide range of
services. However, these platforms also expose users to the risk of cyberbullying, which can result in mental
and emotional abuse.
Pre-trained transformers have become the standard models in natural language processing due to their outstanding
performance across various tasks and languages. However, the majority of these models have been
trained in languages that already possess abundant text resources, such as English, French, and others. As
a result, there is a need for increased focus on low-resource languages that have limited text data, such as
Arabic, particularly the Algerian dialect.
In this thesis, our primary focus is on developping an automated solution for detecting offensive language
in Arabic tweets. We used deep learning algorithms, including MarBERT, DziriBert, and Arabert transformers.
To train and evaluate the models, we utilized the OSACT 2022 and DziriOFN datasets. Through
experiments, we explore the impact of emojis on model performance and the need for additional training on
the Algerian dialect.
Our findings demonstrate that emojis provide valuable contextual information, improving the models understanding
and classification capabilities. TheMarBERT v2 model achieves an impressive F1 score of 85.1%,
while DziriBert excels in detecting offensive language in the Algerian dialect with an F1 score of 87%. These
results emphasize the importance of dialect-specific training.
Côte titre : MAI/0706
En ligne : https://drive.google.com/file/d/1y5pgvD9jTrDs-bysqsBvcmKQScebxK74/view?usp=drive [...]
Format de la ressource électronique : pdf

A Deep Learning Approach to Detecting Offensive Tweets [texte imprimé] / feriel Lafi, Auteur ; Assala Bennour, Auteur ; Lakhfif, Abdelaziz, Directeur de thèse . - 2023 . - 1 vol (63 f .) ; 29cm.
Langues : Français (fre)
Catégories : Thèses & Mémoires:Informatique

Mots-clés : detecting offensive language
Natural Language Processing
Index. décimale : 004 Informatique
Résumé : Social media platforms, such as Twitter, have become integral to our daily lives, providing a wide range of
services. However, these platforms also expose users to the risk of cyberbullying, which can result in mental
and emotional abuse.
Pre-trained transformers have become the standard models in natural language processing due to their outstanding
performance across various tasks and languages. However, the majority of these models have been
trained in languages that already possess abundant text resources, such as English, French, and others. As
a result, there is a need for increased focus on low-resource languages that have limited text data, such as
Arabic, particularly the Algerian dialect.
In this thesis, our primary focus is on developping an automated solution for detecting offensive language
in Arabic tweets. We used deep learning algorithms, including MarBERT, DziriBert, and Arabert transformers.
To train and evaluate the models, we utilized the OSACT 2022 and DziriOFN datasets. Through
experiments, we explore the impact of emojis on model performance and the need for additional training on
the Algerian dialect.
Our findings demonstrate that emojis provide valuable contextual information, improving the models understanding
and classification capabilities. TheMarBERT v2 model achieves an impressive F1 score of 85.1%,
while DziriBert excels in detecting offensive language in the Algerian dialect with an F1 score of 87%. These
results emphasize the importance of dialect-specific training.
Côte titre : MAI/0706
En ligne : https://drive.google.com/file/d/1y5pgvD9jTrDs-bysqsBvcmKQScebxK74/view?usp=drive [...]
Format de la ressource électronique : pdf

Exemplaires (1)

Code-barres Cote Support Localisation Section Disponibilité
MAI/0706 MAI/0706 Mémoire Bibliothéque des sciences Anglais Disponible
Disponible