Towards an enhanced understanding of bias in pre-trained neural language models: a survey with special emphasis on affective bias

Anoop Kadan*, Manjary P. Gangan, P. Deepak, V. L. Lajish

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The remarkable progress in Natural Language Processing (NLP) brought about by deep learning, particularly with the recent advent of large pre-trained neural language models, is brought into scrutiny as several studies began to discuss and report potential biases in NLP applications. Bias in NLP is found to originate from latent historical biases encoded by humans into textual data which gets perpetuated or even amplified by NLP algorithm. We present a survey to comprehend bias in large pre-trained language models and analyze the stages at which they occur in these models, and various ways in which these biases could be quantified and mitigated. Considering wide applicability of textual affective computing-based downstream tasks in real-world systems such as business, health care, and education, we give a special emphasis on investigating bias in the context of affect (emotion) i.e., Affective Bias, in large pre-trained language models. We present a summary of various bias evaluation corpora that help to aid future research and discuss challenges in the research on bias in pre-trained language models. We believe that our attempt to draw a comprehensive view of bias in pre-trained language models, and especially the exploration of affective bias will be highly beneficial to researchers interested in this evolving field.
Original languageEnglish
Title of host publication7th International Conference on Data Science and Engineering
EditorsJimson Mathew, G. Santhosh Kumar, P. Deepak, Joemon M. Jose
PublisherSpringer Singapore
Pages13-45
ISBN (Electronic)9789811944536
ISBN (Print)9789811944529
DOIs
Publication statusPublished - 15 Nov 2022
Externally publishedYes
Event7th International Conference on Data Science and Engineering: ICDSE 2021 - Patna, India
Duration: 17 Dec 202118 Dec 2021
https://www.iitp.ac.in/~icdse2021/index.php

Publication series

NameLecture Notes in Electrical Engineering
PublisherSpringer Nature
Volume940
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference7th International Conference on Data Science and Engineering
Country/TerritoryIndia
CityPatna
Period17/12/2118/12/21
Internet address

Fingerprint

Dive into the research topics of 'Towards an enhanced understanding of bias in pre-trained neural language models: a survey with special emphasis on affective bias'. Together they form a unique fingerprint.

Cite this