"Thy algorithm shalt not bear false witness": An Evaluation of Multiclass Debiasing Methods on Word Embeddings

Thalea Schlender; Gerasimos Spanakis

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings, which are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings, and the need for their removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating religious bias removal on three widely used word embeddings, namely Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is Conceptor debiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets, respectively.
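
To make one of the metrics named in the abstract concrete, the following is a minimal sketch of the standard two-target WEAT effect size, plus a percentage-reduction helper. It is not the authors' implementation and does not reproduce their multiclass setup: the function names, the toy 2-D vectors and the `relative_reduction` helper are assumptions for illustration, and the use of the sample standard deviation is one of several conventions in use.

```python
import math
from statistics import mean, stdev


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def association(w, attrs_a, attrs_b):
    """s(w, A, B): mean similarity of w to attribute set A minus that to B."""
    return mean(cosine(w, a) for a in attrs_a) - mean(cosine(w, b) for b in attrs_b)


def weat_effect_size(targets_x, targets_y, attrs_a, attrs_b):
    """Difference of mean associations of X and Y, scaled by the
    standard deviation of the associations over X and Y combined."""
    s_x = [association(x, attrs_a, attrs_b) for x in targets_x]
    s_y = [association(y, attrs_a, attrs_b) for y in targets_y]
    return (mean(s_x) - mean(s_y)) / stdev(s_x + s_y)


def relative_reduction(before, after):
    """Hypothetical helper: percentage decrease in absolute bias score."""
    return 100.0 * (abs(before) - abs(after)) / abs(before)


# Toy 2-D "embeddings" (made up, not taken from any real embedding set).
targets_x = [[1.0, 0.0], [0.9, 0.1]]   # words of target group X
targets_y = [[0.0, 1.0], [0.1, 0.9]]   # words of target group Y
attrs_a = [[1.0, 0.0]]                 # attribute set A
attrs_b = [[0.0, 1.0]]                 # attribute set B

effect = weat_effect_size(targets_x, targets_y, attrs_a, attrs_b)
print(f"WEAT effect size on toy data: {effect:.3f}")
```

An effect size near zero indicates that the two target groups are about equally associated with both attribute sets; comparing the score before and after debiasing with a helper such as `relative_reduction` gives a percentage decrease of the kind reported in the abstract, although the paper's exact averaging procedure is not specified here.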

Original language	English
Title of host publication	BNAIC/BeneLearn 2020
Editors	Lu Cao, Walter Kosters, Jefrey Lijffijt
Pages	254-268
Number of pages	15
Publication status	Published - Nov 2020
Event	Benelux Conference on Artificial Intelligence and Machine Learning - Online, Leiden University, Leiden, Netherlands; Duration: 19 Nov 2020 → 20 Nov 2020; https://bnaic.liacs.leidenuniv.nl/

Conference

Conference	Benelux Conference on Artificial Intelligence and Machine Learning
Abbreviated title	BNAIC/BeneLearn 2020
Country/Territory	Netherlands
City	Leiden
Period	19/11/20 → 20/11/20
Internet address	https://bnaic.liacs.leidenuniv.nl/

Access to Document

http://bnaic.liacs.leidenuniv.nl/bnaic2020proceedings.pdf

Cite this

@inproceedings{d4b58aa329664958bf19903b7bfe136b,

title = "{"}Thy algorithm shalt not bear false witness{"}: An Evaluation of Multiclass Debiasing Methods on Word Embeddings",

abstract = "With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings, which are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings, and the need for their removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating religious bias removal on three widely used word embeddings, namely Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is Conceptor debiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets, respectively.",

author = "Thalea Schlender and Gerasimos Spanakis",

note = "Publisher Copyright: {\textcopyright} 2020 University of Groningen. All rights reserved.; Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/BeneLearn 2020 ; Conference date: 19-11-2020 Through 20-11-2020",

year = "2020",

month = nov,

language = "English",

pages = "254--268",

editor = "Lu Cao and Walter Kosters and Jefrey Lijffijt",

booktitle = "BNAIC/BeneLearn 2020",

url = "https://bnaic.liacs.leidenuniv.nl/",

}

Schlender, T & Spanakis, G 2020, "Thy algorithm shalt not bear false witness": An Evaluation of Multiclass Debiasing Methods on Word Embeddings. in L Cao, W Kosters & J Lijffijt (eds), BNAIC/BeneLearn 2020. pp. 254-268, Benelux Conference on Artificial Intelligence and Machine Learning, Leiden, Netherlands, 19/11/20. <http://bnaic.liacs.leidenuniv.nl/bnaic2020proceedings.pdf>

TY - GEN

T1 - "Thy algorithm shalt not bear false witness": An Evaluation of Multiclass Debiasing Methods on Word Embeddings

AU - Schlender, Thalea

AU - Spanakis, Gerasimos

PY - 2020/11

Y1 - 2020/11

N2 - With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings, which are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings, and the need for their removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating religious bias removal on three widely used word embeddings, namely Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is Conceptor debiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets, respectively.

AB - With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings, which are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings, and the need for their removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating religious bias removal on three widely used word embeddings, namely Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is Conceptor debiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets, respectively.

M3 - Conference article in proceeding

SP - 254

EP - 268

BT - BNAIC/BeneLearn 2020

A2 - Cao, Lu

A2 - Kosters, Walter

A2 - Lijffijt, Jefrey

T2 - Benelux Conference on Artificial Intelligence and Machine Learning

Y2 - 19 November 2020 through 20 November 2020

ER -