‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings

Thalea Schlender; Gerasimos Spanakis

doi:10.1007/978-3-030-76640-5_9

‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings

Thalea Schlender^*, Gerasimos Spanakis

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

8 Downloads (Pure)

Abstract

With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has been increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings and are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings and the need for its removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating the religious bias removal on three widely used word embeddings, namely: Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is ConceptorDebiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets respectively.

Original language	English
Title of host publication	Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers
Editors	Mitra Baratchi, Lu Cao, Walter A. Kosters, Jefrey Lijffijt, Jan N. van Rijn, Frank W. Takes
Publisher	Springer
Pages	141-156
Number of pages	16
Volume	1398 CCIS
ISBN (Print)	9783030766399
DOIs	https://doi.org/10.1007/978-3-030-76640-5_9
Publication status	Published - 1 Jan 2021
Event	32nd Benelux Conference on Artificial Intelligence and Belgian-Dutch Conference on Machine Learning - Online, Leiden, Netherlands Duration: 19 Nov 2020 → 20 Nov 2020 Conference number: 32 https://bnaic.liacs.leidenuniv.nl/

Publication series

Series	Communications in Computer and Information Science
Volume	1398 CCIS
ISSN	1865-0929

Conference

Conference	32nd Benelux Conference on Artificial Intelligence and Belgian-Dutch Conference on Machine Learning
Abbreviated title	BNAIC/BeNeLearn 2020
Country/Territory	Netherlands
City	Leiden
Period	19/11/20 → 20/11/20
Internet address	https://bnaic.liacs.leidenuniv.nl/

Keywords

Natural language processing
Social bias
Word embeddings

Access to Document

10.1007/978-3-030-76640-5_9

Full TextFinal published version, 383 KBLicence: Taverne

Cite this

Schlender, T., & Spanakis, G. (2021). ‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings. In M. Baratchi, L. Cao, W. A. Kosters, J. Lijffijt, J. N. van Rijn, & F. W. Takes (Eds.), Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers (Vol. 1398 CCIS, pp. 141-156). Springer. https://doi.org/10.1007/978-3-030-76640-5_9

Schlender, Thalea ; Spanakis, Gerasimos. / ‘Thy Algorithm Shalt Not Bear False Witness’ : An Evaluation of Multiclass Debiasing Methods on Word Embeddings. Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers. editor / Mitra Baratchi ; Lu Cao ; Walter A. Kosters ; Jefrey Lijffijt ; Jan N. van Rijn ; Frank W. Takes. Vol. 1398 CCIS Springer, 2021. pp. 141-156 (Communications in Computer and Information Science, Vol. 1398 CCIS).

@inbook{44aa77e8256044f18cb3840c99d98111,

title = "{\textquoteleft}Thy Algorithm Shalt Not Bear False Witness{\textquoteright}: An Evaluation of Multiclass Debiasing Methods on Word Embeddings",

abstract = "With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has been increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings and are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings and the need for its removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating the religious bias removal on three widely used word embeddings, namely: Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is ConceptorDebiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets respectively.",

keywords = "Natural language processing, Social bias, Word embeddings",

author = "Thalea Schlender and Gerasimos Spanakis",

note = "Publisher Copyright: {\textcopyright} 2021, Springer Nature Switzerland AG.; 32nd Benelux Conference on Artificial Intelligence and Belgian-Dutch Conference on Machine Learning, BNAIC/BeNeLearn 2020 ; Conference date: 19-11-2020 Through 20-11-2020",

year = "2021",

month = jan,

day = "1",

doi = "10.1007/978-3-030-76640-5_9",

language = "English",

isbn = "9783030766399",

volume = "1398 CCIS",

series = "Communications in Computer and Information Science",

publisher = "Springer",

pages = "141--156",

editor = "Mitra Baratchi and Lu Cao and Kosters, {Walter A.} and Jefrey Lijffijt and {van Rijn}, {Jan N.} and Takes, {Frank W.}",

booktitle = "Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers",

address = "United States",

url = "https://bnaic.liacs.leidenuniv.nl/",

}

Schlender, T & Spanakis, G 2021, ‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings. in M Baratchi, L Cao, WA Kosters, J Lijffijt, JN van Rijn & FW Takes (eds), Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers. vol. 1398 CCIS, Springer, Communications in Computer and Information Science, vol. 1398 CCIS, pp. 141-156, 32nd Benelux Conference on Artificial Intelligence and Belgian-Dutch Conference on Machine Learning, Leiden, Netherlands, 19/11/20. https://doi.org/10.1007/978-3-030-76640-5_9

‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings. / Schlender, Thalea; Spanakis, Gerasimos.
Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers. ed. / Mitra Baratchi; Lu Cao; Walter A. Kosters; Jefrey Lijffijt; Jan N. van Rijn; Frank W. Takes. Vol. 1398 CCIS Springer, 2021. p. 141-156 (Communications in Computer and Information Science, Vol. 1398 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

TY - CHAP

T1 - ‘Thy Algorithm Shalt Not Bear False Witness’

T2 - 32nd Benelux Conference on Artificial Intelligence and Belgian-Dutch Conference on Machine Learning

AU - Schlender, Thalea

AU - Spanakis, Gerasimos

N1 - Conference code: 32

PY - 2021/1/1

Y1 - 2021/1/1

N2 - With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has been increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings and are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings and the need for its removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating the religious bias removal on three widely used word embeddings, namely: Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is ConceptorDebiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets respectively.

AB - With the vast development and employment of artificial intelligence applications, research into the fairness of these algorithms has been increased. Specifically, in the natural language processing domain, it has been shown that social biases persist in word embeddings and are thus in danger of amplifying these biases when used. As an example of social bias, religious biases are shown to persist in word embeddings and the need for its removal is highlighted. This paper investigates the state-of-the-art multiclass debiasing techniques: Hard debiasing, SoftWEAT debiasing and Conceptor debiasing. It evaluates their performance when removing religious bias on a common basis by quantifying bias removal via the Word Embedding Association Test (WEAT), Mean Average Cosine Similarity (MAC) and the Relative Negative Sentiment Bias (RNSB). By investigating the religious bias removal on three widely used word embeddings, namely: Word2Vec, GloVe, and ConceptNet, it is shown that the preferred method is ConceptorDebiasing. Specifically, this technique manages to decrease the measured religious bias on average by 82.42%, 96.78% and 54.76% for the three word embedding sets respectively.

KW - Natural language processing

KW - Social bias

KW - Word embeddings

UR - http://www.scopus.com/inward/record.url?scp=85111243627&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-76640-5_9

DO - 10.1007/978-3-030-76640-5_9

M3 - Chapter

SN - 9783030766399

VL - 1398 CCIS

T3 - Communications in Computer and Information Science

SP - 141

EP - 156

BT - Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers

A2 - Baratchi, Mitra

A2 - Cao, Lu

A2 - Kosters, Walter A.

A2 - Lijffijt, Jefrey

A2 - van Rijn, Jan N.

A2 - Takes, Frank W.

PB - Springer

Y2 - 19 November 2020 through 20 November 2020

ER -

Schlender T, Spanakis G. ‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings. In Baratchi M, Cao L, Kosters WA, Lijffijt J, van Rijn JN, Takes FW, editors, Artificial Intelligence and Machine Learning - 32nd Benelux Conference, BNAIC/Benelearn 2020, Revised Selected Papers. Vol. 1398 CCIS. Springer. 2021. p. 141-156. (Communications in Computer and Information Science, Vol. 1398 CCIS). doi: 10.1007/978-3-030-76640-5_9

‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Cite this