A Bayesian approach to uncertainty in word embedding bias estimation

doi:10.1162/coli_a_00507

Alicja Dobrzeniecka, Rafal Urbaniak

A Bayesian approach to uncertainty in word embedding bias estimation

Artificial Intelligence
Computer Science Applications
Linguistics and Language
Language and Linguistics

Abstract Multiple measures, such as WEAT or MAC, attempt to quantify the magnitude of bias present in word embeddings in terms of a single-number metric. However, such metrics and the related statistical significance calculations rely on treating pre-averaged data as individual data points and employing bootstrapping techniques with low sample sizes. We show that similar results can be easily obtained using such methods even if the data are generated by a null model lacking the intended bias. Consequently, we argue that this approach generates false confidence. To address this issue, we propose a Bayesian alternative: hierarchical Bayesian modeling, which enables a more uncertainty-sensitive inspection of bias in word embeddings at different levels of granularity. To showcase our method, we apply it to Religion, Gender, and Race word lists from the original research, together with our control neutral word lists. We deploy the method using Google, Glove, and Reddit embeddings. Further, we utilize our approach to evaluate a debiasing technique applied to the Reddit word embedding. Our findings reveal a more complex landscape than suggested by the proponents of single-number metrics. The datasets and source code for the paper are publicly available.

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

Web-based, modern reference management
Collaborate and share with fellow researchers
Integration with Overleaf
Comprehensive BibTeX/BibLaTeX support
Save articles and websites directly from your browser
Search for new articles from a database of tens of millions of references

Try out CiteDrive

A Bayesian approach to uncertainty in word embedding bias estimation

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

More from our Archive

Extraction of intersecting palm‐vein and palmprint features for cancellable identity verification

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Language Varieties of Italy: Technology Challenges and Opportunities

<scp>AmbiFC</scp>: Fact-Checking Ambiguous Claims with Evidence

What Do the Regulators Mean? A Taxonomy of Regulatory Principles for the Use of AI in Financial Services

Editor's Introduction: Best Papers from the 20th International Conference on Cognitive Modeling

Sarcasm‐based tweet‐level stress detection

Examining the importance of local and global patterns for familiarity detection in soccer action sequences

Prosocial dynamics in multiagent systems

Analysis of deep mining model for indentation data of biomaterials