Bias Mitigation

Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes (Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL) 2025)

A Multi-LLM Debiasing Framework (arXiv 2024)

Towards interpreting and mitigating shortcut learning behavior of NLU models (proceedings of the north american chapter of the association for computational linguistics 2021)