A look back at the NL-Augmenter collaborative AI project and the issue of equity in group representation

Is AI socially fair?

The problem of fairness in group representation is an active research area in Natural Language Processing (NLP), and its importance cannot be neglected given the growing reliance of human decisions on expert systems built on AI models. It becomes crucial in legal or business domains: a biased model might influence the severity of a sentence handed down by a judge through an ethnicity bias, aggravate gender imbalance during CV screening, or negatively affect the attribution of a bank credit because of age, ethnicity or gender biases present in the system.

How is bias introduced?

Bias can be introduced into an AI model at several stages: during dataset construction, during training (depending on how well the model encodes information into the embedding space), or during finetuning, where both the finetuning dataset and the model hyperparameters influence the model's internal representations. When bias is present at several stages, it becomes algorithmic and hard to detect.

What solutions exist?

Intrinsic bias measurement methods, such as PCA (Principal Component Analysis) or WEAT (Word Embedding Association Test), have shown no correlation with the results of extrinsic methods. Extrinsic gender bias datasets, such as WinoBias, Winogender, StereoSet and CrowS-Pairs, contain inconsistencies and are far from optimal.
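To illustrate how an intrinsic measure of this kind works, here is a minimal sketch of the WEAT effect size, assuming the word vectors are supplied by the user as a dictionary of numpy arrays and the target and attribute sets as lists of words; it is an illustration, not a reference implementation.

```python
# A minimal sketch of the WEAT effect size, assuming `vec` maps words to numpy vectors.
import numpy as np

def cosine(u, v):
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

def association(w, A, B, vec):
    """s(w, A, B): mean similarity of w to attribute set A minus mean similarity to B."""
    return (np.mean([cosine(vec[w], vec[a]) for a in A])
            - np.mean([cosine(vec[w], vec[b]) for b in B]))

def weat_effect_size(X, Y, A, B, vec):
    """Difference of mean associations of target sets X and Y,
    normalised by the standard deviation over all target words."""
    s_X = [association(x, A, B, vec) for x in X]
    s_Y = [association(y, A, B, vec) for y in Y]
    return (np.mean(s_X) - np.mean(s_Y)) / np.std(s_X + s_Y, ddof=1)
```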

Moreover, the existing solutions take the biased datasets as a given, proposing debiasing methods either during the pre-training phase (by hiding information that might be a source of potential bias, such as gender or age) or during the finetuning phase, where debiasing terms are introduced into the loss to correct the latent representations the model learned during training.
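As an illustration of the finetuning-time approach, the sketch below adds a counterfactual penalty to the task loss, pushing the model towards identical predictions on an input and its gender-swapped copy. It is a simplified, hypothetical example: `model` and `swap_gender` are placeholders, and this is not the exact algorithm of any particular published method.

```python
# A simplified sketch of a debiasing term added to the finetuning loss.
# `model` and `swap_gender` are hypothetical placeholders.
import torch.nn.functional as F

def debiased_loss(model, inputs, labels, swap_gender, lam=0.1):
    """Task loss plus a penalty for diverging predictions on gender-swapped inputs."""
    logits = model(inputs)                       # predictions on the original batch
    task_loss = F.cross_entropy(logits, labels)

    swapped_logits = model(swap_gender(inputs))  # predictions on the swapped batch
    # Penalise divergence between the two prediction distributions.
    bias_penalty = F.kl_div(F.log_softmax(swapped_logits, dim=-1),
                            F.softmax(logits, dim=-1),
                            reduction="batchmean")
    return task_loss + lam * bias_penalty
```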

Where does the NL-Augmenter project come from?

The NL-Augmenter framework is a subproject of the global Natural Language Generation, Evaluation & Metrics (GEM) project. GEM is an international initiative, supported by the Google Research team, which brought together academic and industrial researchers whose focus lies in the domain of Natural Language Generation (NLG). The list of academic participants includes Harvard University, Stanford University, the Allen Institute for AI and Georgia Tech, while the list of organisations includes, but is not limited to, IBM Research, Microsoft and Hugging Face.

As a PhD holder and a researcher in the field of conditional language generation, I had already taken part in the first phase of the GEM challenge, testing the performance of non-autoregressive conditional generative language models on the proposed datasets. This work resulted in a workshop participation under the umbrella of the ACL 2021 conference, a key event in the field, as well as a participation in the Soph.I.A Summit 2021, highlighted by the ActuIA magazine. NL-Augmenter emerged during the second phase of GEM, enlarging the scope of the project by proposing practical tools for dataset filtering and enhancement.

Solution proposed by Inetum

In view of the bias problem, it was important to propose a viable solution for detecting bias and group inequity at the dataset construction phase, as one of the tools NL-Augmenter exposes.

Alongside a “plug and play” quadrilingual gender bias filter, which detects gender imbalance in English, French, Polish and Russian according to five categories of lexical seeds (including personal pronouns, words defining relations, titles and names), I also added a language-agnostic universal bias filter, which allows the user to define lexical seeds in a language of preference.
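To give an idea of the mechanism, the sketch below reproduces the lexical-seed principle behind such a filter: count gendered seeds and keep only reasonably balanced utterances. The seed lists and the threshold are short illustrative placeholders, not the actual lexicons or logic shipped with the NL-Augmenter filter.

```python
# A minimal sketch of a lexical-seed gender imbalance check (illustrative seeds only).
import re

GENDER_SEEDS = {
    "en": {"male": {"he", "him", "his", "mr", "father", "son"},
           "female": {"she", "her", "hers", "mrs", "ms", "mother", "daughter"}},
    "fr": {"male": {"il", "lui", "monsieur", "père", "fils"},
           "female": {"elle", "madame", "mère", "fille"}},
}

def gender_counts(text, lang="en"):
    """Count male and female seed occurrences in a text for the given language."""
    tokens = re.findall(r"\w+", text.lower())
    return {gender: sum(tok in seeds for tok in tokens)
            for gender, seeds in GENDER_SEEDS[lang].items()}

def keep(text, lang="en", max_ratio=2.0):
    """Return True when the utterance is kept, i.e. no strong gender imbalance is found."""
    counts = gender_counts(text, lang)
    m, f = counts["male"], counts["female"]
    if m == 0 and f == 0:
        return True      # no gendered seeds at all
    if m == 0 or f == 0:
        return False     # only one gender is represented
    return max(m, f) / min(m, f) <= max_ratio
```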

A more sophisticated bilingual (English and French) group inequity filter helps to discover potential discrimination issues in a text corpus. The extrinsic nature of these filters eliminates the interpretability problem, as the explicit categorisation makes it clear which lexical seeds caused a dataset utterance to be flagged as biased.
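The interpretability argument can be made concrete with a small sketch that reports, per category, which seeds triggered the flag; again, the categories and seed lists are illustrative placeholders rather than the filter's real lexicons.

```python
# A sketch of per-category explanations for a flagged utterance (illustrative seeds only).
import re

CATEGORY_SEEDS = {
    "pronouns": {"he", "she", "him", "her"},
    "titles":   {"mr", "mrs", "ms"},
    "kinship":  {"father", "mother", "son", "daughter"},
}

def explain(utterance):
    """Return, for each category, the seeds found in the utterance."""
    tokens = set(re.findall(r"\w+", utterance.lower()))
    return {cat: sorted(tokens & seeds)
            for cat, seeds in CATEGORY_SEEDS.items() if tokens & seeds}

print(explain("Mr Smith told his daughter that she should wait."))
# -> {'pronouns': ['she'], 'titles': ['mr'], 'kinship': ['daughter']}
```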

ActuIA article on the Soph.I.A Summit: https://www.actuia.com/contribution/isabelle-galy/soph-i-a-summit-des-recherches-avancees-pour-ameliorer-lia/

The flexibility of their API allows the extension of bias categories and the addition of new lexical seeds, to better suit user needs. This contribution is highlighted in a joint scientific article with the other NL-Augmenter contributors, which has been submitted to a prestigious NLP conference.
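As a purely hypothetical illustration (the real NL-Augmenter API may differ), extending the structure used in the sketches above amounts to adding seeds to an existing category or declaring a new one:

```python
# Illustrative extension of the seed dictionary from the sketch above,
# not the actual NL-Augmenter API.
CATEGORY_SEEDS["titles"] |= {"dr", "prof"}                    # extra seeds
CATEGORY_SEEDS["occupations"] = {"nurse", "engineer", "ceo"}  # a new user-defined category
```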

Written by Anna Shvets, Researcher and Deep Learning Engineer

