Integrated Gradients: Interpreting the LLM Decision-Making Process
Ankush Seth
CTO @ Mi Analyst | Helping businesses accelerate growth and efficiency with Gen AI
Large Language Models have attracted a lot of attention in recent times. Through the likes of ChatGPT, these models have started to play a big role in content generation, customer support channels such as chatbots, content localization, and even content curation. As use cases and adoption increase, there is a growing need to ensure accuracy, transparency, and validation against bias. Understanding the behind-the-scenes decision-making process is therefore important in addressing these concerns.
It is important to understand that language models like GPT-3.5 do not operate using decision trees that you can trace the way you can with traditional classifiers. Instead, Large Language Models (LLMs) use deep learning techniques, specifically transformers, which are neural network architectures designed for sequence-to-sequence tasks, and this makes their decision-making processes harder to decipher.
However, one can leverage various interpretation techniques (integrated gradients, attention maps, input-output pairs, etc.) to gain insight into the decision-making process. In this post we will focus on Integrated Gradients as an interpretation technique.
Gradient-based techniques help highlight which words or tokens in an input sequence have a significant impact on the model’s output: one computes gradients of the output with respect to the input and uses their magnitude to identify influential words or phrases.
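As a toy illustration of this idea, the snippet below scores each token by the norm of the gradient of the top logit with respect to that token’s embedding. The tiny linear model and random embeddings are stand-ins invented for the example; a real LLM would expose its embedding layer instead.

```python
import torch
import torch.nn as nn

# Toy stand-in for a model head: token embeddings -> class logits.
# (Purely illustrative; a real LLM is vastly larger.)
model = nn.Sequential(nn.Flatten(), nn.Linear(5 * 8, 2))

# Fake embeddings for a 5-token input with embedding dimension 8.
embeds = torch.randn(1, 5, 8, requires_grad=True)

# Backpropagate the top logit down to the input embeddings.
logits = model(embeds)
logits[0, logits[0].argmax()].backward()

# One saliency score per token: the gradient norm over the embedding dim.
saliency = embeds.grad.norm(dim=-1).squeeze(0)
print(saliency)  # larger value = more locally influential token
```

This plain-gradient saliency only reflects the local slope at the input; Integrated Gradients, described next, addresses that by accumulating gradients along an entire path of inputs.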
The formula for Integrated Gradients looks like this:

\[
\mathrm{IG}_i(x) = (x_i - x'_i) \times \int_{\alpha=0}^{1} \frac{\partial F\left(x' + \alpha (x - x')\right)}{\partial x_i} \, d\alpha
\]

Here \(F\) is the model, \(x\) is the actual input, \(x'\) is a baseline input, \(i\) indexes a single input feature, and \(\alpha\) sweeps the straight-line path from the baseline to the input.
We won’t get into the math in this post but instead translate, at a high level, how this is applied in the deep learning context. There are four key steps one typically follows to apply Integrated Gradients to a deep learning model (a code sketch follows the list):

1. Choose a baseline input that represents the absence of signal, such as an all-zero embedding or a sequence of padding tokens.
2. Construct a series of interpolated inputs along the straight-line path from the baseline to the actual input.
3. Compute the gradient of the model’s output with respect to each interpolated input.
4. Average those gradients and multiply by the difference between the input and the baseline to get an attribution score per feature (per token, in the language setting).
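Here is a minimal PyTorch sketch of these four steps. The toy classifier head, tensor shapes, and the all-zero baseline are assumptions made for illustration; with a real LLM one would apply the same loop to the model’s token embeddings, and libraries such as Captum ship a ready-made `IntegratedGradients` implementation of the same idea.

```python
import torch

def integrated_gradients(model, input_embeds, baseline_embeds, target_idx, steps=50):
    """Approximate Integrated Gradients for one input, following the four steps above.

    `model` is assumed to be a callable mapping a batch of token embeddings of
    shape (batch, seq_len, dim) to logits of shape (batch, num_classes).
    """
    # Step 2: build `steps` interpolated inputs on the straight-line path
    # from the baseline to the actual input (alpha goes from 0 to 1).
    alphas = torch.linspace(0.0, 1.0, steps).view(-1, 1, 1)
    path = baseline_embeds + alphas * (input_embeds - baseline_embeds)
    path = path.detach().requires_grad_(True)

    # Step 3: gradient of the target logit w.r.t. every point on the path.
    logits = model(path)
    logits[:, target_idx].sum().backward()

    # Step 4: average the path gradients and scale by (input - baseline).
    avg_grads = path.grad.mean(dim=0, keepdim=True)
    attributions = (input_embeds - baseline_embeds) * avg_grads

    # Sum over the embedding dimension to get one score per token.
    return attributions.sum(dim=-1).squeeze(0)


# Illustrative usage with a toy classifier head and a zero baseline (step 1).
if __name__ == "__main__":
    seq_len, dim = 6, 16
    model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(seq_len * dim, 3))
    x = torch.randn(1, seq_len, dim)   # stand-in for real token embeddings
    baseline = torch.zeros_like(x)     # "no signal" reference input
    scores = integrated_gradients(model, x, baseline, target_idx=0)
    print(scores)                      # one attribution score per token
```

Increasing `steps` tightens the Riemann approximation of the integral at the cost of more forward and backward passes.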
Using the above approach, one can gain insight into the model’s decision-making process for a specific use case. Furthermore, one can build a confidence heat map by applying this technique across a wide variety of test inputs (a sketch of this follows below). As mentioned earlier, Integrated Gradients is just one interpretation technique; depending on the questions one is trying to answer, some of the other techniques might need to be leveraged as well.
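For instance, per-token attributions across a test set can be visualized as a heat map. The snippet below uses random scores purely as placeholders for the output of `integrated_gradients()` run over real test sentences.

```python
import numpy as np
import matplotlib.pyplot as plt

# Illustrative only: random attribution scores standing in for the output of
# integrated_gradients() run over 10 test sentences of 12 tokens each.
rng = np.random.default_rng(0)
scores = rng.normal(size=(10, 12))

plt.imshow(scores, cmap="RdBu", aspect="auto")
plt.xlabel("Token position")
plt.ylabel("Test example")
plt.colorbar(label="Attribution score")
plt.title("Token attributions across test inputs")
plt.show()
```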