登录查看更多内容

Discriminative vs Generative model

Ibrahim Sobh - PhD

?? Senior Expert of Artificial Intelligence, Valeo Group | LinkedIn Top Voice | Machine Learning | Deep Learning | Data Science | Computer Vision | NLP | Developer | Researcher | Lecturer

发布日期: 2018年6月23日

+ 关注

Broadly speaking, there are two main approaches to understand (model) the world and making decisions.

Discriminative Models: Directly maps features to class labels
Generative models: Models the distribution of features for each class.

Discriminative Models:

Consider a simple classification problem, where we need to distinguish between dogs and cats. Given the training data, we try to find the decision boundary that separates the two classes. At testing time, we check on which side the new input falls to decide whether it is a dog or cat.

The algorithm is trying to learn p(y|x) directly, to map from inputs to labels.

where y is the class label (dogs or cats) and x represents features.

Example: Logistic regression model

Generative Models:

Here we try to build a model of what dogs look like, and another model of what cats look like. At test time, we check whether the new sample looks more like cats or more like dogs.

The algorithm is trying to model p(x|y) and p(y) to understand the data

where p(x|y) models the distribution of features belonging of a certain class. And p(y) is called the class prior, the probability of the class.

Having both p(x|y) and p(y) modeled, Bayes Rule can be used to derive the posterior distribution

p(y|x) = p(x|y)p(y)/p(x)

To make predictions, we find the y that maximizes the term p(y|x), which class gives larger probability given the features, and in this case we can get rid of the denominator

argmax_y p(y|x) = argmax_y p(x|y) p(y)

Example: Na?ve Bayesian Model

Which one to use?

The answer is it depends! Here we discuss the difference between Logistic regression model and the Gaussian Discriminant Analysis model (GDA). GDA is a generative learning algorithm that assumes p(x|y) is distributed according to multivariate normal distribution. This assumption is valid in many cases. In this way, GDA makes strong modeling assumptions and usually works very well under theses assumptions. On the other hand, logistic regression is a discriminative algorithm used to model p(y|x) directly. Logistic regression makes weaker assumptions and hence more robust when data is not Gaussian.

Regards.

要查看或添加评论，请登录

Ibrahim Sobh - PhD的更多文章

The Evolution and Applications of Attention Mechanisms in Deep Learning: A Comprehensive Survey

2025年3月1日

The Evolution and Applications of Attention Mechanisms in Deep Learning: A Comprehensive Survey

Article created by Perplexity Deep Research. Prompt: "You are a deep-learning experienced researcher.

1 条评论
The Judicial Cognitive Process: From Case Inception to Judgment and the Promise of AI Augmentation

2025年3月1日

The Judicial Cognitive Process: From Case Inception to Judgment and the Promise of AI Augmentation

Research Report Created by Perplexity Deep Research My Research Question : "Now I want to dig deeper in the human judge…

3 条评论
How to Learn Artificial Intelligence: A Beginner’s Guide

2024年5月31日

How to Learn Artificial Intelligence: A Beginner’s Guide

Artificial Intelligence (AI) is a fascinating field that simulates human intelligence and task performance using…
[????????????] ?????????????????? ???????????? explained with code ??

2023年1月28日

[????????????] ?????????????????? ???????????? explained with code ??

"During the last two years there has been a plethora of large generative models such as ChatGPT or Stable Diffusion…

2 条评论
A conversation with ChatGPT about AI, study roadmap, applications, interview questions with answers, salaries, and more!

2023年1月21日

A conversation with ChatGPT about AI, study roadmap, applications, interview questions with answers, salaries, and more!

Hello everyone, and thank you all for being here today! Let me introduce our new star, the ChatGPT, who will discuss…
10 Object detectors with code [YOLOF, YOLOX, DETR, Deformable DETR, SparseR-CNN, VarifocalNet, PAA, SABL, ATSS, Double Heads]

2022年2月17日

10 Object detectors with code [YOLOF, YOLOX, DETR, Deformable DETR, SparseR-CNN, VarifocalNet, PAA, SABL, ATSS, Double Heads]

In this article, 10 well-known pre-trained object detectors are loaded and used in a standard and easy way. YOLOF: You…

6 条评论
FNet: Do we need the attention layer at all? [Explained with code]

2021年10月30日

FNet: Do we need the attention layer at all? [Explained with code]

FNet: Mixing Tokens with Fourier Transforms "In this work, we investigate whether simpler token mixing mechanisms can…
Patches Are All You Need! [with code]

2021年10月28日

Patches Are All You Need! [with code]

"It is only a matter of time before Transformers become the dominant architecture for vision domains, just as they have…
MLP is all you need! [with code]

2021年10月23日

MLP is all you need! [with code]

From Google: MLP-Mixer: An all-MLP Architecture for Vision Main idea: "While convolutions and attention are both…

2 条评论
9 Steps for solving any machine learning problem

2021年8月28日

9 Steps for solving any machine learning problem

In this article, we will present a universal blueprint that we can use to attack and solve any machine-learning…

2 条评论

See all articles

Discriminative vs Generative model

Ibrahim Sobh - PhD

?? Senior Expert of Artificial Intelligence, Valeo Group | LinkedIn Top Voice | Machine Learning | Deep Learning | Data Science | Computer Vision | NLP | Developer | Researcher | Lecturer

Ibrahim Sobh - PhD的更多文章

社区洞察

其他会员也浏览了

What Is Polynomial Regression in Machine Learning?

Effective XGBoost by Matt Harrison

Dinner with Data Buddies: Demystifying the ROC Curve

Support Vector Machines (SVM) in Plain English

XGBoost

Understanding Logistic Regression in Machine Learning: Sigmoid Function, Log-Likelihood Estimation, Class Imbalance Adjustment, and More

Different Loss Functions

From Data to Deployment: A Casual Guide to the Machine Learning Process

Support Vector Machine (SVM)

Logistics Regression using Gradient Descent

Ibrahim Sobh - PhD的更多文章

The Evolution and Applications of Attention Mechanisms in Deep Learning: A Comprehensive Survey

The Judicial Cognitive Process: From Case Inception to Judgment and the Promise of AI Augmentation

How to Learn Artificial Intelligence: A Beginner’s Guide

[????????????] ?????????????????? ???????????? explained with code ??

A conversation with ChatGPT about AI, study roadmap, applications, interview questions with answers, salaries, and more!

10 Object detectors with code [YOLOF, YOLOX, DETR, Deformable DETR, SparseR-CNN, VarifocalNet, PAA, SABL, ATSS, Double Heads]

FNet: Do we need the attention layer at all? [Explained with code]

Patches Are All You Need! [with code]

MLP is all you need! [with code]

9 Steps for solving any machine learning problem

社区洞察

其他会员也浏览了

What Is Polynomial Regression in Machine Learning?

Effective XGBoost by Matt Harrison

Dinner with Data Buddies: Demystifying the ROC Curve

Support Vector Machines (SVM) in Plain English

XGBoost

Understanding Logistic Regression in Machine Learning: Sigmoid Function, Log-Likelihood Estimation, Class Imbalance Adjustment, and More

Different Loss Functions

From Data to Deployment: A Casual Guide to the Machine Learning Process

Support Vector Machine (SVM)

Logistics Regression using Gradient Descent