#2 Coding Multilayer Perceptrons
Cover image: the California Housing dataset, visualized


Many real-life problems come down to a choice between two options: for any new decision, you go with either Choice 1 or Choice 2. In machine learning, if two classes can be divided by a straight line (more generally, a hyperplane), they are called linearly separable. If not, they are not linearly separable.

In deep learning, things get more complicated. Now we want to make predictions from N inputs, where N can be very large. And after all the calculations and training, we want to produce Z outputs, which can also be many.

The most famous example is MNIST, a dataset of handwritten digits. Here, the inputs are images of digits from 0 to 9, and the output is a probability for each possible digit. For example, the model might say there is a 0.3 chance the image shows a 3, a 0.5 chance it is an 8, and a 0.2 chance it is a 6. All these probabilities add up to 1.

In artificial neural networks, a multilayer perceptron (MLP) has three layers: input, hidden, and output.

Input:

  • The first box, where we put our information, can take different kinds of numbers depending on what we're working on. It could be anything from numbers representing colours in pictures to numbers showing how much someone likes something.

Output:

  • The last box, where we get our answer from, gives us numbers that match what we want. For example, if we're trying to guess if a picture has a cat or a dog, it might give us numbers showing how likely it is for each animal.

Hidden Layers:

  • The middle box, the "thinking" box, can have more than one layer. Think of it as having more helpers inside. The more layers, the more helpers, and the better we can understand complicated things.

Feedforward Process:

  • When we put our information into the first box, it gets passed through the chain. Each box takes what the previous one gave and does something with it.
  • In the "thinking" box, each helper adds up the numbers it gets, kind of like mixing ingredients in a recipe. Then, it passes the total through a special filter, called an activation function, that decides how much of it should go through.
  • This process happens from the first box to the last, and in the end, we get an answer from the last box.
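The feedforward steps above can be sketched in a few lines of NumPy. The layer sizes here (4 inputs, 5 hidden helpers, 3 outputs) are illustrative, not from any particular model:

```python
import numpy as np

def relu(z):
    # The "special filter": decides how much of each total passes through
    return np.maximum(0.0, z)

def softmax(z):
    # Turns the final totals into probabilities that add up to 1
    e = np.exp(z - z.max())
    return e / e.sum()

rng = np.random.default_rng(0)

# Illustrative sizes: 4 inputs -> 5 hidden helpers -> 3 outputs
W1, b1 = rng.normal(size=(5, 4)), np.zeros(5)
W2, b2 = rng.normal(size=(3, 5)), np.zeros(3)

x = rng.normal(size=4)       # the "first box": input features
h = relu(W1 @ x + b1)        # each helper sums its inputs, then filters
y = softmax(W2 @ h + b2)     # the "last box": one probability per class
```

Each layer does the same two things: a weighted sum, then a filter. Stacking more hidden layers just repeats this step before the output.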


Project

Using Keras (the TensorFlow high-level API), a standard model workflow looks like this:
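A minimal sketch of that workflow, define, compile, fit, predict. The layer sizes, optimizer, and loss here are illustrative placeholders, and the data is random dummy data:

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# 1. Define the architecture
model = keras.Sequential([
    keras.Input(shape=(8,)),
    layers.Dense(32, activation="relu"),
    layers.Dense(1),  # single regression output
])

# 2. Compile: pick an optimizer and a loss function
model.compile(optimizer="adam", loss="mse")

# 3. Fit on training data (dummy data here)
X = np.random.rand(100, 8)
y = np.random.rand(100)
model.fit(X, y, epochs=2, verbose=0)

# 4. Predict on new data
preds = model.predict(X, verbose=0)
```

The same four steps apply regardless of the dataset; only the input shape, layers, and loss change.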

I used the California dataset to predict house values:

Exploring the dataset: When I plotted Latitude and Longitude against house prices, a clear pattern emerged. As expected, Los Angeles and San Francisco are hubs where prices are highest.
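One way to reproduce that kind of plot, assuming scikit-learn's copy of the dataset (the column names `Latitude`, `Longitude`, and `MedHouseVal` follow `fetch_california_housing`):

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so this runs without a display
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_california_housing

housing = fetch_california_housing(as_frame=True)
df = housing.frame  # includes Latitude, Longitude, and MedHouseVal

# Scatter the coordinates, coloring each point by its median house value
fig, ax = plt.subplots()
sc = ax.scatter(df["Longitude"], df["Latitude"],
                c=df["MedHouseVal"], s=2, cmap="viridis")
fig.colorbar(sc, label="Median house value")
ax.set_xlabel("Longitude")
ax.set_ylabel("Latitude")
fig.savefig("california_prices.png")
```

Plotting longitude on the x-axis and latitude on the y-axis makes the scatter look like a map of California, so the coastal price hubs stand out immediately.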


Model building: I divided the dataset into train and test sets and built a model with 3 layers. You can see clips of the code here:
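A sketch of what those clips likely contain. The layer widths, epoch count, and other hyperparameters are my own guesses, not the exact values from the repo:

```python
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from tensorflow import keras
from tensorflow.keras import layers

# Load the data and split into train and test sets
X, y = fetch_california_housing(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Standardize features: helps gradient descent converge
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# 3-layer MLP: two hidden ReLU layers plus a linear regression output
model = keras.Sequential([
    keras.Input(shape=(X_train.shape[1],)),
    layers.Dense(64, activation="relu"),
    layers.Dense(32, activation="relu"),
    layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse", metrics=["mae"])
model.fit(X_train, y_train, epochs=5, validation_split=0.1, verbose=0)

test_loss, test_mae = model.evaluate(X_test, y_test, verbose=0)
```

Since house value is a continuous target, the output layer has a single unit with no activation, and mean squared error is the natural loss.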


This is the final interpretation of the model:

Github Repo: MLP for California Housing Price Regression problem


Sources

  1. Video on Neural Networks: MLP
  2. Dive into Deep Learning
  3. Scikit Learn Documentation
  4. TensorFlow Documentation
