A Bite-Sized Guide to Neural Networks: Unraveling the Magic Sandwich Analogy
An image generated by playground.ai


Neural Networks

  • Deep Learning → ANNs (Artificial Neural Networks), also historically known as cybernetics or connectionism
  • Modelled after the way our brain works → Biological Neural Networks → Original and far more complex
  • ANNs were invented when trying to theorise how the brain works.
  • ANN == Imitation brains
  • Bunch of simple chunks of software, each able to perform very simple math.
  • Each chunk is called a "cell" or a "neuron"
  • The power of a neural network lies in how these cells are connected.
  • The most common ANNs we create have about as many neurons as a worm does
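
To make "each cell performs very simple math" concrete, here is a minimal Python sketch of a single artificial neuron; the example inputs, weights, and threshold are made-up values for illustration, not anything from a real network.

```python
# A single artificial neuron: multiply each input by a weight, add up the
# results, and pass the sum through a simple activation (a threshold here).
# Inputs, weights, and threshold below are made-up illustrative values.

def neuron(inputs, weights, threshold=0.0):
    """Return 1 if the weighted sum of inputs exceeds the threshold, else 0."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return 1 if total > threshold else 0

# That really is all the math one "cell" does; the power comes from wiring
# many of these together.
print(neuron([1, 0], [0.7, -0.3]))  # weighted sum 0.7 -> fires (1)
print(neuron([0, 1], [0.7, -0.3]))  # weighted sum -0.3 -> stays quiet (0)
```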


  • Unlike a human, the neural net is at least able to devote its entire one-worm-power brain to the task at hand (if it is not distracted by extraneous data). But how can we solve problems with a bunch of interconnected cells?
  • The most powerful neural networks have more neurons than a single honeybee (and they take months and tens of thousands of dollars to train)
  • ANNs might be able to approach the number of neurons in the human brain by around 2050. Does this mean AI will have human-level intelligence then? → Not even close
  • Human brain neurons are so complex that each human neuron is more like a complete many-layered neural network all by itself. So rather than being a neural network made of 86 billion neurons, the human brain is a neural network made of 86 billion neural networks.
  • Our brain has far more complexities than ANNs, including many we don’t fully understand yet.

The Magic Sandwich Hole


  • Assume a magic hole that produces a random sandwich every few seconds
  • Sandwiches are very, very random and we have to sort them
  • We will try to automate the job
  • We are building a neural network to look at each sandwich and decide whether it's good. Ignore how it recognizes the ingredients and how it picks up each sandwich.
  • Also, if a sandwich is not consumable, it throws it into the recycling chute


  • Now, we have a bunch of inputs → a single output
  • A simple black-box view of the algorithm would look something like the sketch below
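
As a rough sketch of that black box (the ingredient names and the "tasty"/"terrible" labels are my placeholders, not from the article), the interface is just a function from a list of ingredients to a single verdict; the rest of the article is about what goes inside it.

```python
# The black box seen from outside: many ingredient inputs, one verdict output.
# "tasty" / "terrible" are placeholder labels; the decision logic is exactly
# what the following sections build up.

def rate_sandwich(ingredients):
    """Take a list of ingredient names, return a single verdict."""
    return "terrible"  # placeholder for now -- every sandwich is rejected

print(rate_sandwich(["chicken", "cheese"]))  # -> "terrible" (for now)
```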


  • The overall expectation is something like this: for every sandwich the hole produces, the network's single verdict should match what a human judge would say, so good sandwiches get kept and inedible ones go down the recycling chute


  • Let’s look into the black box now
  • First, a simple way of doing it → give each ingredient a different weight → the good ones get 1 and the ones we want to avoid get 0, and the network simply adds up the weights of whatever is in the sandwich (a sketch of this one-layer version follows a few bullets below)


  • Mud and eggshells will give 0+0 = 0


  • Peanut-butter-and-marshmallow gives 1 + 1 = 2 → fluffernutter!


  • A simple one-layer neural network is not sophisticated enough to recognize that some ingredients, while delicious on their own, are not delicious in combination with certain others. So it is susceptible to something we'll call the big sandwich bug: a sandwich that contains mulch might still be rated as tasty if it contains enough good ingredients to cancel out the mulch (see the sketch below).
  • To get a better neural network, we’re going to add another layer
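
Before we add that layer, here is a minimal sketch of the one-layer scorer described above; the ingredient list and the "tasty if the score is above zero" cutoff are illustrative assumptions. It reproduces the 0 and 2 scores from the examples and then falls straight into the big sandwich bug.

```python
# One-layer scorer: good ingredients get weight 1, ingredients to avoid get 0,
# and the network just adds up the weights of whatever is in the sandwich.
# Ingredient lists and the "tasty if score > 0" cutoff are assumptions.

WEIGHTS = {
    "peanut butter": 1, "marshmallow": 1, "chicken": 1, "cheese": 1,  # good -> 1
    "mud": 0, "eggshells": 0, "mulch": 0,                             # avoid -> 0
}

def score(sandwich):
    """Weighted sum over the ingredients in the sandwich."""
    return sum(WEIGHTS.get(item, 0) for item in sandwich)

def rate(sandwich):
    return "tasty" if score(sandwich) > 0 else "terrible"

print(score(["mud", "eggshells"]))              # 0 + 0 = 0 -> terrible
print(score(["peanut butter", "marshmallow"]))  # 1 + 1 = 2 -> fluffernutter
# The big sandwich bug: mulch can never drag the score down, so enough good
# ingredients make a mulch sandwich look tasty.
print(rate(["mulch", "chicken", "cheese", "peanut butter"]))  # "tasty" -- oops
```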


  • The new layer is called a hidden layer, because the user only sees the inputs and outputs. This isn't deep learning yet (that would require more layers), but we are getting there.


  • Let's call the new cell the punisher. We will give it a huge negative weight and connect it to everything bad.


  • Deli Sandwich cell (for chicken-and-cheese-type sandwiches, because they are good) → we will give it a modest weight of 1. But if we get too excited and assign it a very high weight, we'll be in danger of making the punisher less powerful.


  • Now, adding marshmallow to the Deli Sandwich would make it less tasty, so we'll need other cells that specifically look for and punish incompatibilities.
  • Let's call this one the cluckerfluffer


  • Activation Function → without it, the cluckerfluffer would punish every sandwich that contains chicken or marshmallow on its own. To avoid that, we give it a threshold → here 15 works: individually, chicken and marshmallow each contribute only 10, but together they add up to 20, which crosses the threshold → boom! The activated cell punishes only the combinations of ingredients that exceed the threshold.
  • With all the cells connected in similar sophisticated configurations, we have a neural net that can sort sandwiches.
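
Here is a rough sketch of that hidden-layer configuration. All the specific numbers are illustrative assumptions in the spirit of the article: 10 per connected ingredient, 15 for the combination threshold, -100 for the "huge negative" punishing weights, and a modest +1 for the Deli Sandwich cell.

```python
# Two-layer version: hidden cells sit between the ingredients and the verdict.
# Every number below is an illustrative assumption.

def step(total, threshold):
    """Activation function: the cell fires only if its input exceeds the threshold."""
    return 1 if total > threshold else 0

# Hidden cells: (input weight per connected ingredient, threshold, output weight)
HIDDEN = {
    # The punisher connects to everything bad and fires on any of it.
    "punisher":       ({"mud": 10, "eggshells": 10, "mulch": 10},  5, -100),
    # The Deli Sandwich cell rewards chicken-and-cheese with a modest weight.
    "deli":           ({"chicken": 10, "cheese": 10},             15,    1),
    # The cluckerfluffer: chicken alone or marshmallow alone gives 10 (below 15),
    # but together they reach 20, cross the threshold, and get punished.
    "cluckerfluffer": ({"chicken": 10, "marshmallow": 10},        15, -100),
}

# Good ingredients also feed the output directly, with weight 1 each.
DIRECT = {"peanut butter": 1, "marshmallow": 1, "chicken": 1, "cheese": 1}

def rate(sandwich):
    total = sum(DIRECT.get(item, 0) for item in sandwich)
    for in_weights, threshold, out_weight in HIDDEN.values():
        cell_input = sum(in_weights.get(item, 0) for item in sandwich)
        total += step(cell_input, threshold) * out_weight
    return "tasty" if total > 0 else "terrible"

print(rate(["chicken", "cheese"]))                           # deli fires -> tasty
print(rate(["peanut butter", "marshmallow"]))                # fluffernutter -> tasty
print(rate(["chicken", "marshmallow"]))                      # cluckerfluffer -> terrible
print(rate(["mulch", "chicken", "cheese", "peanut butter"])) # punisher -> terrible
```

The threshold is doing the real work here: it keeps the cluckerfluffer quiet when chicken or marshmallow shows up alone and lets it fire only on the combination.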


The Training Process

  • The basic point of machine learning is that we don’t have to set up the neural network by hand.
  • As the neural net rates each sandwich, it needs to compare its ratings against those of a panel of cooperative sandwich judges. Note: never volunteer to test the early stages of a machine learning algorithm.
  • We will start the previous network from scratch, with random weights


  • With these random starting weights, it hates cheese, loves marshmallow, is rather fond of mud, and doesn't really care about eggshells.
  • Now our neural net has a chance to improve. From this one sandwich, it doesn't know what the problem is. But if it looks at a batch of ten sandwiches, it can discover that if it had, in general, given mud a lower weight, it would match the human judges a bit better (see the training sketch below).
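
A minimal sketch of this loop, assuming a made-up batch of human-judged sandwiches: the article doesn't name an update rule, so this uses a simple perceptron-style nudge as a stand-in (real networks adjust their weights with gradient descent and backpropagation).

```python
import random

# Training sketch: start from random weights, compare the net's verdicts with
# the human judges over a batch, and nudge the weights so they agree more often.
# The tiny judged batch below is made up for illustration.

INGREDIENTS = ["peanut butter", "marshmallow", "chicken", "cheese", "mud", "eggshells"]

JUDGED = [                                    # (ingredients, 1 = tasty, 0 = terrible)
    (["peanut butter", "marshmallow"], 1),
    (["chicken", "cheese"], 1),
    (["mud", "eggshells"], 0),
    (["mud", "cheese"], 0),
]

random.seed(0)
weights = {item: random.uniform(-1, 1) for item in INGREDIENTS}  # random start

def predict(sandwich):
    return 1 if sum(weights[item] for item in sandwich) > 0 else 0

for _ in range(20):                           # a few passes over the batch
    for sandwich, verdict in JUDGED:
        error = verdict - predict(sandwich)   # -1, 0, or +1
        for item in sandwich:                 # nudge only the ingredients present
            weights[item] += 0.1 * error      # e.g. mud drifts toward a lower weight

print({item: round(w, 2) for item, w in weights.items()})
```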


  • After thousands more iterations and tens of thousands of sandwiches, the human judges are very, very sick of this, but the neural network is doing a lot better.


  • The neural net, as we saw before, needs a more sophisticated structure with hidden layers to make accurate predictions.


  • Pitfall: Class Imbalance → only a handful of every thousand sandwiches are delicious. Rather than go through all the trouble of figuring out how to weight each ingredient, the neural net may realize it can achieve 99.9% accuracy by rating all sandwiches as terrible, no matter what.
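
A quick back-of-the-envelope check of why "call everything terrible" scores so well, using roughly one delicious sandwich per thousand:

```python
# Why "call everything terrible" looks great under class imbalance:
# with about one delicious sandwich per thousand, a model that always
# answers "terrible" is right 99.9% of the time -- and completely useless.

total = 1000
delicious = 1
always_terrible_correct = total - delicious

print(always_terrible_correct / total)  # 0.999 -> 99.9% accuracy
```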


  • To combat class imbalance, pre-filter the sandwiches so that the training data contains approximately equal proportions of delicious and awful ones. Even then, the neural net might not learn about ingredients that are usually to be avoided but are delicious in very specific circumstances. Marshmallow is a great example: if the net sees it very rarely, it may decide to reject anything that contains marshmallow.
  • Class-imbalance problems show up all the time in practical applications, usually when we ask AI to detect a rare event. Ex: predicting when someone will leave a company, detecting fraudulent logins, medical imaging, detecting interesting celestial events like a solar flare, etc. All of these suffer from not having enough data on the rare events (a small rebalancing sketch follows below).
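
A minimal sketch of the pre-filtering idea, using synthetic labels: keep every rare delicious example and sample an equal number of terrible ones so the training set comes out roughly 50/50.

```python
import random

# Pre-filtering sketch: keep all rare "delicious" examples and sample an equal
# number of "terrible" ones so the training set is balanced.  Labels are synthetic.

random.seed(0)
sandwiches = [("sandwich_%d" % i, "delicious" if i % 500 == 0 else "terrible")
              for i in range(10_000)]        # about 1 in 500 is delicious

delicious = [s for s in sandwiches if s[1] == "delicious"]
terrible  = [s for s in sandwiches if s[1] == "terrible"]

balanced = delicious + random.sample(terrible, len(delicious))
random.shuffle(balanced)

print(len(delicious), len(terrible), len(balanced))  # 20 9980 40
```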


