Background
To put the idea of the book in context: I am creating a small community for my book, Mathematical Foundations of Data Science. You get the PDF when it is released, you also get chapters as they are released, and you get to engage and ask questions. The price is a one-off 40 USD. If you are interested, please DM me. I am keeping places limited because I want to learn from feedback, so that's an important criterion as well.
The idea of the book is simple: in an age when a majority of code could be LLM-generated, it is very useful to approach AI from first principles, i.e. from the maths. The good news is that there are only four things to know: linear algebra, statistics, optimization and probability theory. The bad news is that it is not easy to tie these four ideas to every machine learning and deep learning algorithm, considering that the field itself is rapidly evolving. In this sense, the book helps by creating a concise structure. Since these ideas are known to many people at A levels (around age 18), the book builds a foundation for understanding AI on ideas you already know, even if you studied them years ago!
In this post, I explain the idea of a hidden/mapping function and how this idea can be used to understand all machine learning and deep learning algorithms.
In machine learning and deep learning, a hidden function refers to a mathematical function that transforms the input data into intermediate representations that contribute to the final output. For example, we can think of the "hidden layers" of a deep neural network as playing the role of a hidden function. We can think of the hidden function as a process that transforms inputs (like x) into outputs (like y) through some internal computation that isn't directly observable, hence the name "hidden." In deep learning, this function could involve a combination of weights, biases, and activation functions applied to the inputs in the hidden layers.
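To make this concrete, here is a minimal NumPy sketch of a hidden function: a single hidden layer computing h = relu(Wx + b). The sizes, weights and choice of activation are illustrative assumptions, not values from any particular trained model.

```python
import numpy as np

rng = np.random.default_rng(0)

def hidden_function(x, W, b):
    """Transform input x into a hidden representation: h = relu(Wx + b)."""
    return np.maximum(0.0, W @ x + b)  # ReLU activation

x = rng.normal(size=3)       # an input vector with 3 features (assumed)
W = rng.normal(size=(4, 3))  # weights: 3 inputs -> 4 hidden units (assumed)
b = np.zeros(4)              # biases

h = hidden_function(x, W, b)  # the intermediate (hidden) representation
print(h)
```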
One of the key benefits of hidden functions is their ability to model complex, nonlinear relationships between x and y. Simple models like linear regression capture only linear relationships, but with hidden functions, neural networks can model the complex, nonlinear relationships that exist in real-world data.
Hidden functions are key to how machine learning models generalize. By learning useful representations of the data through the hidden functions, a model can make accurate predictions not just on the training data but also on unseen data.
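As a small illustration of both points (my own toy example, not from the book), the sketch below fits a plain linear model and a one-hidden-layer network to data generated by y = sin(x). The hidden layer lets the network capture the curve that the linear model cannot; the dataset and hyperparameters are arbitrary choices.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))  # toy inputs (assumed)
y = np.sin(X).ravel()                  # a nonlinear hidden/mapping function

linear = LinearRegression().fit(X, y)
mlp = MLPRegressor(hidden_layer_sizes=(32,), max_iter=5000,
                   random_state=0).fit(X, y)

print("linear R^2:", linear.score(X, y))  # poor fit on a nonlinear target
print("MLP R^2:   ", mlp.score(X, y))     # hidden layer captures the curve
```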
From a learning standpoint, we can extrapolate this idea further and view any machine learning or deep learning algorithm as being modelled by a hidden function or rule, as below. A short code sketch follows each list to illustrate the rules in practice.
I used a combination of ChatGPT and personal insights for this.
1. Supervised Learning Algorithms
- Linear Regression: The hidden rule is the linear relationship between the input features and the target variable (how a change in the input affects the output in a straight-line manner).
- Logistic Regression: The hidden rule is the probability boundary between two classes, learned through the input features, which separates the outcomes into "yes" or "no."
- Decision Trees: The hidden rule is the sequence of decision boundaries (questions) that best splits the data into different classes or predictions.
- Random Forest: The hidden rule is the collective decision-making process of multiple decision trees, where each tree contributes to a more reliable prediction by finding its own decision boundaries.
- Support Vector Machines (SVM): The hidden rule is the optimal hyperplane (or boundary) that maximizes the margin between different classes in the data.
- k-Nearest Neighbors (k-NN): The hidden rule is that similar data points are close to each other. The algorithm tries to learn that nearby data points share similar outcomes.
- Gradient Boosting Machines (GBM): The hidden rule is to learn how to improve from mistakes by sequentially building models that correct errors from previous ones.
- XGBoost: The hidden rule is similar to GBM, but it also tries to learn how to balance accuracy and simplicity by preventing the model from becoming too complex.
- AdaBoost: The hidden rule is to learn to focus more on difficult cases, improving predictions by giving more weight to examples that were previously misclassified.
- Naive Bayes: The hidden rule is the probabilistic relationship between features and the target, assuming that all features are independent. It calculates how likely an outcome is based on the occurrence of features.
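Here is a hedged scikit-learn sketch of several of these supervised learners recovering their hidden rules from the same toy two-class problem. The dataset and hyperparameters are illustrative choices, not recommendations.

```python
from sklearn.datasets import make_moons
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier

# A toy nonlinear two-class dataset (assumed for illustration)
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)

models = {
    "logistic regression": LogisticRegression(),           # probability boundary
    "decision tree": DecisionTreeClassifier(max_depth=4),  # sequence of splits
    "random forest": RandomForestClassifier(n_estimators=100),  # many trees
    "SVM (RBF)": SVC(),                                    # max-margin boundary
    "k-NN": KNeighborsClassifier(n_neighbors=5),           # local similarity
}

for name, model in models.items():
    model.fit(X, y)
    print(f"{name}: train accuracy = {model.score(X, y):.2f}")
```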
2. Unsupervised Learning Algorithms
- K-Means Clustering: The hidden rule is to learn the groupings (clusters) of data points that are most similar to each other.
- Hierarchical Clustering: The hidden rule is to learn how to group similar data points into a hierarchy, starting from smaller groups and combining them into larger clusters.
- DBSCAN: The hidden rule is to identify dense regions in the data and group points that are close together, while ignoring isolated points (outliers).
- Principal Component Analysis (PCA): The hidden rule is to learn the main directions of variation in the data, so that it can reduce the complexity while retaining the most important information.
- t-SNE: The hidden rule is to learn how to preserve the structure of relationships between data points when visualizing high-dimensional data in a lower-dimensional space.
- Autoencoders: The hidden rule is to learn a compressed representation of the data that still contains the most important features, which can be used to reconstruct the original data.
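A similar sketch for the unsupervised case, again on assumed toy data: K-Means recovers the groupings, and PCA recovers the main directions of variation.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.datasets import make_blobs

# Toy data with three natural groupings (assumed for illustration)
X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print("cluster sizes:", np.bincount(kmeans.labels_))  # the learned groupings

pca = PCA(n_components=2).fit(X)
print("variance explained:", pca.explained_variance_ratio_)  # main directions
```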
3. Reinforcement Learning Algorithms
- Q-Learning: The hidden rule is to learn the value of taking specific actions in different situations to maximize the total reward over time.
- Deep Q-Networks (DQN): The hidden rule is to learn how to approximate the best actions in complex environments using a neural network to estimate future rewards.
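Here is a hedged sketch of tabular Q-learning on a tiny made-up chain environment: states 0 to 4, action 0 moves left, action 1 moves right, and reaching state 4 gives a reward. The environment, rewards and hyperparameters are all assumptions for illustration; the learned Q-table encodes the hidden rule "move right".

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, epsilon = 0.1, 0.9, 0.1  # learning rate, discount, exploration

for episode in range(500):
    s = 0
    while s != n_states - 1:
        # Epsilon-greedy action selection
        a = rng.integers(n_actions) if rng.random() < epsilon else Q[s].argmax()
        s_next = max(s - 1, 0) if a == 0 else min(s + 1, n_states - 1)
        r = 1.0 if s_next == n_states - 1 else 0.0
        # Q-learning update: move Q(s,a) toward r + gamma * max_a' Q(s',a')
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s = s_next

print(Q.round(2))  # action values: column 1 ("right") dominates in every state
```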
4. Deep Learning Algorithms
- Multilayer Perceptron (MLP): The hidden rule is to learn complex, nonlinear relationships between inputs and outputs by passing data through layers of neurons.
- Convolutional Neural Networks (CNN): The hidden rule is to learn how to recognize patterns in the data, such as edges, shapes, or objects in images, by applying filters at different levels.
- Recurrent Neural Networks (RNN): The hidden rule is to learn how to capture dependencies over time, processing sequential data by remembering information from previous inputs.
- LSTM Networks: The hidden rule is to learn which parts of the sequence are important to remember or forget over time, making it effective at handling long-term dependencies in data like text or time-series.
- Generative Adversarial Networks (GANs): The hidden rule is for the generator to learn how to create data that is indistinguishable from real data, while the discriminator learns to detect fake data.
- Transformer Networks: The hidden rule is to learn how to pay attention to important parts of the input data, like focusing on key words or parts of a sentence in natural language processing.
- BERT: The hidden rule is to learn how to understand the context of a word in a sentence by looking at both the words before and after it, improving its understanding of language.
- Variational Autoencoders (VAEs): The hidden rule is to learn how to encode data into a simpler latent space and then generate new data from that space, while ensuring the generated data is similar to the original.
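Finally, a minimal NumPy sketch of scaled dot-product attention, the core computation behind Transformer networks and BERT. The shapes and random weights are illustrative assumptions, not a real trained model.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Weight each value by how well its key matches the query."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of queries to keys
    weights = softmax(scores)        # attention distribution over positions
    return weights @ V, weights

seq_len, d_model = 4, 8  # e.g. 4 tokens with 8-dim embeddings (assumed)
X = rng.normal(size=(seq_len, d_model))
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))

out, w = attention(X @ Wq, X @ Wk, X @ Wv)
print(w.round(2))  # each row: how much one token attends to the others
```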
If you want to be part of the early-adopter version of the book described above, please DM me here.