Understanding the fashion and chronology of algorithms

We don't usually think of algorithms in terms of fashionable trends, but in some cases it's helpful to do so, to gain a better understanding of how they are used to solve problems.

The purpose of an algorithm is to solve a problem. In our case, for data science, the problem is data driven. The choice of the tool (algorithm) is made in terms of the problem at hand, the resources, the constraints and also fashion (trend) - much along the lines of Maslow's quote: "If the only tool you have is a hammer, you tend to see every problem as a nail."

Over the last 50 years, machine learning and statistical algorithms have evolved significantly.

Before the 1950s, we started with statistical methods. In the 1960s, we saw the introduction of Bayesian approaches for probabilistic inference to complement the frequentist approaches. However, from the standpoint of machine learning, as we have seen before, both of these approaches are statistical (because the underlying data distribution is assumed to be knowable).

The 1970s experienced an AI winter due to pessimism about machine learning effectiveness.

Slowly, in the 1990s, we saw a shift to data-driven approaches, with the rise in popularity of support vector machines (SVMs). SVMs helped to solve non-linear problems through the kernel trick.
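As a quick illustration of the kernel trick, here is a minimal sketch (assuming scikit-learn is installed; the toy dataset and parameters are arbitrary choices, not from the original article) comparing a linear SVM with an RBF-kernel SVM on data that is not linearly separable:

```python
# Minimal sketch of the kernel trick with scikit-learn (assumed available).
# The two-moons dataset is not linearly separable, so a linear SVM struggles,
# while an RBF-kernel SVM separates it well via an implicit feature map.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_moons(n_samples=500, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

linear_svm = SVC(kernel="linear").fit(X_train, y_train)
rbf_svm = SVC(kernel="rbf", gamma=1.0).fit(X_train, y_train)  # kernel trick

print("linear kernel accuracy:", linear_svm.score(X_test, y_test))
print("RBF kernel accuracy:   ", rbf_svm.score(X_test, y_test))
```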

The 2000s saw increased use of kernel methods and unsupervised machine learning. The 2010s were defined by the feasibility of deep learning, making machine learning integral to many services and applications.

Finally, the 2020s have been characterized by generative AI, leading to revolutionary models like ChatGPT and Stable Diffusion, significantly impacting public consciousness and the commercial potential of AI.

Depending on who you ask, these perspectives may differ in emphasis, but the overall trend is accurate. Today, we are moving towards more complex, non-linear, data-driven problems driven by rich media. This is taking us away from traditional statistical roots because, increasingly, we find that the underlying distribution is unknowable in these problem statements.

Today, people tend to want to apply deep learning because it is in fashion. Two decades ago, that might have been SVMs; two decades before that, the dominant algorithm could have been decision trees or logistic regression.

Understanding the dominant algorithm of each era helps us to decouple the trend effect from the merits of the algorithm itself.

So, here are the main eras and their algorithms:

1950s - 1960s

Linear Regression: A statistical technique for modeling a linear relationship between a response variable and one or more predictors.

1970s

Decision Trees: Introduced as a machine learning algorithm for decision-making.

Nearest Neighbors: The k-nearest neighbors algorithm for pattern recognition.

Generalized Linear Models (GLMs): Introduced by John Nelder and Robert Wedderburn in 1972, extending the linear regression model to accommodate response variables with error distributions other than the normal distribution. GLMs unify various types of regression models, including linear regression for normally distributed responses, logistic regression for binary responses and Poisson regression for count data (a minimal sketch follows after this era list).

1980s - 1990s

Support Vector Machines (SVM): Developed for classification and regression by Vapnik and Cortes.

Random Forest: Introduced as an ensemble learning method (random decision forests appeared in 1995; Breiman's Random Forests formalised the idea in 2001).

Naive Bayes: Probabilistic classifier based on Bayes' theorem.

Efron's Bootstrap Method: Statistical technique for estimating the sampling distribution of a statistic.

Hastie and Tibshirani's work on Generalized Additive Models (GAM): An extension of linear models to include non-linear components.

2000s - 2010s

Kernel Methods (e.g., Gaussian Processes): Non-parametric models for regression and classification.

Deep Learning Resurgence: Breakthroughs in training deep neural networks, leading to the popularity of architectures like Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

2010s - Present

XGBoost (Extreme Gradient Boosting): Powerful gradient boosting algorithm for classification and regression.

Deep Learning Dominance: Advancements in architectures (e.g., Transformer models), pre-training, and transfer learning.
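Picking one item from the list above to make it concrete: GLMs unify several regression families behind one interface. The sketch below (a minimal example assuming statsmodels and numpy are installed; the synthetic data and coefficients are illustrative, not from the article) fits linear, logistic and Poisson regressions with the same GLM call, changing only the family:

```python
# Minimal GLM sketch with statsmodels (assumed available): the same GLM API
# covers linear, logistic and Poisson regression by swapping the family.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200
X = sm.add_constant(rng.normal(size=(n, 2)))            # intercept + two predictors

y_linear = X @ [1.0, 2.0, -1.0] + rng.normal(size=n)     # Gaussian response
y_binary = (rng.uniform(size=n) < 1 / (1 + np.exp(-(X @ [0.5, 1.5, -1.0])))).astype(int)
y_counts = rng.poisson(np.exp(X @ [0.2, 0.5, -0.3]))     # count response

print(sm.GLM(y_linear, X, family=sm.families.Gaussian()).fit().params)
print(sm.GLM(y_binary, X, family=sm.families.Binomial()).fit().params)
print(sm.GLM(y_counts, X, family=sm.families.Poisson()).fit().params)
```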

For example, AdaBoost was introduced in the mid-1990s. It laid the foundations for other boosting algorithms such as XGBoost, CatBoost, gradient boosting machines (GBMs) and LightGBM.

Sometimes we see all of these algorithms grouped together - and missing the chronology (and the prevailing trend) causes you to lose context.

In other words, you would (likely) not use AdaBoost in 2024, but you would need to understand AdaBoost as a foundation for XGBoost (which is likely what you would use).
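As a hedged sketch of that lineage (assuming scikit-learn is installed; the dataset and parameters are arbitrary), AdaBoost and modern gradient boosting expose essentially the same ensemble-of-weak-learners interface, which is why understanding the former helps with the latter:

```python
# Sketch of the boosting lineage with scikit-learn (assumed available):
# AdaBoost (1990s) and gradient boosting (the family XGBoost belongs to)
# both build an ensemble of shallow trees, added one stage at a time.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ada = AdaBoostClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)
gbm = GradientBoostingClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

print("AdaBoost accuracy:          ", ada.score(X_test, y_test))
print("Gradient boosting accuracy: ", gbm.score(X_test, y_test))
# xgboost.XGBClassifier (if installed) is a drop-in alternative with the same fit/score API.
```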

image source: https://pixabay.com/photos/catwalk-models-women-fashion-1840941/



Mark Stouse

CEO, ProofAnalytics.ai | First Global B2B CMO to Prove Marketing's Multiplier Effect | Lectures at USC | Forbes | Martech Columnist | MASB | ANA | GTM50 | Samsung, Honeywell, SFDC

3 months ago

Yet the reality here is obscured by this chronology, and that is that 1. the quest for causal understanding remains the dominant question, and 2. MVR in its various forms remains the most practical collection of algorithms on a daily basis, particularly given the fact that the Lean Data requirements for causal models fit more business realities than Big Data / ML. The most recent research at Stanford into the Big Data supply chain shows that the ability to “feed the beast” after early 2026 will depend almost entirely on synthetic data, accelerating ML’s tendency to regress to a mean. Riffing on the fashion metaphor, we may be going Back to the Future here.

Clyde Johnson

CEO and Founder In2netCISO

3 months ago

Thanks for placing these algorithms in historical context as it makes you really appreciate the evolution of the problem/solution space (anyway at least for me)

Elton Brown

Advisory Committee Member @ SCLAA | Driving AI-powered supply chain solutions

3 months ago

Algorithms fascinated me as a kid. From the Euclidean Algorithm in Greece to Newton's Method and Fourier Transforms, it is quite amazing how humans can conceive of these 'technologies' to help make better decisions faster. The rate of change in the last 70 years has been thousands of times faster than in the preceding 2,200 years. It is mind boggling to think of how fast change could become over the next 70 years. Thank you Ajit Jaokar for your great article.
