登录查看更多内容

Essential Algorithms Every ML Engineer Needs to Know

Christopher D.

Co-Founder @ Dicer.ai | Serial Entrepreneur

发布日期: 2018年11月19日

Essential Algorithms Every ML Engineer Needs to Know

Machine learning as a field has been around for a long time before deep neural networks took over the scene. Here are a list of the algorithms you need to know, so you can tackle any problem that comes your way. This isn’t an exhaustive list, but your bases will be mostly covered.

Regression Algorithms

Regression algorithms model relationships between variables. Originally a technique from statistics they have become an important tool in every Machine learning engineer’s tool kit.

Common Regression algorithms

Least Squares Regression
Linear Regression
Logistic Regression

Coursera Course by Johns Hopkins on regression models

Clustering Algorithms

Clustering algorithms can divide data points in to groups with similar properties.They work by finding inherent structures in data to best organize data in to distinct groups. Things in the group are more closely related to each other then things in other groups.

There are two types of clustering algorithms. hard clustering refers to when a data point is in a group or not. soft clustering refers to when a data point can belong to many different groups to different degrees.

Common Clustering algorithms

K-means
Hierarchical Clustering

Amazing introductory video on clustering

Dimensionality reduction algorithms

When the number of features is very large compared to the number of data points you have. Dimensional reduction algorithms help you reduce the number of features to only what is necessary for the problem at hand. They can remove redundant or useless features, helping you get better results.

There are two ways that dimensional reduction algorithms work. The first method is through feature selection, where the algorithm picks a subset of the available features. The second way is feature extraction, which reduces the data in a high dimensional space to a lower one.

Common Dimensionality reduction algorithms

Principle component analysis
Low Variance Filter
High Correlation Filter
Random Forests
Backward Feature Elimination / Forward Feature construction

This is not a exhaustive list, just some that I have used. If you want to read up on this some more as well as see the ROI for some of these algorithms check out KDnuggets blog post on it.

Decision tree algorithms

decision trees create models of decisions made on values from your data. A fork is made in the tree structure until there is a prediction for every data point. Their results are easy to understand unlike other algorithms (Deep Learning) and they are easy to use on many different data types.

Common decision tree algorithms:

Classification and Regression Tree
C4.5 and C5.0
Random Forests
Chi-squared automatic interaction detector

Analytics Vidhya has a great article that goes in depth on decision trees. Listing out the different algorithms and their advantages and disadvantages

Deep Learning

The hype behind machine learning and “AI” is caused by deep learning. They are modern versions of artificial neural networks that exploit cheap computation to train ever larger neural networks. They are powerful universal function approximates that have proven their ability in solving some of the hardest problems. See Alpha Go.

Common Deep learning algorithms:

Stacked Auto-encoders
Convolution Neural networks
Recurrent neural networks
Capsule Networks (more information here)

Check out this book snippet. It goes over the major architectures for deep learning.

Take away

If you serious about machine learning you have to understand the tools that are available to you. Having a good understanding of these tools will give you a leg up on any problems you come across.

要查看或添加评论，请登录

Christopher D.的更多文章

The Most Popular Machine Learning Courses

2020年12月3日

The Most Popular Machine Learning Courses

Best 9 Machine Learning Courses and Certifications Machine Learning(ML) is an exciting and fast-paced field. The…
Top AI and Machine Learning Books for Business Leaders

2020年10月15日

Top AI and Machine Learning Books for Business Leaders

Highly Recommended AI and ML Books for Business Leaders Imagine what it would be like if you could get knowledge and…
The Best Artificial Intelligence and Machine Learning Books in 2020

2020年10月7日

The Best Artificial Intelligence and Machine Learning Books in 2020

AI and Machine learning (ML) technologies are rapidly evolving. I mean, we all have witnessed the disruption that the…
Julia Language in Machine Learning: Algorithms, Applications, and Open Issues

2020年4月6日

Julia Language in Machine Learning: Algorithms, Applications, and Open Issues

This research summary is just one of many that are distributed weekly on the AI scholar newsletter. To start receiving…
A Python Natural Language Processing Toolkit for Many Human Languages

2020年3月23日

A Python Natural Language Processing Toolkit for Many Human Languages

Stanza | Python-based NLP Toolkit This research summary is just one of many that are distributed weekly on the AI…

1 条评论
The Engineers Guide to Machine Learning: Data processing | Data Types

2020年3月19日

The Engineers Guide to Machine Learning: Data processing | Data Types

Introduction Wow, what a crazy couple of months. I’ve started a new job as this change has understandably taken up a…

2 条评论
Machine Learning in Python: Main Developments and Technology Trends in Data Science, Machine Learning, and Artificial Intelligence

2020年3月18日

Machine Learning in Python: Main Developments and Technology Trends in Data Science, Machine Learning, and Artificial Intelligence

This research summary is just one of many that are distributed weekly on the AI scholar newsletter. To start receiving…

3 条评论
AI in Games: 5 Ways Developers Can Benefit

2020年3月17日

AI in Games: 5 Ways Developers Can Benefit

Part 1 of a series on using AI to make better mobile games Imagine a digital world, tailor-made for every gamer…
Researchers Found Out That AI Significantly Improves Gleason Grading of Prostate Biopsies by Pathologists

2020年2月19日

Researchers Found Out That AI Significantly Improves Gleason Grading of Prostate Biopsies by Pathologists

This research summary is just one of many that are distributed weekly on the AI scholar newsletter. To start receiving…

2 条评论
Facebook AI: The First End-to-End Many-to-One Multilingual Model for Spoken Language Translation

2020年2月12日

Facebook AI: The First End-to-End Many-to-One Multilingual Model for Spoken Language Translation

This research summary is just one of many that are distributed weekly on the AI scholar newsletter. To start receiving…

2 条评论

See all articles

Essential Algorithms Every ML Engineer Needs to Know

Christopher D.

Co-Founder @ Dicer.ai | Serial Entrepreneur

Essential Algorithms Every ML Engineer Needs to Know

Regression Algorithms

Clustering Algorithms

Dimensionality reduction algorithms

Decision tree algorithms

Deep Learning

Take away

Christopher D.的更多文章

社区洞察

其他会员也浏览了

TensorFlow - Aamir?P

Kaggle “Dogs vs. Cats” Challenge?—?Complete Step by Step Guide?—?Part 2

The misguided intuition I had to unlearn to come to grips with modern machine learning

Mix It Up!!!!

Understanding Types of Classifiers in Machine Learning

A simple CNN In TensorFlow: Practical CIFAR-10 Guide

From Perceptrons to Transformers: The Swift Evolution of Machine Learning

The 10 Deep Learning Methods AI Practitioners Need to Apply

Binary Classification in Neural Networks with Tensorflow

Essential Algorithms Every ML Engineer Needs to Know

Regression Algorithms

Clustering Algorithms

Dimensionality reduction algorithms

Decision tree algorithms

Deep Learning

Take away

Christopher D.的更多文章

The Most Popular Machine Learning Courses

Top AI and Machine Learning Books for Business Leaders

The Best Artificial Intelligence and Machine Learning Books in 2020

Julia Language in Machine Learning: Algorithms, Applications, and Open Issues

A Python Natural Language Processing Toolkit for Many Human Languages

The Engineers Guide to Machine Learning: Data processing | Data Types

Machine Learning in Python: Main Developments and Technology Trends in Data Science, Machine Learning, and Artificial Intelligence

AI in Games: 5 Ways Developers Can Benefit

Researchers Found Out That AI Significantly Improves Gleason Grading of Prostate Biopsies by Pathologists

Facebook AI: The First End-to-End Many-to-One Multilingual Model for Spoken Language Translation

社区洞察

其他会员也浏览了

TensorFlow - Aamir?P

Kaggle “Dogs vs. Cats” Challenge?—?Complete Step by Step Guide?—?Part 2

The misguided intuition I had to unlearn to come to grips with modern machine learning

Mix It Up!!!!

Understanding Types of Classifiers in Machine Learning

A simple CNN In TensorFlow: Practical CIFAR-10 Guide

From Perceptrons to Transformers: The Swift Evolution of Machine Learning

The 10 Deep Learning Methods AI Practitioners Need to Apply

Binary Classification in Neural Networks with Tensorflow