登录查看更多内容

The Most Important Algorithm in the World of Randomness

Jesus Rodriguez

CEO of IntoTheBlock, Co-Founder, Co-Founder of LayerLens, Faktory,and NeuralFabric, Founder of The Sequence AI Newsletter, Guest Lecturer at Columbia, Guest Lecturer at Wharton Business School, Investor, Author.

发布日期: 2018年1月30日

In a previous post, we discussed the relevance of Monte Carlo methods in the deep learning ecosystem as an alternative to more traditional Las Vegas techniques. Essentially, both techniques fall under the umbrella of randomized methods but Las Vegas techniques focused on providing an exact answer while Monte Carlo methods provide an approximate exact answer based on a probabilistic distribution. The efficiency of Monte Carlo techniques when operating in large, multi-dimensional datasets have made it a favorite of deep learning practitioners. From sampling data, to regularization or optimization techniques, Monte Carlo methods have become an important building block of modern deep learning solutions.

There are several Monte Carlo techniques that have been widely implemented in modern deep learning platforms. The best known member of the Monte Carlo family is a technique that brings Markov chains into the world of randomness and is known by the name of Markov Chain Monte Carlo methods (MCMC).

The main objective of MCMC models is to obtain information about distributions using Markov random walks algorithms. This is fancy way of saying that MCMC techniques are able to learn the fundamental attributes of a probabilistic distribution without sampling all its members. Reading this you might be confused. Isn’t the role Monte Carlo methods to draw examples from a distribution? If so, how are MCMCs any different?

The main difference of MCMC methods comes from the usage of Markov chains to generate the samples using a special sequential process. While standalone Monte Carlo methods are able to generate samples from a distribution, there are many scenarios in which there is no tractable methods to draw exact examples from a dataset. Markov chains complement traditional Monte Carlo methods by using a model in which each random sample is used as a stepping stone to generate the next random sample (hence the chain). A unique benefit of the chain is that, although each new sample depends on the one before it, new samples do not depend on any samples before the previous one (this is the “Markov” property).

MCMC in Action

Let’s use a classic example in machine learning literature to illustrate the value of MCMC models. Suppose that a professor is interested in learning the average of test scores in a student population. While the mean test score is unknown, the lecturer knows that the scores are normally distributed with a standard deviation of 15. So far, the lecturer has observed a test score of a single student: 100. One can use MCMC to draw samples from the target distribution, in this case the posterior, which represents the probability of each possible value of the population mean given.

In order to draw samples from the distribution of test scores, MCMC starts with an initial guess: just one value that could be drawn from the distribution. Let’s assume this initial guess is 110. MCMC is then used to produce a chain of new samples from this initial guess. Each new sample is produced by two simple steps: first, a proposal for the new sample is created by adding a small random perturbation to the most recent sample; second, this new proposal is either accepted as the new sample, or rejected (in which case the old sample retained). By continuously repeating this process, an MCMC model should produce a series of samples that are very close to the original probabilistic distribution.

要查看或添加评论，请登录

Jesus Rodriguez的更多文章

Robust Agents Are All We Need: Faktory Emerges from Stealth Mode with a Private?Alpha

2024年2月28日

Robust Agents Are All We Need: Faktory Emerges from Stealth Mode with a Private?Alpha

Last year, I had the unique opportunity to incubate a new project in the autonomous agents space, alongside a…

1 条评论
Google’s BLEURT is BERT for Evaluating Natural Language Generation Models

2020年5月27日

Google’s BLEURT is BERT for Evaluating Natural Language Generation Models

Natural language generation(NLG) is one of the fastest growing areas of research in deep learning. NLG applications are…
Two Deep Learning Frameworks and an AI Super-Computer: Microsoft Launches New Efforts to Achieve Large-Scale AI

2020年5月25日

Two Deep Learning Frameworks and an AI Super-Computer: Microsoft Launches New Efforts to Achieve Large-Scale AI

Training models with massive datasets is becoming the norm in modern deep learning applications. Some of the latest…
Uber Open Sources a New Framework for Designing Optimal Statistical Experiments

2020年5月18日

Uber Open Sources a New Framework for Designing Optimal Statistical Experiments

Rapid experimentation is a key element of modern software development. The raise in popularity of machine learning, has…
Uber Unveils Its New Data Quality Management Solution

2020年5月13日

Uber Unveils Its New Data Quality Management Solution

Data quality management is one of those often forgotten aspects of machine learning workflows. Small inconsistencies or…
LinkedIn Open Sources a Small Component to Simplify the TensorFlow-Spark Interoperability

2020年5月7日

LinkedIn Open Sources a Small Component to Simplify the TensorFlow-Spark Interoperability

Interoperating TensorFlow and Apache Spark is a common challenge in real world machine learning scenarios. TensorFlow…
Google Unveils TAPAS, a BERT-Based Neural Network for Querying Tables Using Natural Language

2020年5月6日

Google Unveils TAPAS, a BERT-Based Neural Network for Querying Tables Using Natural Language

Querying relational data structures using natural languages has long been a dream of technologists in the space. With…
Facebook Open Sources Blender, the Largest-Ever Open Domain Chatbot

2020年5月4日

Facebook Open Sources Blender, the Largest-Ever Open Domain Chatbot

Natural language understanding(NLU) has been one of the most active areas adopting state-pf-the-art deep learning…

2 条评论
Microsoft Research Unveils Three Efforts to Advance Deep Generative Models

2020年4月27日

Microsoft Research Unveils Three Efforts to Advance Deep Generative Models

Generative models have been an important component of machine learning for the last few decades. With the emergence of…
Facebook and Amazon Bring Two Projects to PyTorch 1.5 that Streamline the Lifecycle of Production-Ready Deep Learning Models

2020年4月22日

Facebook and Amazon Bring Two Projects to PyTorch 1.5 that Streamline the Lifecycle of Production-Ready Deep Learning Models

PyTorch is one of the fastest growing open source projects in the deep learning space. Initially incubated by Facebook,…

See all articles

The Most Important Algorithm in the World of Randomness

Jesus Rodriguez

CEO of IntoTheBlock, Co-Founder, Co-Founder of LayerLens, Faktory,and NeuralFabric, Founder of The Sequence AI Newsletter, Guest Lecturer at Columbia, Guest Lecturer at Wharton Business School, Investor, Author.

MCMC in Action

Jesus Rodriguez的更多文章

社区洞察

其他会员也浏览了

Deep Learning Resources and Study Path For Aspiring Data Scientist

My Review on Deep learning Book "The Deep Learning with Keras Workshop"

From Research To Reality: Deep Learning Methods on Time Series Forecasting on Financial Data

What is machine learning, and how does it differ from other algorithms, particularly deep learning?

posteriors: Normal Computing’s library for Uncertainty-Aware LLMs

A simple CNN In TensorFlow: Practical CIFAR-10 Guide

Evolution of Machine Learning: From Regression to Transformers Models

Configure Deep Learning Architecture

Motivation for Integrating Symbolic Mathematics with Deep Learning

Deep Learning Reading List: The Essentials

MCMC in Action

Jesus Rodriguez的更多文章

Robust Agents Are All We Need: Faktory Emerges from Stealth Mode with a Private?Alpha

Google’s BLEURT is BERT for Evaluating Natural Language Generation Models

Two Deep Learning Frameworks and an AI Super-Computer: Microsoft Launches New Efforts to Achieve Large-Scale AI

Uber Open Sources a New Framework for Designing Optimal Statistical Experiments

Uber Unveils Its New Data Quality Management Solution

LinkedIn Open Sources a Small Component to Simplify the TensorFlow-Spark Interoperability

Google Unveils TAPAS, a BERT-Based Neural Network for Querying Tables Using Natural Language

Facebook Open Sources Blender, the Largest-Ever Open Domain Chatbot

Microsoft Research Unveils Three Efforts to Advance Deep Generative Models

Facebook and Amazon Bring Two Projects to PyTorch 1.5 that Streamline the Lifecycle of Production-Ready Deep Learning Models

社区洞察

其他会员也浏览了

Deep Learning Resources and Study Path For Aspiring Data Scientist

My Review on Deep learning Book "The Deep Learning with Keras Workshop"

From Research To Reality: Deep Learning Methods on Time Series Forecasting on Financial Data

What is machine learning, and how does it differ from other algorithms, particularly deep learning?

posteriors: Normal Computing’s library for Uncertainty-Aware LLMs

A simple CNN In TensorFlow: Practical CIFAR-10 Guide

Evolution of Machine Learning: From Regression to Transformers Models

Configure Deep Learning Architecture

Motivation for Integrating Symbolic Mathematics with Deep Learning

Deep Learning Reading List: The Essentials