A Million Monkeys On Typewriters –?What Traditional PhDs Missed About Machine Learning
Uri Pomerantz
Applying AI and fintech for good | Entrepreneur | Investor | Dad | Writer
Introduction
In this blog post, I want to explore what traditional PhDs got wrong about machine learning over the last few decades.?
This involves a bit of throwing academia under the bus, but it's crucial to explain how fundamental research is done in academia and what was missed in terms of the breakthroughs in AI.?
This insight is valuable whether you're involved in venture investing or running a startup.
My Experience at Harvard & The Traditional Academic Approach to Research
Years ago, I was a grad student at Harvard studying advanced econometrics.?
My studies included statistics, econometrics, and both micro and macro theory—essentially the first two years of a PhD in economics.?
In this academic setting, you start with a very careful thesis of the world and then run statistical models to prove whatever you're out to demonstrate.?
For example, you might investigate the biggest drivers of attainment from an educational program: is it the quality of the schools, the teacher, the location, the materials, or the availability of a school lunch program??
You then create randomized trials, ideally triple blind, and develop a very careful theory of the world.?
This theory involves independent variables that should have some type of connection, ideally a causal connection, with a dependent variable that you're trying to predict.
Pre-baked Regression Superstar
This research process often takes the shape of some type of regression.?
You carefully track and measure a set of different variables, trying to create some kind of instrumental variable or connection to a dependent variable.?
You predict something based on your theory of the world, carry out a carefully planned experiment, do a whole bunch of statistical cleanup, and arrive at a conclusion.?
You present it, get it peer-reviewed, published, and then syndicate.
The Evolution of Machine Learning
In contrast, the modern approach, particularly in machine learning, involves a lot more guesswork and fine, continuous tweaks at scale.?
This process can be likened to having a million monkeys at typewriters miraculously creating a beautifully published Encyclopedia Britannica through a bit of training and sheer brute force.?
This is essentially how machine learning works: guesswork and fine continuous tweaks (aimed at minimizing whatever loss function you define) until brilliance emerges.
领英推荐
Why Does This Work?
Machine learning involves predicting some kind of outcome using features or variables.?
The process, at a high level for supervised learning, involves five steps:
Implications for Venture Investors and Executives
The Importance of Data
First, it's all about the data.?
With bigger models and larger context windows, what you feed into the model matters immensely. Whether it's your code base, every Slack message from your company, emails, customer interactions, or videos, the quality of data determines the quality of your result.?
This is why, for example, the Reddit IPO popped due to a significant contract with Google focused on data quality.
The Future: Synthetic Data
Looking ahead, venture investors should consider the opportunities for synthetic data—where we can manufacture data and build models that can more effectively train themselves. This can speed up the process and improve results further.
Broader Societal Implications
For broader society, it's all about encouraging access. We're starting to see the first performant open-source models, which hopefully will become the norm. With this technology becoming free to access for standard consumer use cases, countries and companies can gain significant advantages.
The Role of Smart Government
Countries thinking about building sovereign clouds can look to modern equivalents of Singapore, which transformed from a poor country in the 1960s to one of the wealthiest per capita through a focus on services and the knowledge economy.?
There's a significant play for smart government and companies training their employees, building the right incentives, and encouraging open-source software and accessibility (just ask Larry Ellison and some of the leadership at Oracle about how they see the future unfolding).
Conclusion
We are living in a world where proverbial million monkeys are producing brilliance.?
Now, in practice, you’ll be using things like an optimized back propagation algorithm (using calculated gradients rather than random values) to speed up and optimize the training process and get better results with less compute and time.
But in essence, you’d get similar results if you just randomly tweaked the parameters yourself (again, this is very time-consuming if you’ve got hundreds of billions of parameters).?
However, overall—with quality data and adequate compute power, you can achieve better results and succeed—and you don’t need a PhD or a carefully pre-crafted theory of the world to have your machine learning work bear fruit.
Creating Creators; Georgetown Professor & Founder of Manuscripts
3 个月Great point! It's fascinating how machine learning leverages vast amounts of data and computational power to uncover patterns that traditional methods might miss. ??