"Fail fast" vs Machine learning.
Chris Pedder
Chief Data Officer @ OBRIZUM | Board advisor | Data transformation leader | Posting in a personal capacity.
Yep, you read that right. There can be only one...
Of course, not really. But there are issues. To get there, first we have to duck back into a time when Highlander was still a thing. Well, almost. The first big tech boom of the internet bubble created the conditions in which many of the modern giants were forged: a time of easy money and a fail-fast culture. Software development, which had been a careful, herbivorous process before, became stripped down and lean, fast and effective, and this new approach saw the launch of some of the biggest companies out there today. Three in particular succeeded off the back of apparently very different approaches: Amazon (fastest) got there first and strove to maintain its lead, Google (smartest) leveraged its research capabilities, and Apple (cutest) out-marketed everyone else. People saw the new era, and they wanted to embrace the approach to software that seemed to make these fast-moving companies succeed.
Agile at its core is a very sensible approach to software development. It's basically local optimization under constraints - for those of you familiar with machine learning, it's gradient descent. Take a look at where you are, take some time (usually a two-week sprint) to look around, follow the path that leads to the next local optimum you can see, rinse and repeat. Except there's a third rule in there that often gets forgotten, but in many ways is the most important of them all: if your path downhill bifurcates, take the branch that looks like it has more options in the future, rather than hemming yourself in. Just as good software engineers know this, so do good chess players - choose mobility over material, or end up in the dreaded Zugzwang position (https://en.wikipedia.org/wiki/Zugzwang).
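To make the analogy concrete, here's a minimal sketch (mine, not from any agile handbook - all names and parameters are illustrative) of sprint-by-sprint work as gradient descent: each sprint evaluates the local slope and steps toward the nearest improvement.

```python
# Each "sprint" looks at the local gradient and takes a small step downhill.
# The function f(x) = (x^2 - 1)^2 has two equally good minima, at x = -1 and
# x = +1; local search starting on the left only ever finds the left one.

def sprint_step(x, gradient, lr=0.01):
    """One sprint: look around (evaluate the gradient), step toward the local optimum."""
    return x - lr * gradient(x)

def local_search(x0, gradient, sprints=200, lr=0.01):
    x = x0
    for _ in range(sprints):
        x = sprint_step(x, gradient, lr)
    return x

# Gradient of (x^2 - 1)^2 is 4x(x^2 - 1).
grad = lambda x: 4 * x * (x**2 - 1)

# Starting at x = -2, the search settles near -1 and never "sees" the
# equally good optimum at +1, however many sprints you run.
print(round(local_search(-2.0, grad), 3))
```

The point of the toy: rinse-and-repeat local steps reliably reach *a* optimum, but which one you get is fixed by where you started.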
So why do I think this is problematic? Well, in principle I don't - the problem emerges not from the implementation, but from tradition. Machine learning systems are fundamentally different from non-ML software. For one thing, they are dynamic, almost living things: there is no real concept of doing things so they stay done. As new data comes along, you need to retrain and evolve your model just to keep doing the same thing. They also have many moving parts which cannot be individually optimized. In normal code development, you can write an object or a routine, optimize it, and then incorporate it into a larger ecosystem. Machine learning systems aren't amenable to this modular approach - in fact, that is kind of the point. They are, after all, *non-linear* function approximators, so it's probably not surprising that you can't build them out of linearly connected modules and have them work optimally. Instead, they require a global search, and this is where a system like agile, designed for local exploration, can lead you into problems. Like a lot of research problems, you can spend your whole two-week sprint exploring dead space and producing no visible progress. Adherents of fail fast would say "okay, you're in a dead end, stop", but the reality is that you are rarely wasting time - you're doing the training you need to make progress. Like a reinforcement-learning robot, you have to fall over more than a few times before you get to the finish line.
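The local-versus-global distinction can be shown with a toy of my own (not from the post): a greedy local search stalls in the nearest shallow basin, while a "research-style" budget that tolerates many failed restarts - sprints that look like dead ends - eventually lands in the deeper one.

```python
import random

def loss(x):
    # Two basins: a shallow local minimum at x = 0 (loss 1.0) and a
    # deeper global minimum at x = 5 (loss 0.0).
    return min(x ** 2 + 1.0, (x - 5) ** 2)

def greedy_descent(x, step=0.1, iters=500):
    """Sprint-style local search: only ever accept the best nearby move."""
    for _ in range(iters):
        x = min((x - step, x, x + step), key=loss)
    return x

def with_restarts(restarts=20, seed=0):
    """Global, research-style search: many restarts, most of which 'fail'."""
    rng = random.Random(seed)
    best = None
    for _ in range(restarts):
        x = greedy_descent(rng.uniform(-10, 10))
        if best is None or loss(x) < loss(best):
            best = x
    return best
```

Starting near zero, `greedy_descent` converges to the shallow basin and stays there; `with_restarts` spends most of its budget on runs that end up no better, yet comes back with the deeper optimum. The wasted-looking restarts are exactly the training the paragraph above describes.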
So how do we do it better? Well, coming from a physics research background, ML projects are much more like research problems than they are conventional software development. Whilst there are many problems with the practical details of how physics research is funded, we can take away some salient lessons on what works well. So here are my suggestions.
- Give ML staff the freedom to look for their own applications within the business, rather than giving them problems to solve. They will naturally seek out problems that are most amenable to what they can do.
- Allow them to "apply" for time to work on projects. The initial block of time should be easy to get, and allow for them to make mistakes without those mistakes being detrimental to the project's success. The average time to build an ML model in industry is 59 working days, with even longer needed to productionize.
- Allow extensions to the initial period contingent on there being measurable results, and a high probability of success during the extension period.
- Involve non-ML team members in ML projects, and have your ML staff present and explain their work to the rest of their colleagues. Under no circumstances allow ivory towers to be built!
- Work hard on expectation management - it's a hot field, it's very hyped, and there are more than a few people out there who believe in magic. Magic is for kids; a lot of sweat and a lot of wrong turns go into making a working ML model.
Based on my experiences of working in deep learning, I suggest an update to the old adage for the machine learning age. It shouldn't be "fail fast". It should be "work fast, fail often, succeed when it matters most".