登录查看更多内容

Machine Learning vs. Traditional Software Development (ML4Devs Newsletter, Issue 11)

Satish Chandra Gupta

Data/AI Consultant ? I help startups & SMBs build effective, economical, and scalable data/ML/LLM-powered products. ? Ex- Amazon, Microsoft Research

发布日期: 2022年8月18日

In the previous issue, we examined the?MLOps ecosystem . It is a lot more complex compared to traditional software engineering projects. In this issue, let’s understand the differences between:

Traditional programs and Machine Learning
Software Development Life Cycle and ML Project Life Cycle

We will also examine the evolution of software development and the friction in assimilating ML into software development.

Traditional Programs vs. Machine Learning

In traditional programs, a developer designs logic or algorithms to solve a problem. The program applies this logic to input and computes the output.

But in Machine Learning, a model is built from the data, and that model is the logic. ML programs have two distinct phases:

Training:?Input and the expected output are used to train and test various models, and select the most suitable model.
Inference:?The model is applied to the input to compute results. These results are wrong sometimes. A mechanism is built into the application to gather user feedback on such occasions.

This feedback is added to the training data, and this is how a model?learns.

Let’s take the problem of detecting email spam and compare both methods.

Traditional programs detect spam by checking an email against a fixed set of heuristic rules. For example:

Does the email contain FREE, weight loss, or lottery several times?
Did it come from known spammer domain/IP addresses?

As spammers change tactics, developers need to continuously update these rules.

In Machine Learning Solutions, an engineer will:

Prepare a data set: a large number of emails labeled manually as spam or not-spam.
Train, test, and tune models, and select the best.
During inference, apply the model to decide whether to keep an email in the inbox or in the spam folder.
If the user moves an email from inbox to spam or vice versa, add this feedback to the training data.
Retrain the model to be up-to-date with the spam trends.

As you can notice traditional programs are?deterministic, but ML programs are?probabilistic. Both make mistakes. But the traditional program will require constant manual effort in updating the rules, while the ML program will?learn?from new data when retrained.

Software Development Evolution

Before comparing software and ML development life cycle, let’s see how software and its development process have evolved. Every tech gets commoditized and becomes accessible: Scientists to Engineers to Technicians to Everyone.

Up to 1980: Scientist Era

Developing software was an art, and required in-depth knowledge of computer science. There wasn’t much of a process. But computer theory was highly developed.

1980 - 2000: Engineer Era

There was a massive expansion of personal desktop applications, and complex business applications built using multi-tier client-server architecture. Building software required engineering teams with broad computer science knowledge.

领英推荐

Future Of AI In Software Development

IntelliaTech Solutions Pvt. Ltd. 4 个月前

How to effectively implement CI/CD in Machine Learning…

Amaboh Achu 10 个月前

Generative AI Copilots: Elevating the Horizon of Code…

Navveen Balani 1 年前

Waterfall Model was the first development process. It had a single pass of the Requirements, Design, Development, and Test phases. The project failure rate was high. Then the Iterative Development Model evolved having loops of these phases. High-risk issues were tackled first to avoid late failures.

2000 - 2020: Technician Era

Cloud applications arrived. These had mobile and browser front ends and distributed microservice architecture at the backend. Reusable components became available off-the-shelf as open-source software or pay-per-use SaaS. Assembling and developing complex applications became much easier and shorter.

With containers and DevOps, the process evolved into agile Continuous Integration and Continuous Delivery (CI/CD).

As software ate the world, it generated a lot of data. It fueled big data, analytics, data science, and machine learning.

2020 and beyond: Anyone Era

With No/Low-Code and Serverless, software development continues to be commoditized. It will become accessible to anyone and everyone.

Software Development Life Cycle vs ML Project Life Cycle

In the last 10 years, there has been an increasing number of ML-assisted applications operating on big data, with Edge + Cloud architecture. ML project development is often separate from the rest of the software development.

There are two common sources of friction:

Not Iterative:?While the software development follows the CI/CD DevOps loop,?Machine Learning Project Life Cycle ?is more like the waterfall model. Typically a data scientist develops a model and then hands it over to an engineer for implementing it in production.
Not Incremental:?Minor tweaks in requirements can force the data scientists and ML engineers to start over from the data collection and pipeline setup. Development cost is not proportional to the requirement-changes.

The software development life cycle, when it was transitioning from the waterfall to the iterative process, looked similar to how the ML project life cycle is now. Software development?effort estimations were equally unpredictable , and projects had?controversially high failure rates .

As ML tooling is maturing, ML project development is transitioning from waterfall silos to iterative, and from scientists to engineers era.

The Road Ahead

This is not the first time software development is facing a major paradigm shift. Moving to hyper-scale, distributed applications on the cloud was also a fundamental shift in software development. Developers figured out CI/CD to manage requirement changes and deploy cloud applications with unprecedented frequency. The same will happen with ML too.

It takes time to impart tribal knowledge of an organization and a business domain. It takes effort to become an expert in developing production-quality real-world applications. If developers can do that, I bet they can adapt to building ML apps too.

Machine Learning vs. Traditional Software Development (ML4Devs Newsletter, Issue 11)

Satish Chandra Gupta

Data/AI Consultant ? I help startups & SMBs build effective, economical, and scalable data/ML/LLM-powered products. ? Ex- Amazon, Microsoft Research

Traditional Programs vs. Machine Learning

Software Development Evolution

Up to 1980: Scientist Era

1980 - 2000: Engineer Era

领英推荐

2000 - 2020: Technician Era

2020 and beyond: Anyone Era

Software Development Life Cycle vs ML Project Life Cycle

The Road Ahead

Further Readings

ML4Devs

8,869 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

The Rise of AI in Software Development: Key Insights from the 2024 Docker AI Trends Report

Is AI About to Make Developers Redundant?

What are the real upskill needs for software engineers in the AI Era?

AI-Driven Development Model

Transforming Software Quality with Generative AI: Insights and Case Studies

The Collective Power of Multi-Agent LLM Systems: Enhancing AI with Proven Software Development Principles

MLOps Essentials: Doing Machine Learning Operations right with design patterns

The Future of Software Development Careers in the Age of AI and the Cloud

Could AI Replace Software Engineers and When Might That Happen?

Why Software Developers Won't Be Replaced by AI

Traditional Programs vs. Machine Learning

Software Development Evolution

Up to 1980: Scientist Era

1980 - 2000: Engineer Era

领英推荐

2000 - 2020: Technician Era

2020 and beyond: Anyone Era

Software Development Life Cycle vs ML Project Life Cycle

The Road Ahead

Further Readings

ML4Devs

8,869 位关注者

MLOps: All-in-One Platform vs Piecemeal Tools (ML4Devs Newsletter, Issue 18)

2022年12月21日

SQL Renaissance (ML4Devs Newsletter, Issue 17)

2022年11月26日

Which Data Pipeline Orchestration Tool Is Right For?You? (ML4Devs Newsletter, Issue 16)

2022年11月11日

Chasm of AI Security Between Research and Products (ML4Devs Newsletter, Issue 15)

2022年10月28日

What Are Data, Machine Learning, and MLOps Pipelines (ML4Devs Newsletter, Issue 14)

2022年10月8日

AI is Like Teenage?Sex… (ML4Devs Newsletter, Issue 13)

2022年9月23日

Should You Care About MLOps? Why and How Much? (ML4Devs Newsletter, Issue 12)

2022年9月9日

MLOps for Continuous Integration, Delivery, and Training (ML4Devs Newsletter, Issue 10)

2022年8月5日

When to (Not) Use Machine Learning (ML4Devs Newsletter, Issue 9)

2022年7月22日

Why Machine Learning Projects Fail (ML4Devs Newsletter, Issue 8)

2022年7月8日

社区洞察

其他会员也浏览了

The Rise of AI in Software Development: Key Insights from the 2024 Docker AI Trends Report

Is AI About to Make Developers Redundant?

What are the real upskill needs for software engineers in the AI Era?

AI-Driven Development Model

Transforming Software Quality with Generative AI: Insights and Case Studies

The Collective Power of Multi-Agent LLM Systems: Enhancing AI with Proven Software Development Principles

MLOps Essentials: Doing Machine Learning Operations right with design patterns

The Future of Software Development Careers in the Age of AI and the Cloud

Could AI Replace Software Engineers and When Might That Happen?

Why Software Developers Won't Be Replaced by AI