Are you testing your ML models to the same extent as your other software?
Unit, Integration, Functional, End-to-End, Acceptance, Performance, Smoke, A/B Testing... the list could probably go on for a while.
Chances are, if you are building software (and these days, who isn't?), you are doing most, if not all, of the above as a standard part of your SDLC (software development lifecycle). You are confident in your test coverage and keep working to improve it so that your DORA change failure rate is as low as possible. Great! This is where you either are now, or are looking to get to, with aspirations of continuous delivery. Automate all the things and make your devs' lives that much easier, catching and correcting problems before they reach production. Hopefully, you've shifted testing far enough left that when your developers merge into the repo, they get all the information they need to know whether what they built was good enough - before it goes anywhere near the ops or QA teams. (*cough* including security testing! *cough*)
The devs are happy because they have this quality gate to ensure they are doing their best possible work, with some guard rails in case anything gets borked when they merge back into main. The portfolio managers are happy because they get some nice dashboards and reports showing how the whole portfolio is performing in terms of quality. Engineering managers likewise get to see how well each code base is doing, and perhaps where they need to set aside a sprint to pay back some technical debt.
In other words, if you are building software, you probably have your QA process well defined and mature. It may well be that your CI pipelines can make your devs the first part of your QA process.
So, if you are doing all of this testing for your SDLC, are you also doing the same for your machine learning development?
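To make the question concrete, here is a deliberately simple, tool-agnostic sketch of what "testing a model like software" could look like: a couple of pytest-style checks that run automatically before a model candidate is promoted. The dataset, the 0.85 threshold and the helper names are all hypothetical, chosen purely for illustration rather than taken from any particular tool.

# test_model_quality.py: a hypothetical "unit test" for a trained model.
# Assumes scikit-learn and pytest; the quality gate and the data/model below
# are stand-ins for your own training pipeline output.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

def train_candidate_model():
    # Stand-in for "fetch the latest model candidate from your pipeline".
    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
    model = RandomForestClassifier(n_estimators=100, random_state=42).fit(X_train, y_train)
    return model, X_test, y_test

def test_performance_above_threshold():
    # The ML equivalent of a functional test: fail the build if quality regresses.
    model, X_test, y_test = train_candidate_model()
    auc = roc_auc_score(y_test, model.predict_proba(X_test)[:, 1])
    assert auc >= 0.85, f"AUC {auc:.3f} is below the agreed quality gate of 0.85"

def test_predictions_stable_under_small_noise():
    # A simple robustness check: tiny input perturbations should not flip
    # a large share of predictions.
    model, X_test, _ = train_candidate_model()
    rng = np.random.default_rng(0)
    noisy = X_test + rng.normal(scale=0.01 * X_test.std(axis=0), size=X_test.shape)
    flipped = np.mean(model.predict(X_test) != model.predict(noisy))
    assert flipped < 0.05, f"{flipped:.1%} of predictions flipped under small noise"

Nothing sophisticated there, and that's the point: it deliberately mirrors the unit and functional testing conventions your devs already live by, just applied to a model instead of a service.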
If you are, great! We would love to hear more about what you are doing and how it is going for you.
If not, let's talk. TruEra (the company I work for) has built a model testing suite. It's based on our own academic research into the explainability of AI/ML models, and it is all about helping you improve the quality of, and trust in, the work your DS teams are doing. We're still young - only a couple of years old - but we are seeing a large uptick in interest from teams wanting to speed up model validation, improve collaboration and explainability for non-DS stakeholders around the business, and make sure they comply with any current or future regulations (hint: the EU AI regs will likely become law in the next 24 months; NYC already has rules for HR hiring; Singapore has produced guidance; the UK is going to spin out its own flavour; etc etc).
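On the explainability point specifically: leaving TruEra's own tooling aside (I won't try to reproduce its API from memory here), one generic, open-source way teams surface explanations as part of validation is permutation importance, for example via scikit-learn. Again, the data and model below are placeholders, purely for illustration.

# Tool-agnostic explainability as part of validation: permutation importance
# via scikit-learn. Nothing here is TruEra-specific; swap in whatever model
# and hold-out set you are actually validating.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(data.data, data.target, test_size=0.3, random_state=42)
model = RandomForestClassifier(n_estimators=100, random_state=42).fit(X_train, y_train)

# Shuffle each feature in turn and measure how much the test score drops;
# the resulting ranking is something non-DS stakeholders can read in a report.
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=42)
ranking = sorted(zip(data.feature_names, result.importances_mean), key=lambda pair: -pair[1])
for name, importance in ranking[:5]:
    print(f"{name}: {importance:.3f}")

Put output like that into the same dashboards and reports the portfolio and engineering managers already look at, and the model stops being a black box in the quality conversation.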
So what do you think? Whether you are on the DS or Dev side of the house, let me know (a) if this is just a false analogy and you can't assume the model and software dev processes have the same testing requirements; (b) if this is bang on and something you hadn't considered before; or (c) if I should leave the writing to the professionals in future... ;)