登录查看更多内容

Why We Code

Thomas W. Dinsmore

I write about machine learning tools and software.

发布日期: 2022年8月24日

"Code-first data science makes no sense." I guess that's true if you spend all day listening to vendor hype. If you talk to real data scientists, you know there's a reason they work with code. Here are some of them:

Speed. It takes seconds to write a line of code. “No-code” tools are slow. Click on the menu, the sub-menu, and the sub-sub-menu. Drag an operator to the canvas. Right-click and configure the operator.?

Working with “no-code” tools is like living in hell when you know how to code.???

Functionality. Code is richer than “no-code” tools, often 5x-10X richer. Compare any drag and drop UI with a code-based API for the same software. Software developers expose new features in APIs before building them into a “no-code” interface. Many features will never make it into the no-code version. That’s because there’s a limit to the number of operators you can stuff into a graphical UI.

Flexibility. Since “no-code” tools support fewer features, developers focus on ones they think are essential. That’s great if you do simple analysis; otherwise, you’re out of luck. Code is infinitely flexible. That’s why every commercial “no-code” tool for data science includes a “code node” capability where the user can insert code.

I’ve seen work done in a “no-code” tool where every single node in the workflow is a code node.?

Transparency. Code is what it is, and it’s open for inspection. That’s not always true for “no-code” tools. Data scientists are accountable for the accuracy of the work they do. When the analysis is wrong, you can’t blame the tool. The processing pipeline is completely visible when you work with code, from data to insight.

领英推荐

Deployment as a Critical Business Data Science…

Tom Davenport 4 年前

The Unsung Hero of Data Science: Delving into Data…

Iain Brown PhD 1 年前

The Data Advantage Matrix, reshaping data engineering…

Prukalpa ? 3 年前

Efficiency. Nobody codes a project from scratch. Data science teams curate and share reusable code components. You can tweak and tune code to improve runtime performance and minimize the impact on computing infrastructure. That’s not possible with no-code tools.?

Working with code does not rule out working with innovations like AutoML. Every leading AutoML tool supports a code-based API. The best AutoML tools are extensible; expert data scientists can add code to the algorithm. Many AutoML tools deliver pipelines as code packages that experts can review, modify and tune.

Coding skills are not scarce. The pool of people with Python skills greatly exceeds the pool of experienced data scientists. Of course, knowledge of Python does not make one a data scientist. Data scientists require knowledge and skills that are well beyond programming. But that is precisely the point – coding skill is not the critical bottleneck limiting the supply of data scientists.

"Code-first" data science is fast, functional, flexible, transparent, and efficient. Data scientists working with code can use the most advanced innovations in machine learning and artificial intelligence. Every prospective data scientist already knows at least one programming language; many are multi-lingual.

“No-code” tools are fine for some tasks. Graphics, for example. Dashboards. Data scientists often use these tools together with code-based tools when necessary. “No-code” tools are also great for simple analysis. Many managers and analysts prefer “no-code” tools, and that’s fine.?

But don’t confuse managers and analysts with data scientists. If I can screw in a lightbulb, that does not make me a “citizen electrician.” It makes me a homeowner with a lightbulb.

James Pearce

Statistician. Computer scientist. Data scientist.

2 年

Good article, and I know I am late to the party. When arguing for code and against no-code at organisations I have worked at, there are two other salient points. 1. Code is easier to test and put into production than no-code. It's understood by engineers and the technology arms of the organisation. 2. No-code tools lock you into vendors completely. How long will the vendor be around and remain relevant?

1 次回应

Christian Kaul

Data Modeling Aficionado and Senior Technical Consultant at virtual7 GmbH

2 年

I get what you’re trying to say but I would argue there are quite a few people that you don’t necessarily want to write code because they will write lots of ugly, redundant, hard-to-maintain code that someone (and you might very well be that someone) has to sort out and clean up later.

Cynthia O'Rourke

Data Scientist | Views expressed on LinkedIn are solely my own.

2 年

They both make sense, which is why SAS has been offering both code-based and point-and-click stats software for as long as I can remember, and I've watched heavy users of either (but almost never both) work with them daily in the biological sciences. For me, the code-v-no-code innovation around "no-code" ML (as opposed to point-and-click stats) packages is that they integrate with code-first ML packages. Coding gives you freedom and no-coding gives you abstraction. Having access to both at once is fantastic.

2 次回应

Des Viranna

C-Suite Executive | Digital & Data Transformation Leader | Driving Revenue Growth & Innovation | AI & GenAI Expertise | Ex-Microsoft

2 年

Does anyone find trend 1 concerning? Having someone who doesn’t understand how to interpret the model outputs create and deploy a model can be dangerous. (And yes, this also is a few years behind!)

3 次回应

Dee Acosta ??

GTM AI / Growth Driver / Trusted B2B Advisor / Operator / Perennial $1mm Quota Achiever

2 年

Only someone who doesn't know how to code would write coding makes no sense.

1 次回应

查看更多评论

要查看或添加评论，请登录

Thomas W. Dinsmore的更多文章

1,095 Days

2024年2月15日

1,095 Days

Three years ago today, my son killed himself. I want to thank everyone – friends, colleagues, and strangers – for the…

20 条评论
AI Will Take Your Job!!!

2024年2月12日

AI Will Take Your Job!!!

The Wall Street Journal says AI is coming for our jobs! ?? ?? ?? Uh-huh. Let's unpack this.

17 条评论
Yet Another Public Health Crisis

2024年1月22日

Yet Another Public Health Crisis

There’s a public health emergency brewing here in Massachusetts. Steward Health Care, a private company that operates…

17 条评论
More on AI Venture Funding

2024年1月16日

More on AI Venture Funding

Last week, I wrote about the whales-and-minnows pattern in 2023 AI venture funding. A picture is worth a thousand…

1 条评论
AI Venture Funding in 2023

2024年1月9日

AI Venture Funding in 2023

In 2021, venture funding for AI startups peaked at over $70 billion. Overall funding declined in 2022 and again in 2023.

11 条评论
We're In A Bubble When...

2023年12月14日

We're In A Bubble When...

You can play a fun game with friends while waiting in line at the Employment Office. Take turns completing this…

19 条评论
February 15, 2021

2023年2月15日

February 15, 2021

Two years ago today, my son Thomas killed himself. He was four weeks short of his 30th birthday.

83 条评论
Boom

2022年7月21日

Boom

The gaslighting never stops. Today, Dan Wright announced that he “made the difficult decision to step down as CEO.

28 条评论
Turmoil at DataRobot

2022年7月19日

Turmoil at DataRobot

DataRobot markets a top-notch product, and employs hundreds of talented and capable people, many of whom I think of as…

36 条评论
Retirement? No.

2019年7月10日

Retirement? No.

Earlier this year, I turned 65. Instantly, AARP sent me junk mail.

17 条评论

See all articles

Why We Code

Thomas W. Dinsmore

I write about machine learning tools and software.

领英推荐

Thomas W. Dinsmore的更多文章

社区洞察

其他会员也浏览了

Exclusive Sneak Peak At What Is Data Science!

Subject: ?? DATA Pill #124 - SQL Has Problems, RAG API, QueryGPT

Data Science 2.0: From Analytic Outputs to Business Outcomes

Meet Chanakya: The Platform Behind Anko’s Data-Driven Solutions

A Very Modern Data Stack

Refined Thinking like a Data Scientist Series

Analytics and Data Science News for the Week of February 14; Updates from BARC, Databricks, DataRobot & More

Building Industry-Level Data Science Projects: A Step-by-Step Guide.

Three conversations about data

How to get published in Gartner’s Magic Quadrant for Data Science and Machine-Learning Platforms?

领英推荐

Thomas W. Dinsmore的更多文章

1,095 Days

AI Will Take Your Job!!!

Yet Another Public Health Crisis

More on AI Venture Funding

AI Venture Funding in 2023

We're In A Bubble When...

February 15, 2021

Boom

Turmoil at DataRobot

Retirement? No.

社区洞察

其他会员也浏览了

Exclusive Sneak Peak At What Is Data Science!

Subject: ?? DATA Pill #124 - SQL Has Problems, RAG API, QueryGPT

Data Science 2.0: From Analytic Outputs to Business Outcomes

Meet Chanakya: The Platform Behind Anko’s Data-Driven Solutions

A Very Modern Data Stack

Refined Thinking like a Data Scientist Series

Analytics and Data Science News for the Week of February 14; Updates from BARC, Databricks, DataRobot & More

Building Industry-Level Data Science Projects: A Step-by-Step Guide.

Three conversations about data

How to get published in Gartner’s Magic Quadrant for Data Science and Machine-Learning Platforms?