The future of analytics may be “low/no-code” … at first

You can’t get through much data-scrolling these days without hearing about the rise to dominance of so-called “low- and no-code” data tools. “It’s bringing data to the people,” the typical post reads. “You no longer need to learn how to code to perform sophisticated insert analytics project here.”

I absolutely see the draw of low- and no-code data solutions: after how many tedious rounds of import pandas as pd and object of type 'closure' is not subsettable errors, who wouldn’t? That said, I don’t see this trend as being as revolutionary or as permanent as it’s often claimed to be. Here’s why.

It’s an expected part of the adoption cycle

Take as an example of low- and no-code data products the ability to build Azure machine learning models right from Excel. It’s wild stuff, and there’s no question that it’s democratizing machine learning like never before. What’s not asked, though, is what came before this innovation, and what comes next.

Innovation often arrives in waves, with each trend building on, and ultimately supplanting, the last. Analytics software is no exception. Each wave has been built on code, rolled out to wider audiences by low- and no-code graphical user interfaces (GUIs), and then supplanted again by code.

Waves of analytics innovation

The waves of analytics software: low- and no-code is arrived at, then supplanted.

This is a greatly simplified adoption wave: doing it justice would require a dissertation (and I’ve studied innovation at the doctoral level, so I’m not kidding). The upshot is that low- and no-code has come and gone in analytics tools; let’s look at some examples.

Case study: SPSS

SPSS (Statistical Package for the Social Sciences) began in the late 1960s and, by the next decade, was joined by S and SAS as a new wave of tools for exploratory data analysis and statistical programming.

Back then, computer scripts generally needed first to be compiled into a machine-readable file and then run; this made it difficult to manipulate, visualize, and analyze data on the fly. These tools were novel in that they allowed bits and pieces of a script to be executed, and their results printed, immediately, which greatly enabled iteration. Analysts could now focus on the data analysis rather than on compiling the code.
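That style of iteration is what modern interpreted environments take for granted. A minimal Python sketch (the data here is made up for illustration): in an interactive session, each statement runs and prints immediately, with no compile step between a tweak and its result.

```python
# In an interactive session (REPL or notebook), each line executes as soon as
# it's entered -- tweak a calculation, see the result, repeat.
scores = [88, 92, 79, 95]          # illustrative data
mean = sum(scores) / len(scores)   # run it, inspect it, adjust it
print(mean)                        # 88.5
```

This is the same tight feedback loop that made the SPSS/S/SAS generation feel revolutionary in its day.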

At some point (and I’m not able to find the exact launch, so if someone does know, please get in touch!) SPSS went a step further and added menu-driven options for writing programs. All menu choices generated syntax that could be saved, but the idea was that analysts could focus even less on the code and more on the data, further democratizing analysis. Technically savvy statisticians no longer had a monopoly on working with data.

SPSS example menu

Low- and no-code can get messy, as SPSS can attest. (Source: Greg Elvers, University of Dayton)

One fruit of this “no- and low-code” implementation is the above menu screenshot. There’s no one-size-fits-all answer to working with data, so trying to accommodate every need in a single menu can result in, let’s say, a bloated interface. I used SPSS in grad school, and while it was great to point-and-click my way to data analysis, and so focus on the conceptual bits of the work, I quickly found it easier just to write the code than to navigate the menus. So, the SPSS syntax generation is a blessing… but it’s not the be-all, end-all.

SPSS’s menu was just one product of the computing revolution driven by the GUI. As point-and-click options, GUIs offered relatively low- or no-code features to data users. Another result of this revolution was the spreadsheet, which in the opinion of many was the first “killer app” of the personal computer. Business users now had computing ability at their fingertips, without necessarily needing to code.

Some assembly always required

Let’s stick with spreadsheets, because they’re facing the same GUI dilemma as SPSS in the age of what I am calling “personalized cloud computing”: computer applications that rely on cloud capabilities for storing and working with data.

Excel’s Power Query is a show-stopping innovation that lets users build extract, transform, load (ETL) pipelines right from a spreadsheet. (Similar tools for low/no-code data prep include Alteryx, Tableau Prep, etc.) While based on the M programming language, it includes menu-driven syntax generation, much like SPSS. Though not a cloud application per se, Power Query is part of Microsoft’s larger “Power Platform,” which is largely marketed as a cloud (and low/no-code) solution.

Its menus can be used for most of a user’s needs… but not all. And indeed, a rite of passage for Power Query users is that they begin writing their own code:

Recently, I was trying to add an index number by sub-group in Power Query; this took quite a bit of doing between Power Query’s menus and custom M code. By the end, I asked myself: was this really any easier than just writing code? After all, Power Query doesn’t offer a dedicated integrated development environment with package management, like R or Python. It’s an odd soup of GUI and code, somewhat like SPSS.
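For contrast, here is that same task in a code-first tool. A minimal pandas sketch (the column names and data are made up for illustration): numbering rows within each sub-group is a one-liner with a grouped cumulative count.

```python
import pandas as pd

# Illustrative sales data: we want a running index (1, 2, 3, ...) within each region.
df = pd.DataFrame({
    "region": ["East", "East", "West", "East", "West"],
    "sale":   [100, 250, 80, 40, 300],
})

# Number the rows within each sub-group; cumcount() starts at 0, so add 1.
df["index_in_group"] = df.groupby("region").cumcount() + 1

print(df)
# The East rows are numbered 1, 2, 3 and the West rows 1, 2,
# in their original row order.
```

One line of grouped logic versus a trip through several menus; this is the trade-off the anecdote above is pointing at.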

Working with data is messy in more ways than any of us can count. And it’s this ambiguity that makes building a rigid set of menu options so difficult. I’ve yet to see a GUI that easily accommodates everything I want to do with my data in a simple user experience. Can it be done? Possibly, given future technologies. But learning to code has unlocked far more possibilities for me than any GUI ever has. So why wait for some idealized UX?

Code and GUIs, yin and yang

It’s at least worth pointing out that many claim the rise of R and Python is in part because they are purely-programmed applications. By offering great development environments and large open source code bases, they made it possible to do nearly everything with these two languages… provided you learned to code. There’s little debate that code should be the primary artifact of how analytics is done, even if it’s not the layer the user interacts with.

So, why the change of heart toward low- and no-code analytics solutions? Like I said earlier, it can get frustrating to write the same calls and receive the same trivial errors time and again, so I get that these could be seen as roadblocks to greater data democracy. GUIs have had their time and place in the waves of analytics innovation, often once an application has hit a certain level of maturity. Code also plays its part in building applications out so they can reach that maturity.

I don’t know what the next wave will be, but I’m certain that this current wave of low- and no-code solutions won’t outlast it. It may be that fewer coders are needed to get an innovation to the low/no-code part of the adoption cycle, and that the product can genuinely do everything its users require.

Until that time, I recommend data professionals learn a bit about coding. Maybe not every data solution requires it; that’s fine. But given where we’ve come from in the data world, I’m not inclined to say that the future is all low and no code.

Felix Zumstein

xlwings & SQLookup creator | O'Reilly author | LinkedIn instructor

4y

I agree! I think that Power Query is awesome as long as you don't have to change or tweak the queries, i.e., it's a good no-code solution but a bad low-code solution.

要查看或添加评论,请登录

George Mount的更多文章

社区洞察

其他会员也浏览了