Chasing Two Rabbits -                Bringing AI into the Enterprise

Chasing Two Rabbits - Bringing AI into the Enterprise

At Microsoft’s Inspire 2019 partner conference, CEO Satya Nadella said something that caught my attention. He said that in 2030, tech spend will be 10% of global GDP and will amount to a staggering $14 trillion dollars! But that was not the part that caught my attention, it was what he said immediately after, i.e. though the 10% is important, the remainder 90% of global GDP represents the opportunity for the company - as businesses, enterprises, and industries worldwide get digitized. 

Digitization is about connecting things and processes to provide insights; it offers the potential for more agile business operations, closer employee engagement, and improved customer experiences. Digitization will generate data at high volume, variety, and velocity and so - capabilities such as Artificial intelligence (AI) and Machine learning (ML) need to be concurrently deployed in order to convert data to insights to action. 

The challenges with deploying Artificial Intelligence and Machine learning

As I talk to different enterprises, they say that the challenges with deploying AI and ML are considerable and there are many decisions that need to be made - starting with determining areas that would be a good fit for AI and ML. Data has to be cleaned and pre-processed which can take as much as 50-60% of the project time, nulls and outliers have to be dealt with, and rescaling of the data may be required. The data scientist has to then determine the features that are important, and choose the ML model from a plethora of choices (Bayesian regression, Gradient boosted, Nearest neighbors, Support vector machine…). There are clustering models, classification and regression models, association rule models; no single model works best on all datasets, and different variables, coefficients, and hyperparameters have to be tried. After choosing a model, hyperparameters will probably have to be tuned and it may be necessary to reprocess the data. 

This process of finding the best ML model is a trial-and-error, iterative method which can take a lot of time (unless just a few parameters need tuning, or a lot of money is spent on high-power processing platforms). The data scientist has to decide how many of these iterations to do, whether to try different model families, whether to further tune hyperparameters, and whether to send data back for preprocessing. Even if the enterprise has enough data scientists and software engineers, these professionals may have to spend hundreds of hours each month building and maintaining these ML projects; and aiming for higher accuracy results would mean delaying other enterprise ML projects. What makes things worse is when models need to be continuously upgraded, for example in the field of financial applications there may be new regulations to be considered, or in the case of retail, the latest market feedback.

Automating the steps of Machine learning

Wouldn’t it be great if all of these steps could be done automatically, and there existed a service that identifies the best machine learning pipelines? The program would take in the enterprise’s dataset, optimization metrics, and any constraints in time and cost as inputs - and come up with a ML model. This would enable a large number of iterations to be run over many combinations of algorithms and hyperparameters, which would result in the best model for the input criteria. 

Microsoft’s Automated ML  

Well this seems to be what Microsoft’s Automated ML product available through their Azure Machine Learning service does; it automates feature engineering, algorithm and hyperparameter tuning to find the best model for enterprise data. Last week I got the opportunity to attend a session on Automated ML at Microsoft’s office in Atlanta. A lot of thanks to Mark Tabladillo - Data and Artificial Intelligence Scientist at Microsoft - for an excellent presentation explaining the concepts behind Automated ML.

Domain specific pretrained models for Language, Search, Speech, and Vision applications - as well as popular ML frameworks such as TensorFlow, PyTorch, ONNX, and Scikit-learn are supported, on processors such as CPU, FPGA, and GPU. The program takes in input data along with goals and constraints and tests multiple models in parallel, reducing development time. Guardrails can be placed such as detection of high cardinality features, class imbalance, leaky features, overfitting, missing value imputation. 

Microsoft’s aim with Automated ML seems to be to democratize and scale AI, and enable developers to rapidly build solutions. Some of the companies using Azure for ML include Accenture, BT, Carnival Cruises, Jabil, Juniper, Lennox, Mercedes-Benz, the NBA, Progressive Insurance, Schneider Electric, Uber, UPS.

Chasing two rabbits

“The man who chases two rabbits, catches neither” is a quote attributed to Chinese sage Confucius. Enterprises today feel like they have to chase two rabbits. On the one hand they realize that their entire industry, supply chain, and ecosystem is getting digitized - and they feel pressure to pursue the same, and quickly. At the same time, in order to manage the data that digitization produces, they realize they need to simultaneously pursue new technologies such as artificial intelligence and machine learning. 

With the Automated ML product, Microsoft is helping chase down one of the rabbits. Enterprises can now focus on chasing down the other. 


P.S. and Questions: 

1) What are your thoughts regarding digitization; do you feel the pace of digitization in your industry is too fast, and the RoI is still not clear? 2) Do you feel along with digitization you have to concurrently deploy AI and ML in the enterprise; and what are the challenges? 

Feel free to comment or message me.

Good observation, Alan. Just to add, regardless of the tools and platforms, we see that building production and enterprise quality end to end ML projects need seasoned teams and SMEs. We spend a lot of evangelizing and educating our customers on operationalizing these ML projects. Often times that becomes the catalyst for success rather than the platform or tool used.?

回复

要查看或添加评论,请登录

Alan A. Varghese的更多文章

  • Medtech in the time of COVID

    Medtech in the time of COVID

    During this crisis, many have come to know what a Ventilator is. Several companies switched gears to meet the surge in…

  • Will AI and Machine learning take jobs?

    Will AI and Machine learning take jobs?

    Is it not interesting we have worried so much about this topic, but in the ongoing crisis - it was not AI that took…

    4 条评论
  • Baking the quarter three years ago

    Baking the quarter three years ago

    Digital Realty's very good quarter If you paid any attention to the stock market recently you should have heard of…

  • Artificial Intelligence in Building Design & Architecture: Could Errazuriz be right?

    Artificial Intelligence in Building Design & Architecture: Could Errazuriz be right?

    Sebastian Errazuriz - innovative and sometimes provocative building architect in New York raised a lot of emotions…

  • The Building of Buildings

    The Building of Buildings

    Taller and multipurpose About half the world lives in cities and urban areas today, and by 2050 this number will grow…

    1 条评论
  • Selling the Bloody Furniture - Formulating a Strategy for the Automotive Industry

    Selling the Bloody Furniture - Formulating a Strategy for the Automotive Industry

    Strategies for Disruptive and Complex Markets I am often asked by companies to help them map out their strategy. These…

  • The Sexy Job

    The Sexy Job

    I remember asking a colleague more or less in jest why no spectators show up to watch statisticians solve a probability…

    2 条评论
  • From Peaches to Payments

    From Peaches to Payments

    In July 2018, the South Carolina Department of Agriculture probably to liven up a dull day - tweeted that Georgia (the…

  • Reading things not yet on the page

    Reading things not yet on the page

    In 2004, 3G cellular deployments were already a couple of years late. In fact, a March 2004 press release from analyst…

  • The first to be second: Scaling from IoT pilots to mass deployments to sustainability

    The first to be second: Scaling from IoT pilots to mass deployments to sustainability

    Enjoyed some very insightful sessions at Georgia Tech Research Institute’s IoT for Sustainability conference July in…

    2 条评论

社区洞察

其他会员也浏览了