Operationalizing AI Requires a Storage Architecture Strategy
Sam Werner
Vice President, Chief Product Officer. Product and Business Strategist, Inventor. AI, Big Data, Hybrid Multicloud, Storage, Containers
As a storage person, my favorite thing about AI is that it requires lots and lots of data. For someone in the storage business, that is obviously good news. But in all seriousness, it actually creates quite a few challenging technical problems: Data Scientists need to get access to and organize the data; storage administrators need to make the data available while still ensuring privacy, security, and governance of the data; and engineers building storage software and systems need to continue increasing throughput and decreasing total response time to ensure GPUs are always fed all of the data they need.
Many enterprises are starting small on their journey to AI. They purchase a couple of GPU-enabled servers and let their data scientists loose on them. Data is copied to these servers, where the data scientists run frameworks like TensorFlow and PyTorch to train their neural networks (a minimal sketch of this ad hoc setup follows the list below). This is a simple way to get started and build something like a chatbot. However, teams quickly learn that they are not in a position to scale. Some of the challenges they will encounter:
- How will they ensure data governance when data is being copied off to shared servers being used by multiple data scientists?
- How will they ensure the data is always secure and that personal information about their customers is always protected?
- How will they ensure copies are destroyed when they are no longer being used and how will they ensure there are no issues with data consistency?
- How can they build machine learning and deep learning models that have access to real-time / near real-time data to provide the highest value and most timely insights?
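To make the pattern concrete, here is a minimal, hypothetical sketch of that ad hoc setup: a dataset copied onto a server's local disk and trained with PyTorch. The paths, model, and hyperparameters are illustrative assumptions, not a recommended design.

```python
# Hypothetical sketch of the "start small" approach: data copied to a local
# GPU server and trained with PyTorch. Paths and model are illustrative only.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# The data has been copied onto the server's local disk; this copy step is
# exactly what becomes a governance and consistency problem at scale.
features = torch.load("/local/scratch/copied_features.pt")   # hypothetical path
labels = torch.load("/local/scratch/copied_labels.pt")       # hypothetical path
loader = DataLoader(TensorDataset(features, labels), batch_size=64, shuffle=True)

model = nn.Sequential(
    nn.Linear(features.shape[1], 128),
    nn.ReLU(),
    nn.Linear(128, 2),
).to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(5):
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()
```

This works fine for one data scientist and one server, but every new project repeats the copy step, which is where the questions above start to bite.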
The reality is that you need to start with an information architecture (IA): one that scales across multiple projects, provides data consistency and governance, and serves as a single source of truth.
To build an IA, you first need to understand the ML/DL data workflow and its challenges. The goal of the workflow is really quite simple: getting from ingest to inference as quickly and accurately as possible.
[Figure: Machine / Deep Learning Data Workflow, showing the daily tasks of the data scientist]
Let’s start by taking a look at each of the above steps in the workflow.
Ingest: This is where all of the data comes together. Sources include IoT, mobile, transactional, supply chain, customer service/support, CRM, etc. This can include years and years of intelligence an enterprise has developed about its industry and customers. Storage in this phase needs to be cost-effective (leveraging multiple tiers of media), multi-protocol, geographically dispersed, and multi-cloud enabled.
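As a hypothetical illustration of the ingest phase, the sketch below lands raw files from several source systems into a single S3-compatible object store, which can sit on tiered, multi-cloud storage. The bucket name, endpoint, and paths are assumptions for illustration only.

```python
# Hypothetical ingest sketch: landing raw data from multiple sources into one
# S3-compatible object store instead of scattering copies across servers.
import boto3

s3 = boto3.client(
    "s3",
    endpoint_url="https://objectstore.example.com",  # hypothetical endpoint
)

sources = {
    "iot/sensor_readings.csv": "/staging/iot/sensor_readings.csv",
    "crm/customer_export.csv": "/staging/crm/customer_export.csv",
    "transactions/orders.parquet": "/staging/tx/orders.parquet",
}

# Each raw file lands under a prefix that identifies its source system, so the
# enterprise keeps a single, governed copy of the data.
for key, local_path in sources.items():
    s3.upload_file(local_path, "ai-data-lake", f"raw/{key}")
```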
Classify / Transform: This is where a data scientist spends about 80% of their time classifying, tagging, and cleaning data in order to build training datasets for their neural networks. Tools that can help with policy-based tagging and classification of data can greatly accelerate this phase and boost a data scientist's productivity.
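A minimal, hypothetical sketch of this phase might look like the following: cleaning raw records and applying a simple policy-based tag before they become part of a training dataset. The column names, tagging rule, and paths are assumptions for illustration.

```python
# Hypothetical classify/transform sketch: clean raw records and apply a
# simple policy-based tag before writing a curated training dataset.
import pandas as pd

raw = pd.read_csv("/staging/crm/customer_export.csv")  # hypothetical path

# Basic cleaning: drop duplicates and rows missing required fields.
clean = raw.drop_duplicates().dropna(subset=["customer_id", "country"])

# Policy-based tagging: flag records containing personal information so the
# governance layer can restrict who sees them downstream.
pii_columns = [c for c in clean.columns if c in {"email", "phone", "ssn"}]
clean["contains_pii"] = clean[pii_columns].notna().any(axis=1) if pii_columns else False

# Write the curated, tagged dataset back to the shared data lake.
clean.to_parquet("/datalake/curated/customers.parquet", index=False)
```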
Training: This is the most compute-intensive part of the workflow and is where the GPUs come into play. Training of most models can be distributed across multiple GPUs and systems to shorten training time. In this phase, distributed storage with high throughput and low latency that can be shared across systems is critical to ensure that expensive GPUs do not sit idle.
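The sketch below shows, hypothetically, what that looks like in PyTorch: each process reads the same training data from a shared, high-throughput filesystem and drives one GPU with DistributedDataParallel. The shared paths, model, and hyperparameters are assumptions for illustration.

```python
# Hypothetical distributed-training sketch: every rank reads from shared
# storage (no per-server copies) and drives one GPU via DistributedDataParallel.
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset, DistributedSampler

dist.init_process_group(backend="nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

# All ranks read the same dataset from a shared filesystem.
features = torch.load("/sharedfs/train/features.pt")  # hypothetical shared path
labels = torch.load("/sharedfs/train/labels.pt")      # hypothetical shared path
dataset = TensorDataset(features, labels)
sampler = DistributedSampler(dataset)
# Multiple loader workers keep the GPU fed; throughput here depends directly
# on the storage system's ability to sustain parallel reads.
loader = DataLoader(dataset, batch_size=256, sampler=sampler,
                    num_workers=8, pin_memory=True)

model = nn.Sequential(
    nn.Linear(features.shape[1], 512),
    nn.ReLU(),
    nn.Linear(512, 10),
).cuda()
model = DDP(model, device_ids=[local_rank])
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(10):
    sampler.set_epoch(epoch)
    for x, y in loader:
        x, y = x.cuda(non_blocking=True), y.cuda(non_blocking=True)
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()

dist.destroy_process_group()
```

Launched with something like `torchrun --nproc_per_node=4 train.py`, every GPU pulls batches from the same shared namespace, which is why storage throughput directly determines how busy the GPUs stay.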
Inference: This is the stage where you actually use the model to generate an insight or infer something. Storage latency is critical in this phase if the model is going to be deployed where many applications or users need access to the output. NVMe-enabled storage is a great fit for the inference stage.
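As a final hypothetical sketch, inference loads a trained model once from fast (for example, NVMe-backed) storage and then serves many low-latency requests. The model path, format, and input shape are assumptions for illustration.

```python
# Hypothetical inference sketch: load the model once from low-latency storage,
# then score many requests without touching disk again.
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Loading from low-latency storage keeps startup and model-refresh times short.
model = torch.jit.load("/nvme/models/churn_model.pt", map_location=device)  # hypothetical path
model.eval()

@torch.no_grad()
def predict(batch: torch.Tensor) -> torch.Tensor:
    """Score a batch of feature vectors and return class probabilities."""
    return torch.softmax(model(batch.to(device)), dim=1).cpu()

# Example call with a dummy batch of 32 feature vectors of width 16.
scores = predict(torch.randn(32, 16))
```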
It is easy to dive into AI by buying a few servers, hiring a data scientist, and deploying some open-source frameworks. However, to actually operationalize AI in your organization, you need a scalable infrastructure strategy that considers the entire data workflow. This will ensure data remains secure and safe while data scientists and GPUs remain productive. Most importantly, it will ensure you can put your AI models into production to drive real value. After all, there is no AI without IA.
You can see a video of me talking about this topic at the IBM Think conference below.