Continuous Streaming Data Ingestion in AI Models

In this second article about AI Architecture, I'm going to cover the concept of continuous data streaming in the context of AI Model Inference and Processing.

As AI adoption grows quickly, as cloud providers mature their AI platform offerings - Azure AI and AWS Bedrock being two examples - and as open-source initiatives such as FloWise make it easier to experiment with models, the term "AI" pops up everywhere, with new applications appearing every day.

AI Platforms heavily rely on data ingestion, data curation and data embedding, as these are the "foundational" capabilities that ultimately put the source pieces of information in front of the AI Model, so that AI Models - specifically LLMs - can be trained and evaluated using the information provided. As models get tailored with specific information, they become "experts" in a certain field and can formulate responses to queries about concepts they have previously "learnt".

I'm going to refer to this three-stage data processing pipeline as the "Content Pipeline". A Content Pipeline encompasses all tasks that ingest the source data, transform and label it (curation), and convert the curated data into a format that is understandable by an AI model - a step known as "embedding". Embedding transforms curated data into a multi-dimensional vector representation with hundreds or thousands of dimensions (for example, 1536 dimensions for OpenAI's text-embedding-ada-002, the embedding model commonly paired with GPT-4). This vector contains numeric values that represent the semantic "meaning" of the individual pieces of text captured from the curated data.
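To make the embedding step concrete, below is a minimal sketch of turning one curated piece of text into its vector representation. It assumes the OpenAI Python client and the text-embedding-ada-002 model; any other embedding provider would follow the same pattern, and the embed() helper is purely illustrative.

```python
# Minimal embedding sketch - assumes the `openai` Python package and an
# API key exposed through the OPENAI_API_KEY environment variable.
from openai import OpenAI

client = OpenAI()

def embed(text: str) -> list[float]:
    """Transform a curated piece of text into its vector representation."""
    response = client.embeddings.create(
        model="text-embedding-ada-002",  # produces 1536-dimensional vectors
        input=text,
    )
    return response.data[0].embedding

vector = embed("Streaming data sources are meant to be consumed continuously.")
print(len(vector))  # 1536
```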

Below is a graphical representation of what a real Content Pipeline looks like, oriented to non-streaming (discrete) processing.

Discrete-processing oriented AI Content Pipeline


One important component within this Content Pipeline architecture is the data ingestion component. Data ingestion pulls from various sources, including streaming data sources; streaming data sources are sources of data that can be queried continuously and are, in fact, intended to be consumed continuously, as a linear continuum of information.

Streamed data differs from non-streamed data sources in that the data is ingested in a continuous manner. While you can implement a content pipeline oriented to discrete processing - such as PDF documents uploaded from time to time - and pull data following a cadence-oriented approach, non-discrete or continuous streaming has quite different requirements in terms of how a content pipeline must be architected in order to obtain a fully functional solution that can hold up over time:

  • Data Ingestion: First of all, data must flow in continuously, so data connectors must be able to stream the data and feed it in continuously; using a landing data store - a special type of document store - will not work, because the amount of data ingested is not compatible with a DB-centric processing approach. Data must typically be split into individual chunks and transmitted through a data fabric in the form of data events (see the ingestion sketch after this list).
  • Data Curation: typical data curation happens by reading documents already validated and ingested from a Landing Store, and that is valid for non-streamed data processing. For streaming data processing, validated data must be consumed from a data fabric in the form of events, and that consumption must be able to scale horizontally as needed, since - remember - data is flowing continuously. Curation of the data, involving annotation and contextualization, must happen on-the-fly as events flow through. Along the same lines, the resulting curated pieces of data cannot be stored in a curated data store, but are instead injected as "curated events" into the data fabric so that events can continue their flow through the pipeline.
  • Data Embedding: in the same way, embedding typically happens by reading curated documents from a Curated data store. In a streaming-oriented content pipeline, the curated data must instead be read from the data fabric and fed into the embedding process, which must be able to scale horizontally depending on the streaming volume. The resulting embedded data must be - following the same approach - fed back into the data fabric via events (see the embedding worker sketch after this list).
  • Ingestion of embedded data into Vector DB: ingesting embedded data events from the data fabric must occur at a much higher rate, so this stage necessarily needs to be designed for high performance. The Vector Database design must also be able to handle high volume and concurrency (see the Vector DB ingestion sketch after this list).
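To illustrate the ingestion requirement above, here is a minimal sketch of a connector that splits incoming source content into chunks and publishes each chunk as an event onto the data fabric. It assumes Apache Kafka (via the kafka-python client) as the data fabric; the topic name, chunk size and event schema are illustrative assumptions rather than part of the reference architecture.

```python
# Ingestion-stage sketch: split source content into chunks and publish each
# chunk as an event onto the data fabric.
# Assumes Kafka via kafka-python; topic, chunk size and schema are illustrative.
import json
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

def split_into_chunks(text: str, chunk_size: int = 1000) -> list[str]:
    """Naive fixed-size chunking; a real pipeline would split on semantic boundaries."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def ingest(document_id: str, text: str) -> None:
    """Publish every chunk of a source document as an 'ingested chunk' event."""
    for index, chunk in enumerate(split_into_chunks(text)):
        producer.send("ingested-chunks", {
            "document_id": document_id,
            "chunk_index": index,
            "content": chunk,
        })
    producer.flush()  # make sure all chunk events actually reach the fabric
```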
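The curation and embedding stages then follow the same consume, transform and re-publish pattern against the fabric. The sketch below shows an embedding worker that reads curated-chunk events, embeds them on-the-fly and emits embedded events for the next stage; the topic names, consumer group and embed() helper are illustrative assumptions. Running several replicas of this worker under the same consumer group is one concrete way to obtain the horizontal scale-out described above.

```python
# Streaming-stage sketch: consume curated events, embed them on-the-fly and
# re-publish them as "embedded" events. Assumes Kafka via kafka-python and
# the OpenAI embeddings API; topics and group_id are illustrative.
import json
from kafka import KafkaConsumer, KafkaProducer
from openai import OpenAI

openai_client = OpenAI()

def embed(text: str) -> list[float]:
    """Same illustrative helper as in the earlier embedding sketch."""
    response = openai_client.embeddings.create(
        model="text-embedding-ada-002",
        input=text,
    )
    return response.data[0].embedding

consumer = KafkaConsumer(
    "curated-chunks",
    bootstrap_servers="localhost:9092",
    group_id="embedding-workers",  # same group_id on every replica -> work is shared
    value_deserializer=lambda v: json.loads(v.decode("utf-8")),
)
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

for event in consumer:  # events keep flowing continuously
    chunk = event.value
    producer.send("embedded-chunks", {
        "document_id": chunk["document_id"],
        "chunk_index": chunk["chunk_index"],
        "content": chunk["content"],
        "embedding": embed(chunk["content"]),  # embedding happens per event, on-the-fly
    })
```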
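Finally, a sketch of the last stage: consuming embedded events from the fabric and upserting them into a vector database. Qdrant is used here purely as an example vector store, and the collection name and payload layout are assumptions; any vector DB able to sustain the required write volume and concurrency would fit the same slot.

```python
# Vector-DB ingestion sketch: consume "embedded" events and upsert them into a
# vector database. Assumes Kafka via kafka-python and a local Qdrant instance
# with a pre-created 1536-dimension collection named "content-pipeline".
import json
import uuid
from kafka import KafkaConsumer
from qdrant_client import QdrantClient
from qdrant_client.models import PointStruct

qdrant = QdrantClient(url="http://localhost:6333")

consumer = KafkaConsumer(
    "embedded-chunks",
    bootstrap_servers="localhost:9092",
    group_id="vectordb-writers",
    value_deserializer=lambda v: json.loads(v.decode("utf-8")),
)

for event in consumer:
    chunk = event.value
    # Deterministic UUID so re-processing the same chunk overwrites, not duplicates.
    point_id = str(uuid.uuid5(uuid.NAMESPACE_URL,
                              f"{chunk['document_id']}-{chunk['chunk_index']}"))
    # A production writer would batch points to sustain the high ingestion rate.
    qdrant.upsert(
        collection_name="content-pipeline",
        points=[
            PointStruct(
                id=point_id,
                vector=chunk["embedding"],
                payload={
                    "document_id": chunk["document_id"],
                    "content": chunk["content"],
                },
            )
        ],
    )
```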

As you can see, the architecture required for streaming-based processing of source data has considerably tougher requirements than a typical non-streamed one.

Streaming-based AI Content Pipeline


The architecture presented above decouples the various content pipeline stages by means of a data fabric that essentially connects the stages together via document-driven events. For this to work, individual stages must be designed with horizontal scalability in mind and deployed on top of a scalable infrastructure that can grow with demand. This approach guarantees near-real-time Content Pipeline processing, since source data is made available in the vector DB - and hence, available as knowledge to the AI Model - as soon as it appears in the source data store.

Typically, AI use cases are not oriented to bringing end users near-real-time capabilities, and business applications created on top of AI may make use of sporadic information that doesn't really require implementing a near-real-time approach like the one above. But think for a moment about other types of business requirements, such as real-time decision-making, that could leverage this approach.

< this will continue >....



