GenAI Architectural Challenges
Oliver Cronk
Technology Director | Sustainable Innovation & Architecture | Speaker, Podcaster and Facilitator | MBCS CITP
Apologies for the newsletter silence! Not a reflection of the activity in the community - there has been a lot going on across Generative AI, Sustainable Technology and the future of Architecture! This month will mostly be an update on our work on Generative AI...
Risks of deploying Generative AI in enterprises
In the full episode below we talk about the challenges of deploying customer-facing Generative AI-powered applications in [regulated] enterprises. We touch on:
Technical and technology integration challenges, among other topics.
Off the back of the research and thoughts from this episode we've pulled together a conceptual architecture. Thanks again to Charles Phiri, PhD, CITP, Chris Booth, James Heward and others in the community who provided input.
Conceptual Architecture for GenAI deployment
Here is a run-through of the notable components (we acknowledge that the Data Management aspect is important and we will circle back to it very soon).
Demand Management
Requests should be handled through a demand management layer – ideally a queue. This isolates the platform from spikes in customer demand and lets it manage them gracefully. Should the volumes be high enough and the workloads time-sensitive, auto-scaling of the orchestration components is an option to consider.
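As a rough sketch of the idea, here is a bounded in-process queue with a worker draining it. This is purely illustrative: in a real deployment the queue would be a managed service (a message broker or cloud queue), and `handle_request` is a made-up stand-in for the call into the orchestration layer.

```python
import queue
import threading

# Bounded queue: the maxsize provides back-pressure during demand spikes,
# isolating the platform behind it from the raw request rate.
request_queue = queue.Queue(maxsize=100)
results = []

def handle_request(payload):
    # Hypothetical placeholder for the call into the orchestration layer.
    return f"processed:{payload}"

def worker():
    while True:
        payload = request_queue.get()
        if payload is None:          # sentinel value shuts the worker down
            request_queue.task_done()
            break
        results.append(handle_request(payload))
        request_queue.task_done()

# A single worker here; "auto-scaling" would mean adding or removing
# workers (or orchestrator instances) based on queue depth.
t = threading.Thread(target=worker)
t.start()

for i in range(5):                   # a burst of customer requests
    request_queue.put(f"req-{i}")

request_queue.put(None)              # stop signal
request_queue.join()                 # wait until everything is processed
t.join()
print(results)
```

The key property is that producers only ever see the queue, never the workers – so the orchestration layer can scale or be swapped out without the upstream interface changing.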
Orchestration
This is the wrapper or abstraction layer around the different components, managing the generative AI models (in the Model Zoo – more on that later) and providing the framework to add elements such as telemetry capture, input and output checking. The beauty of this is you can keep the upstream and downstream interfaces consistent but swap out components or models within the orchestrator, or scale them up and down as required.
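To make the "consistent interfaces, swappable internals" point concrete, here is a minimal hypothetical orchestrator. The component names (`input_filter`, `output_check`, the lambda "model") are all illustrative assumptions, not part of the architecture as published.

```python
# Minimal orchestrator sketch: a thin wrapper that keeps the upstream
# interface (handle) stable while each internal component can be swapped.
class Orchestrator:
    def __init__(self, model, input_filter, output_check, telemetry):
        self.model = model                # swappable: any model from the zoo
        self.input_filter = input_filter  # swappable: input checking/rewriting
        self.output_check = output_check  # swappable: output filtering
        self.telemetry = telemetry        # audit trail sink

    def handle(self, prompt):
        cleaned = self.input_filter(prompt)
        response = self.model(cleaned)
        checked = self.output_check(response)
        self.telemetry.append({"in": prompt, "out": checked})
        return checked

# Stub components, just to show the shape of the interfaces.
orc = Orchestrator(
    model=lambda p: p.upper(),        # stand-in for a real LLM call
    input_filter=lambda p: p.strip(),
    output_check=lambda r: r,
    telemetry=[],
)
print(orc.handle("  hello  "))
```

Because callers only ever invoke `handle`, any of the four components can be replaced or scaled independently without upstream or downstream changes.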
Input filtering and Model IO engineering
Just from using them, it appears pretty likely that ChatGPT and other chatbot implementations use a form of input filtering. As an extension of sanitising inputs this makes a lot of sense, and for systems where the input prompt is so critical to the success of the outcome, it is even more vital. The input might need to be significantly altered to improve the chance of success or to avoid content generation risks – for example, requests that could lead to outcomes incompatible with the organisation's brand. Different models in the Model Zoo might also require different input data or prompting styles.
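A toy version of such a filter might look like the following. The blocked-term list, the rejection message, and the normalisation step are all invented for the sketch – a real implementation would use classifiers and per-model prompt templates rather than keyword matching.

```python
import re

# Illustrative blocked topics; a production system would use a trained
# classifier rather than a keyword list.
BLOCKED = {"competitor", "lawsuit"}

def filter_input(prompt: str):
    """Return (cleaned_prompt, status); cleaned_prompt is None if rejected."""
    lowered = prompt.lower()
    if any(term in lowered for term in BLOCKED):
        return None, "rejected: off-brand topic"
    # Light normalisation before prompting. A fuller version might also
    # rewrite the prompt into the style a given Model Zoo model expects.
    cleaned = re.sub(r"\s+", " ", prompt).strip()
    return cleaned, "ok"

print(filter_input("  Tell me   about your products  "))
print(filter_input("What do you think of your competitor?"))
```

Rejected prompts never reach a model at all, which is both a safety control and a cost control.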
Model Zoo
It’s highly unlikely that a single model (regardless of how powerful or general purpose it is) is going to cover all the use cases of a non-trivial application. The orchestrator can draw on a number of approved models in the Model Zoo (ingested from publicly available model hubs such as HuggingFace and internal model development as required). This allows for the management and governance of models used in the application. This approach could potentially lead to something approaching Artificial General Intelligence (AGI) – as narrow specialist models can be called upon to solve specific challenges and fill gaps in more general language models that are good at human interaction.
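A simple way to picture the Model Zoo is as a registry keyed by capability, with an approval gate for governance. This sketch and its model names are hypothetical; the point is only the routing-plus-governance pattern described above.

```python
# Hypothetical model registry: the orchestrator asks for an approved model
# by capability rather than hard-coding a single model.
class ModelZoo:
    def __init__(self):
        self._models = {}

    def register(self, capability, model, approved=True):
        # Governance gate: only approved models become routable.
        if approved:
            self._models[capability] = model

    def route(self, capability):
        if capability not in self._models:
            raise KeyError(f"no approved model for: {capability}")
        return self._models[capability]

zoo = ModelZoo()
zoo.register("chat", lambda p: "general answer")        # broad LLM
zoo.register("sentiment", lambda p: "positive")         # narrow specialist

print(zoo.route("sentiment")("I love this"))
```

The general chat model handles open-ended interaction, while narrow specialists are called on for the specific tasks it is weak at – the division of labour described above.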
Telemetry
When aircraft sadly crash, the black box flight recorder is crucial to the investigation of why the accident occurred. When applications leveraging machine learning fail, we need a similar audit trail data source – capturing the input data, decisions and outputs. That way, lessons can be learned and decisions can be made to tune or change models on the basis of data and evidence. From a regulatory perspective, it’s possible that this will be made a requirement when using ML technologies for customer processes – in the future, regulators may demand to see your application telemetry.
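A minimal form of such an audit trail is an append-only log of structured records, one per interaction. The field names below are assumptions made for the sketch; a real system would write to a durable, tamper-evident store rather than an in-memory list.

```python
import json
import time

def record_interaction(log, prompt, model_name, output, checks):
    """Append one 'flight recorder' entry as a JSON line."""
    entry = {
        "ts": time.time(),      # when the interaction happened
        "prompt": prompt,       # input data
        "model": model_name,    # which Model Zoo model handled it
        "output": output,       # what was returned
        "checks": checks,       # results of input/output checking
    }
    # JSON lines are easy to replay later during an incident investigation.
    log.append(json.dumps(entry))
    return entry

log = []
record_interaction(log, "hello", "chat-v1", "hi there",
                   {"output_check": "pass"})
print(log[0])
```

Because every record captures input, model, decision and output together, the log supports both incident investigation and evidence-based decisions about tuning or replacing models.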
Output checking
When dealing with customer requests in real time, it's not going to be good enough to catch issues and errors after the fact and adjust the architecture afterwards. To prevent brand damage, mis-selling, or other mishaps caused by inappropriate generated content, output checking and filtering are going to be required. This is likely to be a blend of traditional logic-based filtering and ML models that generate a confidence score that outputs are aligned with company policies and/or regulatory standards. Responses back to the customer can then be altered, or held back and escalated to a human employee to respond to the customer instead.
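The blend of hard rules and a confidence score could be sketched as follows. The banned phrases, the threshold, and the scoring function are all invented for illustration – in practice the score would come from a trained policy-alignment classifier.

```python
# Hard rules: phrases that always trigger human escalation
# (invented examples, loosely in the spirit of mis-selling risk).
BANNED_PHRASES = ("guaranteed returns", "risk-free")

def policy_confidence(text: str) -> float:
    # Stand-in for an ML model scoring policy alignment; here a crude
    # heuristic just to make the sketch runnable.
    return 0.2 if "always" in text.lower() else 0.95

def check_output(text: str, threshold: float = 0.8) -> str:
    """Return 'send' or 'escalate' for a candidate model response."""
    lowered = text.lower()
    if any(p in lowered for p in BANNED_PHRASES):
        return "escalate"                 # logic-based rule: hard stop
    if policy_confidence(text) < threshold:
        return "escalate"                 # ML score below confidence bar
    return "send"

print(check_output("Our fund offers guaranteed returns"))
print(check_output("Here is some general information"))
```

Anything marked `escalate` is held back for a human to handle, which is the safety valve described above.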
Design time architecture
Underlying this will be design time components – these will ingest telemetry and performance data and assist with updates to the Model Zoo, input and output filtering, and other supporting components. The performance of the platform can be evaluated and fine-tuned and issues and incidents can be investigated from the captured telemetry data.
If you want more details on this check out the full blog here. And if you'd like to see the discussion / leave feedback check out this thread (or feel free to leave a comment on this post).
What's next?
The power of Knowledge Graphs!
The next topic in the AI series (although not the next episode...) features many of the original panel plus Tony Seale, a leading Knowledge Graph engineer. Chris Booth and I recorded the follow-up episode with him this week and we were both blown away (as you'll be able to tell from the "buffer face" moments I have whilst processing the implications of what Tony describes). In short, Knowledge Graphs partner very well with Large Language Models and this is an episode you won't want to miss!
Chris and I also touch on some R&D we have been doing - using Graph structures to manage customer interaction with Large Language Model generated chat dialogue.
This goes some way to addressing the Data Management piece - the next step is likely to be building out more of that part of the conceptual architecture.
Strategy to Reality and Business Architecture
The next episode to drop (currently in the editing process) sees Whynde Kuehn returning to talk about her book Strategy to Reality, women in architecture, and Business Architecture in general.
We are joined by Lisa Woodall (who appeared when we spoke to the Intersection Group) and Catherine Pratt. An amazing all-woman line-up! Diary and travel permitting, this should be out in the next couple of weeks.
Sharing is caring!
A healthy community is a growing community! If this is helpful please do subscribe and share this with your network.
Is there a topic you'd like us to cover or a speaker you'd recommend, particularly in the area of Sustainable Architecture/Technology (as in making technology more sustainable)? Please do get in touch via the comments or send Oliver a DM.
Enterprise Architect, Consultant and Educator
Interesting topic. I think the top 3 factors determining risk are: 1. Trustworthiness of the LLM provider; 2. Mode of access (e.g. systematic via API, embedded plugin, direct by user through the web); 3. The specific use case. Mitigating controls (architecture guardrails being one of many possible) need to be commensurate and appropriate to the risk and the firm's risk appetite.