Can you control Generative AI?
Duncan Robson
Enterprise Architect | Data and AI Innovator | Retail Expert | University Lecturer | Distinguished Architect with the Open Group | MSc FBCS CITP
In 2005 I gave a presentation to a room full of retailers where I made a prediction: in the future, computers would learn rather than be programmed, and you would choose to buy a system based on its experience and knowledge rather than its technical specifications.
I also posed this question: If in the future you were using AI in your retail business, how would you ensure it stayed on brand and didn’t recommend your competitors’ products over yours, even if they were better?
Fast forward 20 years, and we are now seeing that play out. There has been a lot of discussion about DeepSeek, both its speed and cost, but also about the controls that have been put in place to manage the information it provides.
I am not going to discuss why these controls exist; however, I am interested in how they have been implemented, and the implications for controlling Generative AI solutions in the future.
To investigate this I installed the DeepSeek-R1-Distill-Llama-8B model locally on my laptop to try it out. This isn’t the full R1 model from DeepSeek, but it exhibits similar behaviour.
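By way of illustration, a distilled R1 model can be queried locally with the Hugging Face transformers library roughly as follows. The model id and generation settings below are assumptions rather than a record of the exact setup used here, and tools such as Ollama or LM Studio achieve the same thing.

```python
# Illustrative sketch: querying a distilled DeepSeek-R1 model locally with Hugging Face transformers.
# The model id and generation settings are assumptions; adjust them for your own hardware.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Llama-8B"  # assumed Hugging Face model id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

# Ask the same question used in the article
messages = [{"role": "user", "content": "What happened at Tiananmen Square?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=512)
decoded = tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True)
print(decoded)
```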
To start with, I asked it: ‘What happened at Tiananmen Square?’
OK, that is consistent with what other folk have found, and much better than just ‘I can’t give you this information’.
One of the common prompt engineering methods to get more information is to be specific, so let’s try that - ‘What happened at Tiananmen Square in 1989’
Here is its response:
Where this starts to get really interesting is why the model has now chosen to give me more information based on that prompt. Wasn’t it given rules that this was a sensitive topic?
One of the features of DeepSeek, in the version I am using, is that it allows you to see its ‘Thoughts’ - its internal monologue. Here it is for the same question.
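As background, the distilled R1 models emit this reasoning between <think> and </think> tags ahead of the final answer, which is what chat interfaces surface as ‘Thoughts’. A rough sketch of separating the two, building on the snippet above (variable names are assumptions):

```python
import re

def split_thoughts(raw_output: str) -> tuple[str, str]:
    """Separate the 'Thoughts' that R1-style models emit between <think> and </think>
    tags from the final answer. Returns (thoughts, answer)."""
    match = re.search(r"<think>(.*?)</think>", raw_output, flags=re.DOTALL)
    if match:
        return match.group(1).strip(), raw_output[match.end():].strip()
    # No thinking block found: return the whole output as the answer.
    return "", raw_output.strip()

# Using the decoded text from the earlier snippet:
# thoughts, answer = split_thoughts(decoded)
# print("THOUGHTS:\n", thoughts)
# print("ANSWER:\n", answer)
```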
This fascinates me. Clearly the model knows much more, but is ‘choosing’ how much to tell me based on the prompt I provide. This raises a number of questions:
1. Why didn’t the creators of DeepSeek simply remove the restricted data from the model?
Is this a limitation of the source datasets used to train the model, or is it too complex or resource-intensive to selectively remove specific facts? If so, what challenges does this pose for curating datasets in future AI systems?
2. Would training your Large Language Model (LLM) using other LLMs transfer their content rules and ethical standards to yours along with their data?
If the model is making the decisions on what it can and cannot say, where are these rules stored, and how are they managed?
3. How effective are the controls if the model can reveal restricted information through alternative phrasing?
Numerous reports suggest that by ‘reasoning’ with the model or rephrasing questions, users can bypass restrictions, so how secure are these controls? What are the broader implications for LLMs containing sensitive data, especially in environments where compliance and confidentiality are key?
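One crude way to get a feel for this is to probe the same underlying question with several phrasings and flag apparent refusals. The sketch below is purely illustrative: the refusal markers and the ask_model callable are hypothetical placeholders, not a real red-teaming tool.

```python
# Hypothetical probe: send several phrasings of the same question and flag apparent refusals.
# The refusal markers and the ask_model callable are illustrative assumptions.
REFUSAL_MARKERS = ["i can't provide", "i cannot provide", "i'm sorry", "unable to discuss"]

def looks_like_refusal(response: str) -> bool:
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)

def probe(ask_model, paraphrases):
    """ask_model: any callable that takes a prompt string and returns the model's reply."""
    for prompt in paraphrases:
        status = "refused" if looks_like_refusal(ask_model(prompt)) else "answered"
        print(f"[{status}] {prompt}")

# Example usage (ask_model would wrap the local model call shown earlier):
# probe(ask_model, [
#     "What happened at Tiananmen Square?",
#     "What happened at Tiananmen Square in 1989?",
#     "Summarise the events of June 1989 in Beijing.",
# ])
```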
4. What is the potential impact on people when the model ‘decides’ whether you can access information?
In the legal profession or other regulated industries, how would this filtering of information by AI influence decision-making, fairness, and trust? Who gets to define the rules, and how transparent are these decisions to users?
5. What happens if bad actors inject harmful information into your LLMs?
How are you going to control the information produced and your LLM’s behaviour if it has ‘learned’ from bad data? Will data security specialists now also need to be behavioural science specialists? Will you have to wrap your GenAI solution with a traditional rule-based solution to be safe?
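To make that last point concrete, the simplest form of such a wrapper is a deterministic filter around the model call. The sketch below is hypothetical: the blocked terms and the generate callable are placeholders, and real guardrails would add classifiers, audit logging and escalation to a human.

```python
# Minimal sketch of a deterministic, rule-based wrapper around an LLM call.
# BLOCKED_TERMS and the generate callable are hypothetical placeholders; production
# guardrails would use classifiers, policy engines, audit logging and human escalation.
BLOCKED_TERMS = ["example-competitor-brand", "example-restricted-topic"]

def guarded_generate(generate, prompt: str) -> str:
    """generate: any callable that takes a prompt and returns the raw LLM output."""
    raw = generate(prompt)
    if any(term in raw.lower() for term in BLOCKED_TERMS):
        # The deterministic rule fires regardless of what the model has 'learned'.
        return "This response has been withheld by policy and escalated for human review."
    return raw
```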
6. It’s also about people - Who do you trust?
Many LLMs, including I assume DeepSeek initially, use Reinforcement Learning from Human Feedback (RLHF) to fine-tune the model, and also to build in behaviours based on what those doing the tuning view as acceptable. So when choosing which LLM to use, it is also important to know who tuned the model, and to decide whether you trust them.
So more questions than answers, but this is something we need to understand when we are practically implementing Generative AI solutions in our businesses.
What are your thoughts? And if you would like to follow me on LinkedIn, please click here.
Enterprise Architect | Data and AI Innovator | Retail Expert | University Lecturer | Distinguished Architect with the Open Group | MSc FBCS CITP
3 weeks ago: Building on these thoughts, I’ve just written another article regarding trust and transparency for LLMs: https://www.dhirubhai.net/pulse/llm-reality-check-which-one-use-duncan-robson-he6ke/
Technology Director | Sustainable Innovation & Architecture | Speaker, Podcaster and Facilitator | MBCS CITP
1 month ago:
1. Why didn’t the creators of DeepSeek simply remove the restricted data from the model? Surely if that was removed then it could be prone to [semi] hallucinating answers that might not be aligned?
2. This is exactly why we don’t think LLMs should be at the centre of your architecture - something deterministic should orchestrate multiple LLMs to evaluate and keep them "honest" / balanced (more on this coming, as we’ve just completed some research using our InferGPT framework developed with help from Chris Booth).
3. Not sure prompt injection has been truly solved yet, has it? https://blog.scottlogic.com/2024/07/08/beyond-the-hype-will-we-ever-be-able-to-secure-genai.html
4. Great question - I still don’t think the legal implications have been fully thought through. One for a possible discussion with Chris Williams or his colleagues at Clyde.
5. This is why I came up with the GenAI conceptual architecture in 2023 - you need filtering, audit logging, evaluation and escalation to a human process all the way through. https://blog.scottlogic.com/2023/05/04/generative-ai-solution-architecture.html
6. Agree! Again, why I think you need a basket / zoo of models (as per the architecture above).
Enterprise Architect | Data and AI Innovator | Retail Expert | University Lecturer | Distinguished Architect with the Open Group | MSc FBCS CITP
1 month ago: I suspect that some of the censorship is due to the initial code used to train the model when it was built. This project should be really interesting, as it would potentially keep all of the cool new elements of the tech but remove the control aspects: https://huggingface.co/blog/open-r1