Scaling AI use cases: ChatGPT on your data
ChatGPT is a powerful language model that generates responses from natural language input, making it a highly effective tool for many content-generation use cases. One of its key benefits is that it can be fine-tuned on your own data, which can significantly improve the quality and relevance of the generated output.
But now you don't even need to fine-tune the model to use your own data!
Azure OpenAI on your data enables OpenAI's GPT-3.5 Turbo and GPT-4 language models to generate responses based on your data. You can access it through a REST API or through the web-based interface in Azure OpenAI Studio. This tool lets you build a solution that connects to your data, resulting in an enhanced chat experience.
One of the most notable features is the ability to retrieve and use relevant data to improve the output of the language model. This is made possible through integration with Azure Cognitive Search. Based on the user input and conversation history, Azure OpenAI on your data identifies the most appropriate data source, retrieves the relevant content, and appends it to the original prompt before resubmitting it to the OpenAI model.
The model then uses this retrieved information to generate a relevant response. It's worth noting that the retrieved data is simply appended to the prompt; the model processes the combined input just as it would any other prompt.
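If you use the REST API directly, the grounding data source is declared alongside the messages in the request body. The sketch below shows roughly what that body looks like; the endpoint, index name, and search key are hypothetical placeholders, and the exact parameter names come from the preview API, so verify them against the current API reference.

```python
import json

def build_chat_with_data_request(user_question, search_endpoint, search_key, index_name):
    """Build the JSON body for a chat-completions call that is grounded in an
    Azure Cognitive Search index (shape based on the preview API)."""
    return {
        "dataSources": [
            {
                "type": "AzureCognitiveSearch",
                "parameters": {
                    "endpoint": search_endpoint,
                    "key": search_key,
                    "indexName": index_name,
                },
            }
        ],
        "messages": [
            {"role": "user", "content": user_question},
        ],
    }

# Hypothetical resource names for illustration only.
body = build_chat_with_data_request(
    "How do I replace the razor blades?",
    "https://my-search.search.windows.net",
    "<search-admin-key>",
    "my-manual-index",
)
print(json.dumps(body, indent=2))
```

The service takes care of querying the index and merging the retrieved chunks into the prompt; your client only declares where the data lives.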
The best and easiest way to set this up, in my opinion, is using "Add your data" in the Azure AI Studio.
To get started, you need to have been approved for Azure OpenAI access and have an Azure OpenAI Service resource with either the gpt-35-turbo or the gpt-4 model deployed.
Before starting, create a storage account and upload the data that you want to use. Enable managed identity on the Azure OpenAI resource and give that managed identity the Storage Blob Data Contributor role on the storage account. The supported data formats are:
.txt
.md
.html
Microsoft Word files
Microsoft PowerPoint files
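The identity and role-assignment prerequisites above can be scripted with the Azure CLI. The snippet below composes the commands as strings rather than executing them; all resource names are hypothetical, and you should verify the syntax against the current `az` documentation for your setup.

```python
# Compose the Azure CLI commands for the storage prerequisites.
# All resource names below are hypothetical placeholders.
resource_group = "rg-openai-demo"
openai_resource = "my-openai"
storage_account = "mydatastorage"
subscription_id = "<subscription-id>"

# 1. Enable a system-assigned managed identity on the Azure OpenAI resource.
enable_identity = (
    f"az cognitiveservices account identity assign "
    f"--name {openai_resource} --resource-group {resource_group}"
)

# 2. Grant that identity the Storage Blob Data Contributor role,
#    scoped to the storage account holding your data.
scope = (
    f"/subscriptions/{subscription_id}/resourceGroups/{resource_group}"
    f"/providers/Microsoft.Storage/storageAccounts/{storage_account}"
)
assign_role = (
    f"az role assignment create --assignee <principal-id> "
    f"--role 'Storage Blob Data Contributor' --scope {scope}"
)

print(enable_identity)
print(assign_role)
```

The first command prints the identity's principal ID, which you then pass as the assignee in the second.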
You can choose whether you want the indexer to run once, hourly, or daily. You can also enable vector search using the Ada embedding models, available in select regions.
When clicking Next, you can choose between the different search options shown in the table below.
Once this is done, you can decide whether responses should be limited to the data that you provide.
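If you call the REST API instead of using the studio toggle, this restriction corresponds, as far as I understand the preview API, to a boolean flag on the Cognitive Search data source parameters. Treat the `inScope` name as an assumption and verify it against the current reference.

```python
# Hypothetical sketch: restricting answers to the retrieved data in the
# REST request body. The "inScope" parameter name is based on the preview
# API and may differ in newer versions.
data_source_parameters = {
    "endpoint": "https://my-search.search.windows.net",  # hypothetical
    "indexName": "my-manual-index",                      # hypothetical
    "inScope": True,  # True = answer only from the indexed data
}
print(data_source_parameters["inScope"])
```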
Now you are ready to start chatting with your data!
As an example, I needed to replace the razor blades on my robotic lawn mower, and the information was quite hard to find. So I added the owner's manual as my data and asked how I should replace them.
Next, you can deploy the model either to Power Virtual Agents or to a web app; I think the web app is the simplest option. After deploying, you can change the system message, the front end, or whatever else you want by cloning the repo that the app uses and making your changes. Then build it and update your app with the Azure CLI.
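Pushing a rebuilt package to the web app can also be scripted. The snippet below composes a zip-deploy command as a string; the app and resource group names are hypothetical, and you should check the `az webapp` documentation for the variant that matches your deployment.

```python
# Compose the Azure CLI command to push a rebuilt app package to the web app.
# Resource names are hypothetical placeholders.
resource_group = "rg-openai-demo"
webapp_name = "my-chat-webapp"
package = "app.zip"  # zip of the cloned, modified, and rebuilt repo

update_cmd = (
    f"az webapp deployment source config-zip "
    f"--resource-group {resource_group} --name {webapp_name} --src {package}"
)
print(update_cmd)
```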
The answers include references and citations to your data.
To make the app more enterprise-ready, add the services shown in the architecture below and use the service as explained in my previous article. Add monitoring as that article explains, and you are set!