Deploying a Minimalist ChatGPT in Azure

Pick any AI and ask it how to deploy AI (pun intended). Of course, you will receive an answer that reflects the complexity of your prompt. But how do you get started with the absolute lowest level of complexity and time/cost investment?

Deploy Azure OpenAI while retaining access control over your prompts and data

Based on my recent experience, here is my simplified roadmap to the most basic version of ChatGPT deployed on the Azure platform. Please see Baseline OpenAI end-to-end chat reference architecture - Azure Reference Architectures | Microsoft Learn for architectural information.

  1. I used Microsoft's Sample Chat App with AOAI (Azure OpenAI). Using this particular GitHub repository provided credible source code and a low-effort path to implementation. When you review the repository, note that the first implementation described is "the basic" ChatGPT app. The repository also contains guidance and code to further extend the AI's functions so it can access your own data, or even ad hoc data that you upload. In this simplified guide I am only deploying the most basic implementation, just enough to have a working ChatGPT app.
  2. In Azure, I created a new Resource Group and Azure OpenAI resource. The S0 pricing tier is the only option, and I encourage you to review the pricing for the model(s) you deploy in detail, as well as understand how tokens are calculated, at Azure OpenAI Service - Pricing | Microsoft Azure. Basically, one token generally corresponds to ~4 characters of common English text (more information at What are tokens and how to count them? | OpenAI Help Center). There is a cost per token for both the prompt input and the response output, and the cost per token depends directly on which AI model is used. I know, it sounds like a potential sticker-shock moment, but there is a way to throttle token use. My deployment was tested with a community of nearly 1,000 people and cost under $300/month in US dollars.
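To make the token math above concrete, here is a small back-of-the-envelope sketch using the ~4 characters-per-token heuristic. The per-1K-token prices below are placeholders, not the actual Azure rates; always check the current Azure OpenAI pricing page for the model you deploy.

```python
# Rough cost estimate for one chat exchange, using the
# ~4 characters-per-token heuristic mentioned above.
# The prices are ASSUMED placeholders, not real Azure rates.
PROMPT_PRICE_PER_1K = 0.0015   # placeholder USD per 1K input tokens
OUTPUT_PRICE_PER_1K = 0.0020   # placeholder USD per 1K output tokens

def estimate_tokens(text: str) -> int:
    """Approximate token count: one token per ~4 characters of English text."""
    return max(1, len(text) // 4)

def estimate_cost(prompt: str, response: str) -> float:
    """Estimated USD cost of a single prompt/response exchange."""
    prompt_cost = estimate_tokens(prompt) / 1000 * PROMPT_PRICE_PER_1K
    output_cost = estimate_tokens(response) / 1000 * OUTPUT_PRICE_PER_1K
    return prompt_cost + output_cost
```

Multiplying a per-exchange estimate like this by your expected daily chat volume is a quick way to sanity-check the monthly bill before you deploy.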
  3. In the new Azure OpenAI resource, I deployed a simple chat model. At this point I had satisfied the prerequisites specified in the repository. Remember that throttling token use is possible! In the image below, notice I have disabled dynamic token quotas and configured a 5K tokens-per-minute rate limit.

Model gpt-35-turbo deployment configuration
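One practical consequence of a tight tokens-per-minute limit is that a busy app will occasionally receive HTTP 429 ("Too Many Requests") from the service. The sample app repository handles retries its own way; the sketch below only illustrates the general exponential-backoff idea, with a generic exception standing in for the SDK's rate-limit error.

```python
import time

def backoff_delays(retries: int, base: float = 1.0, cap: float = 60.0) -> list[float]:
    """Exponential backoff delays in seconds: base, 2*base, 4*base, ... capped."""
    return [min(cap, base * (2 ** i)) for i in range(retries)]

def call_with_retries(call, retries: int = 5):
    """Invoke call() and retry with backoff when it raises.

    `call` is any zero-argument function; RuntimeError here is a
    stand-in for the SDK's rate-limit (HTTP 429) exception.
    """
    for delay in backoff_delays(retries):
        try:
            return call()
        except RuntimeError:
            time.sleep(delay)
    return call()  # final attempt; let any exception propagate
```

With a per-minute quota, capping the delay around 60 seconds means the worst-case retry waits roughly one quota window before trying again.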

  4. Back at the repository, I selected Deploy to Azure. The deployment template contains a default App Service Plan size of B3, which I changed to F1 (the free tier) for my development. If you require a custom DNS name for the web app, select at least size B1; in my experience, B1 supported nearly 1,000 users while staying under 60% CPU. The repository also documents the web app variables for customizing the web app UI without editing the repository code.
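Once the model deployment and web app are up, it can be handy to smoke-test the Azure OpenAI deployment directly, outside the web app. Below is a sketch using the `openai` Python package (v1+); the environment variable names and the deployment name are assumptions standing in for your own values.

```python
# Smoke test for the Azure OpenAI chat deployment created above.
# AZURE_OPENAI_ENDPOINT / AZURE_OPENAI_API_KEY are assumed env vars;
# DEPLOYMENT_NAME must match the deployment name you chose in step 3.
import os

DEPLOYMENT_NAME = "gpt-35-turbo"  # assumed deployment name

def build_messages(user_prompt: str) -> list[dict]:
    """Chat payload: a system message plus the user's prompt."""
    return [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": user_prompt},
    ]

if __name__ == "__main__":
    from openai import AzureOpenAI  # pip install openai

    client = AzureOpenAI(
        azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
        api_key=os.environ["AZURE_OPENAI_API_KEY"],
        api_version="2024-02-01",
    )
    reply = client.chat.completions.create(
        model=DEPLOYMENT_NAME,
        messages=build_messages("Say hello!"),
    )
    print(reply.choices[0].message.content)
```

If this round-trips successfully, any problems you see in the web app are in the App Service layer rather than the model deployment.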

Custom ChatGPT web app

This is the most basic implementation, just enough to have a working ChatGPT app. Even though this custom ChatGPT web app had humble beginnings, it steadily supported nearly 1,000 users, and the monthly cost for tokens and the required Azure infrastructure remained near $300/month. Of course, this basic deployment can be extended further using the same repository's web app variables, and even more customization is possible by forking or cloning the repository.

Thanks for reading this post!

Woodley B. Preucil, CFA

Senior Managing Director
