Leveraging OpenAI’s Whisper and ChatGPT with UiPath for the Contact Center
A guide to gaining insights from contact center recordings and transcriptions with AI and automation
Alright everyone, this guide is mostly about me jumping on the OpenAI bandwagon. It seems to be getting full and I am frankly feeling a bit of fomo.??
?To start, this guide came about thanks to the convergence of a few events:?
Let’s get started. First, a little about this guide:?
Step #1: Install OpenAI Whisper?
We will first install Whisper. If you are going to have problems it will likely be here and I don’t want you to go through all the other steps if this doesn’t work.?
Implementation Note?
I installed Whisper on my local PC and I am using a UiPath Attended robot. For a production implementation you may want to use a UiPath Unattended robot and would thus need to install Whisper on the VM(s) with that robot.
Whisper also has an API. This may be a better option for environments where running Python isn't practical.??
2. Check that you have Python installed by checking the Python version number?
3. Install and upgrade PIP?
4. Use Chocolatey to upgrade python?
5. Install ffmpeg?
6. Install OpenAI Whisper (finally!)?
7. Take Whisper for a test drive?
Example: whisper --model base “C:\Voicemails\Mar\Call0000001.m4a”?
Step #2: Use Whisper in UiPath?
Another reminder that this is an art of the possible guide. You wouldn’t want to do some of what I have done below in a production implementation. :)?
领英推荐
Implementation Note?
In my example I will loop through the audio recording files using For Each File in Folder, convert them to text, and then move the files to an archive folder. You will need to research the options for accessing the audio files in your environment and design the best option for you (i.e., trigger based, scheduled, file system versus web interface, etc.).?
3. Drop an Invoke Power Shell UiPath Activity into For Each File In Folder?
Implementation Note?
I believe entering as a script is definitely not a best practice. I am using in this example for maximum flexibility and speed in testing the various options available with Whisper. UiPath provides Input and Parameters properties for Invoke Power Shell that could be used to improve the durability of the automation.?
Also, I am using PowerShell because it’s what I know. UiPath has Python activities that could be used in your automation for interacting with Whisper. Leveraging the Python activities is not covered in this guide. Using the Python activities in place of or together with PowerShell may improve your automation.?
4. Add an Assign activity after Invoke Power Shell?
5. Add a Read Text File activity in the For Each File in Folder activity after Invoke Power Shell and Assign?
6. Use a Message Box (or similar activity) to display txtOutput?
7. Run and test the automation
Step #3: Get insights from OpenAI ChatGPT?
In this step we will setup the UiPath Integration Service connector for OpenAI and add it to our automation. If you don’t have access to Integration Service, this is the end of the adventure, but there’s still a lot you can do with the txtOutput from above, so go have some automation fun!?
Implementation Note?
This will create a personal key. For a production implementation you may need to work with your IT team to create an enterprise key.
2. Configure the OpenAI connector in UiPath Integration Service?
3. Configure the automation to use the OpenAI connector?
Summary
It seems like a lot (it's not)* but once you have this plumbing in place there are many AI plus automation opportunities that light up for the contact center. In this example we took an audio recording, transcribed it with OpenAI Whisper, summarized it with OpenAI ChatGPT, and can use UiPath to update the customer’s record. We might also ask ChatGPT to categorize calls and intents, and then use UiPath to store the data and begin to detect trends. We can use sentiment from ChatGPT to have UiPath update voice of the customer data or in performance reviews. At the start of a call UiPath might download and send recent bills to ChatGPT to summarize the differences for the agent. During the call, ChatGPT might provide insights on next best actions and offers, and UiPath will automate the action on the agent’s behalf.?We could go on and on (and I will try to in a future post)**.?
Finally, if you made it this far you are probably keen to learn more about AI and automation. Here is my plug for UiPath’s AI Summit. You will hear from top UiPath product experts and customers about the opportunity and business value of AI and automation. Register below to view the recordings.??
* I only spent 3 hours setting this up and that includes nearly an hour of troubleshooting a couple knucklehead mistakes I made in the configuration.
** I also know a visual of this solution in action will help and I will try to post a video later.
Lead Architect at VKY Intelligent Automation
1 年Had a go following your steps, so thank you for breaking it down. Took more of the CMD Prompt route (just what I know better) but was able to get Whisper installed and running locally. Thanks for the post Brad
Siyad Sistem Ltd. ?ti. ?irketinde i? Geli?tirme Uzman?
1 年air.ai
AI-Powered Healthcare Automation
1 年well done, Brad!
Ryan Lallier