Enterprise Search with ChatGPT & Speech Synthesis with Azure Text to Speech Avatar
Sankha Chakraborty
Senior Cloud Solution Architect | Generative AI Enthusiast | Cognitive AI | Data Analytics
Artificial Intelligence takes centre stage of any technical conversations in the recent times. More so, after the global excitement generated with the advent of Generative Artificial Intelligence (GenAI) applications powered by Large Language Models (LLMs). GenAI use cases span across domains, such as financial services, information technology, education, entertainment, engineering, and design.
Throughout the course of my professional work, I get opportunity to interact with customers and hear their views on GenAI. Every conversation brings out few very interesting use cases that gets you thinking of ways you can build a solution around.
Before we end the year 2023, I thought to work on a solution that can make conversation AI more engaging. I remembered, the public preview announcement of Azure Text-To-Speech (TTS) Avatar and wanted to try this out. So, off I went.
What is TTS Avatar on Azure AI Speech?
Text to speech avatar converts text into a digital video of a photorealistic human (either a prebuilt avatar or a custom text to speech avatar) speaking with a natural-sounding voice. Please refer the link below for more information:
The easiest way to get started is to use the Speech Studio. Launch the Speech Studio from the deployed Speech service. Choose the Text to speech Avatar (Preview) feature from the 'Text to Speech' capabilities
The Text to Speech (TTS) Avatar playground offers a range of capabilities. We can choose from a range of avatars, different languages and the out-of-the-box voice samples that we want to try. Once we type the text, click on the 'Preview Video' to hear the avatar utter the words in the voice and style selected. Whilst, this is a good, quick way to check the feature out, real-life use cases requires us to use the in-built REST APIs and SDKs that we can use in your applications. The studio provides python samples that exactly does that for you and this is what I've used.
So, what is my use case then?
Throughout the year 2023, one of the most prevalent use cases or requirements I've worked with my customers on, is Enterprise Knowledge Search powered by ChatGPT (Azure OpenAI gpt 3.5 turbo model) using Retrieval Augmented Generation (RAG) pattern.
Here, I have simulated an HR application using which employees at Contoso can search and retrieve information on HR policies using ChatGPT.
领英推荐
Document ingestion & indexing on Azure AI Search
Before we can enable knowledge search, we need to ingest, index and prepare the data ready for search. This is what is depicted in Fig 1 above. The various steps have been described below:
For steps 2 and 3, you can refer the code available in the sample notebook from GitHub: azure-search-vector-samples/demo-python/code/azure-search-custom-vectorization-sample.ipynb at main · Azure/azure-search-vector-samples (github.com)
Enterprise search with ChatGPT + TTS Avatar on Streamlit App
Fig:2 represents the front end application that I've built using Streamlit. The application has the embedded logic that powers the enterprise search with ChatGPT and TTS avatar. Following is the sequence of activities that gets initiated when a user submits his/her search query:
You can adjust the voice quality and other characteristics using Speech Synthesis Markup Language (SSML). Refer the link for more information about SSML - Speech Synthesis Markup Language (SSML) overview - Speech service - Azure AI services | Microsoft Learn
A quick look at the demo app that I built using Streamlit. You can go innovative and craft it the way you want.
Senior Technical Program Manager - Azure OpenAI Service | Customer eXperience Engineering (CxE) ?? ?? @ Microsoft | 39x Azure Certified | GenAI Speaker
1 年Love the use-case and explanation, Sankha Chakraborty
Business Development Manger
1 年A Strategic Guide to Product Modernizing with GenAI Get Your Copy: https://bit.ly/3NhxAjp, #genai #generativeai #generative #artificialintelligence #ai #aitechnology #generativeaitools #generativeartificialintelligence #generativemodels #technologysolutions #productdesign #productdevelopment #productinnovation
Senior Technical Specialist – Data & AI at Microsoft | Expert in Cloud-Native Architecture, Presales, Hybrid Solutions, Generative AI, Data & Database Technologies
1 年Great , thanks for sharing