登录查看更多内容

Enterprise Search with ChatGPT & Speech Synthesis with Azure Text to Speech Avatar

Sankha Chakraborty

Senior Cloud Solution Architect | Generative AI Enthusiast | Cognitive AI | Data Analytics

发布日期: 2023年12月29日

Artificial Intelligence takes centre stage of any technical conversations in the recent times. More so, after the global excitement generated with the advent of Generative Artificial Intelligence (GenAI) applications powered by Large Language Models (LLMs). GenAI use cases span across domains, such as financial services, information technology, education, entertainment, engineering, and design.

Throughout the course of my professional work, I get opportunity to interact with customers and hear their views on GenAI. Every conversation brings out few very interesting use cases that gets you thinking of ways you can build a solution around.

Before we end the year 2023, I thought to work on a solution that can make conversation AI more engaging. I remembered, the public preview announcement of Azure Text-To-Speech (TTS) Avatar and wanted to try this out. So, off I went.

What is TTS Avatar on Azure AI Speech?

Text to speech avatar converts text into a digital video of a photorealistic human (either a prebuilt avatar or a custom text to speech avatar) speaking with a natural-sounding voice. Please refer the link below for more information:

Text to speech avatar overview - Speech service - Azure AI services | Microsoft Learn

The easiest way to get started is to use the Speech Studio. Launch the Speech Studio from the deployed Speech service. Choose the Text to speech Avatar (Preview) feature from the 'Text to Speech' capabilities

The Text to Speech (TTS) Avatar playground offers a range of capabilities. We can choose from a range of avatars, different languages and the out-of-the-box voice samples that we want to try. Once we type the text, click on the 'Preview Video' to hear the avatar utter the words in the voice and style selected. Whilst, this is a good, quick way to check the feature out, real-life use cases requires us to use the in-built REST APIs and SDKs that we can use in your applications. The studio provides python samples that exactly does that for you and this is what I've used.

So, what is my use case then?

Throughout the year 2023, one of the most prevalent use cases or requirements I've worked with my customers on, is Enterprise Knowledge Search powered by ChatGPT (Azure OpenAI gpt 3.5 turbo model) using Retrieval Augmented Generation (RAG) pattern.

Here, I have simulated an HR application using which employees at Contoso can search and retrieve information on HR policies using ChatGPT.

领英推荐

From GPT-4 to Microsoft 365 Copilot

Habibur Rahman 2 年前

OpenAI is So NOT Done

Bhasker Gupta 1 年前

Microsoft Co-pilot vs ChatGPT: Which one is better?

ZNet Technologies Private Limited 6 个月前

Document ingestion & indexing on Azure AI Search

Fig: 1 Document ingestion and indexing flow

Before we can enable knowledge search, we need to ingest, index and prepare the data ready for search. This is what is depicted in Fig 1 above. The various steps have been described below:

Ingestion of data into Azure storage account using Azure Data Factory (ADF). I have used blob storage to consolidate all heterogenous documents (PPT, PDF, etc.) in one place
In this step, I have broken down the documents into smaller chunks using custom Web API skills on Azure AI Search and stored in a separate Azure storage container. Please refer the link to know more about this property Custom Web API skill in skillsets - Azure AI Search | Microsoft Learn
In this step, we index individual chunks (documents) along with their vector embeddings in an Azure AI Search index

For steps 2 and 3, you can refer the code available in the sample notebook from GitHub: azure-search-vector-samples/demo-python/code/azure-search-custom-vectorization-sample.ipynb at main · Azure/azure-search-vector-samples (github.com)

Enterprise search with ChatGPT + TTS Avatar on Streamlit App

Fig 2: Enterprise search using ChatGPT & TTS Avatar using Streamlit application

Fig:2 represents the front end application that I've built using Streamlit. The application has the embedded logic that powers the enterprise search with ChatGPT and TTS avatar. Following is the sequence of activities that gets initiated when a user submits his/her search query:

The search query is converted into a vector query and directed to the chunk index. The search returns top N relevant documents. I have used Top 5 in my case
The search result is then augmented with the search query and passed as a prompt to GPT 3.5 Turbo model. The result is passed is used in two ways
The text output is first shown in the front end of the Streamlit application. The other section of the front end performs the speech synthesis and generates the video output showing the TTS Avatar reading out the same text. This makes the whole search experience more engaging from accessibility standpoint and multi-modal

You can adjust the voice quality and other characteristics using Speech Synthesis Markup Language (SSML). Refer the link for more information about SSML - Speech Synthesis Markup Language (SSML) overview - Speech service - Azure AI services | Microsoft Learn

A quick look at the demo app that I built using Streamlit. You can go innovative and craft it the way you want.

Fig 3: Custom application using Streamlit

Subhasish G.

Senior Technical Program Manager - Azure OpenAI Service | Customer eXperience Engineering (CxE) ?? ?? @ Microsoft | 39x Azure Certified | GenAI Speaker

1 年

Love the use-case and explanation, Sankha Chakraborty

1 次回应

Mrunali B

Business Development Manger

1 年

A Strategic Guide to Product Modernizing with GenAI Get Your Copy: https://bit.ly/3NhxAjp, #genai #generativeai #generative #artificialintelligence #ai #aitechnology #generativeaitools #generativeartificialintelligence #generativemodels #technologysolutions #productdesign #productdevelopment #productinnovation

1 次回应

Ajay Kumar Barun

Senior Technical Specialist – Data & AI at Microsoft | Expert in Cloud-Native Architecture, Presales, Hybrid Solutions, Generative AI, Data & Database Technologies

1 年

Great , thanks for sharing

2 次回应

查看更多评论

要查看或添加评论，请登录

Sankha Chakraborty的更多文章

API for GraphQL (Preview) - A new efficient data consumption pattern on Microsoft Fabric

2024年7月23日

API for GraphQL (Preview) - A new efficient data consumption pattern on Microsoft Fabric

I have been working on Microsoft Fabric for almost a year now. It’s truly exciting to witness such rapid innovations in…

3 条评论
Data ingestion patterns on Microsoft Fabric

2024年3月4日

Data ingestion patterns on Microsoft Fabric

Microsoft Fabric has revolutionized the way we looked at end-to-end data analytics at scale. It's AI-powered features…

6 条评论
Process IoT data using Real Time Analytics on Microsoft Fabric

2023年8月29日

Process IoT data using Real Time Analytics on Microsoft Fabric

#MicrosoftFabric makes it super seamless to get started with #RealTimeAnalytics with it's easy-to-use, no-code…
Azure OpenAI Service on your data

2023年6月21日

Azure OpenAI Service on your data

Azure OpenAI on your data enables us to run supported chat models such as ChatGPT and GPT-4 on your data without…

1 条评论
Update Purview Data Assets with REST API at Scale

2023年4月4日

Update Purview Data Assets with REST API at Scale

Enterprise Data governance is one of the key drivers that a 'Data Driven' organization needs, to thrive in today's…

7 条评论
Azure Synapse Link for SQL (Preview)

2022年7月1日

Azure Synapse Link for SQL (Preview)

One of the key announcements in Microsoft Build 2022 was the Microsoft Intelligent Data Platform that accelerates…

1 条评论
Fail Activity - An effective way to induce custom exceptions in Data Factory & Azure Synapse Analytics Pipelines

2022年5月10日

Fail Activity - An effective way to induce custom exceptions in Data Factory & Azure Synapse Analytics Pipelines

Background: Modern data integration workflows often require application developers to induce custom exceptions in data…

2 条评论

See all articles

Enterprise Search with ChatGPT & Speech Synthesis with Azure Text to Speech Avatar

Sankha Chakraborty

Senior Cloud Solution Architect | Generative AI Enthusiast | Cognitive AI | Data Analytics

领英推荐

Sankha Chakraborty的更多文章

社区洞察

其他会员也浏览了

GEMMA, Google's New LLM Model Powered by Gemini Technology

Latest AI, Crypto News Headlines for June 8, 2023

March 2023 Madness: Top 21 AI News and Updates!

Transforming Human Resources: The Impact of AI with Insights from Microsoft

ChatGPT Integration with Microsoft Azure Services

Managing the Risks of Generative AI and LLMs Through Technological Advances

Top 100 Popular AI Tools: Revolutionizing Technology in 2024.

Generative AI: Applications and Business Opportunities

Which AI Platform Should You Choose? A Comprehensive Overview of the Leading AI Platforms

Deepseek vs OpenAI

领英推荐

Sankha Chakraborty的更多文章

API for GraphQL (Preview) - A new efficient data consumption pattern on Microsoft Fabric

Data ingestion patterns on Microsoft Fabric

Process IoT data using Real Time Analytics on Microsoft Fabric

Azure OpenAI Service on your data

Update Purview Data Assets with REST API at Scale

Azure Synapse Link for SQL (Preview)

Fail Activity - An effective way to induce custom exceptions in Data Factory & Azure Synapse Analytics Pipelines

社区洞察

其他会员也浏览了

GEMMA, Google's New LLM Model Powered by Gemini Technology

Latest AI, Crypto News Headlines for June 8, 2023

March 2023 Madness: Top 21 AI News and Updates!

Transforming Human Resources: The Impact of AI with Insights from Microsoft

ChatGPT Integration with Microsoft Azure Services

Managing the Risks of Generative AI and LLMs Through Technological Advances

Top 100 Popular AI Tools: Revolutionizing Technology in 2024.

Generative AI: Applications and Business Opportunities

Which AI Platform Should You Choose? A Comprehensive Overview of the Leading AI Platforms

Deepseek vs OpenAI