Advanced Analytics & Generative AI (end-to-end)
Claudio Mirti
Senior Advanced Analytics & AI Specialist EMEA (Global Black Belt) @Microsoft
Imagine if you could easily build an end-to-end solution that allows you to query any type of data, whether structured or unstructured, in natural language. What if you could create useful dashboards and discover new insights effortlessly?
How integrating unstructured and structured data can provide comprehensive insights and enhance decision-making across various sectors?!
Some ideas / use cases.
Healthcare
Finance
Manufacturing
Legal
But now, let’s dive in:
What if you want to enhance or automate the retrieval of relevant information and provide accurate, context-aware responses by using GPT technology? Uploading or just automating by sending attachments to a team mailbox?
What if you use a hybrid approach with OCR and Large Language Model to get better results without any pre-training?
In this Solution we use Azure Document Intelligence combined with GPT4 vision for unstructured data. With Cosmos DB, the extracted information from the documents is stored and mirroring into Microsoft OneLake allows access the data in near real-time. Some more details around mirroring in Microsoft Fabric can be found here: Mirroring - Microsoft Fabric | Microsoft Learn
From there, Advanced Analytics, BI and AI solutions using Microsoft Fabric, we consume all the information over a dashboard / report. For the document intelligent solution, you can use this GitHub repo and follow the steps.
Document extraction and quick check on the extracted data:
In Microsoft Fabric, you create first a Workspace and a Lakehouse. For more guidance, follow the steps provided here: Lakehouse end-to-end scenario: overview and architecture - Microsoft Fabric | Microsoft Learn
In this example I'm using a public dataset provided from Kaggle from the insurance industry where the goal is Predicting the likelihood of claims for risk assessment and policy pricing. Insurance Claims Dataset (kaggle.com)
As a data replication solution, Mirroring in Fabric is a low-cost and low-latency solution. In this case I have mirrored Cosmos DB and you can see the same information in Microsoft Fabric.
领英推荐
I have created 2 Semantic models. One from the Claims dataset and the other one from Cosmos DB where I have the output of the processed documents.
The Policy subscription distribution was created with the help of Power BI copilot and the other Chart is now representing the documents which have been processed. This example shows how many pages per document. Of course, you can adapt/change and add more metrics or details.
And recently announced as public preview, AI skills allow you to create your own conversational Q&A systems on Fabric using generative AI. You can give your colleagues the experience of simply asking a question and getting a reliable, data-driven answer in return. Here I'm asking, "What age owns more policies?"
And shortly cross-checking with the dashboard, looks like it’s 100% right.
The flow of data from the data source to its destination is key! With the Lineage Capabilities in Microsoft Fabric, you have everything under control.
To effectively secure and govern your data, manage various data types (structured, unstructured, AI-generated) across multiple clouds, and stay updated on current and future regulations is where Microsoft Purview simplifies to ensure a comprehensive and resilient data governance framework.
Some Links
Here the end-to-end video.
Head of Data & Analytics @ MS Reinsurance | Data Platform, Data Governance, Data Quality, Data Management
6 个月Hi Claudio, I hope you are doing well. I’m truly impressed with the solution you've developed and am eager to see it implemented here at MS Re. The potential for leveraging the Advanced Analytics Gen AI is incredibly promising, especially for forward-thinking use cases like improving data quality and forensic analysis. I’m already thinking about how these innovations could redefine our approach and would love to discuss how we can get this project underway. Are you available to kick off this discussion soon? I’m looking forward to collaborating and taking our operations to the next level with this technology. Thanks for leading the charge on such a game-changing initiative. Great job, and I'm excited about making a difference together. Saludos Miguel > "facts based > proving value !"
Even AI Agents need a memory | Principal Product Manager | Azure Cosmos DB | Advisor, Mentor & Coach
7 个月I love this summary Claudio! It shows the "so what" of having an integrated platform for all of data, analytics and AI. You discuss extracting analysis from data. It also works the other way around, letting you drive action based on analysis. Fraud detection becomes prevention if you can react within the time window of the transaction, to take just one example.
Data & AI Lead for Swiss Public Sector Healthcare
7 个月Having had the opportunity to delve deeper with our experts, I am excited for our customers and industry to make use of this!