Decoding Documents: A Deep Dive into Azure AI Document Intelligence
Azure AI Document Intelligence is a powerful AI-based OCR extraction tool offered by Microsoft. In this two-part series, I will guide you through using Azure AI Document Intelligence to build and train a custom model for extracting metadata from research papers. For demonstration purposes, we will work on a small project to extract the following metadata fields: Author, Title, Published Date, and Publisher
We will use the free tier version F0 for this demo to minimize costs. In Part 1, we will focus on understanding custom model features, while Part 2 will explore LabellingUX, including document labeling, securing workflows, and improving the model
Topics Covered
Part 1: Building and Validating a Custom Model
Part 2: Enhancing the Project with LabellingUX
Introduction to Azure AI Document Intelligence
Azure AI Document Intelligence is a cloud-based service that leverages machine learning to extract text, tables, and key-value pairs from documents.
Key Features:
Project Overview: What We Are Building
In this project, we will create a custom model using Azure AI Document Intelligence to automatically extract metadata from research papers available online. The targeted metadata fields are:
Setting Up the Project
Select Custom Extraction Model and complete the following steps
Preparing the Dataset
We will source research papers from the following websites:
领英推荐
Training Set:
Select a small subset of documents for training purposes.
Validation Set:
Set aside another subset of documents to test the model.
Training the Custom Model
Run Layout Analysis
Labeling the Documents
Add metadata fields, and for Published Date, set the subtype to "date" with the desired format (e.g., dd/mm/yyyy).
Training the Model
Note: Training takes time and incurs costs. It's advisable to train after labeling all necessary documents. Check the training progress in the Models section.
Validating the Model
Evaluation
The output JSON contains: