Intelligent Document Processing: Comparing AWS GenAI and ML Services (Part II)

Hi, we will continue with the second part of the article.

Tokens

A token is a unit of text extracted or identified during the process of tokenization. Tokenization is the task of breaking a sequence of text down into smaller units, which can be words, sub-words, characters, or even phrases. These smaller units are the tokens. For typical English text, a token is approximately four characters long.

As an example, the Azure API supports a maximum of 4,000 tokens shared between the prompt (including system message, examples, message history, and user query) and the model's response. As API calls are charged per token, and you can set a maximum limit for response tokens, you should monitor the current token count to ensure the conversation does not exceed the maximum response token limit.
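As a rough sketch of that monitoring, the snippet below applies the 4,000-token shared limit and the four-characters-per-token heuristic from the text above; a real deployment would count tokens with the provider's actual tokenizer, and the 1,000-token response cap is an assumption for illustration.

```python
# Rough token budgeting for a 4,000-token limit shared between prompt and
# response, using the ~4 characters per token heuristic (approximation only).

MAX_TOKENS = 4000          # shared between prompt and response
CHARS_PER_TOKEN = 4        # rough average for English text

def estimate_tokens(text: str) -> int:
    """Approximate the token count from the character length."""
    return max(1, round(len(text) / CHARS_PER_TOKEN))

def response_budget(prompt: str, max_response_tokens: int = 1000) -> int:
    """Tokens left for the response after the prompt, capped at the response limit."""
    remaining = MAX_TOKENS - estimate_tokens(prompt)
    return max(0, min(remaining, max_response_tokens))
```

Tracking the budget this way before each call keeps the conversation from silently truncating the model's response.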

Token Generation

The first step in analyzing a corpus is to break it down into tokens. For simplicity, you can think of each distinct word in the training text as a token, although in reality tokens can be generated for partial words or for combinations of words and punctuation. The number of tokens in a document is what drives the price calculation.

For example, the word "hamburger" is split into the tokens ham, bur, and ger, while a short, common word like "pear" is a single token.
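To make the idea concrete, here is a toy greedy longest-match tokenizer over a tiny hypothetical vocabulary; real sub-word tokenizers (BPE, WordPiece) learn their vocabularies from data, so both the vocabulary and the matching strategy below are illustrative only.

```python
# Toy greedy longest-match tokenizer over a hypothetical sub-word vocabulary,
# illustrating how "hamburger" can split into ham / bur / ger.

VOCAB = {"ham", "bur", "ger", "pear"}  # made-up vocabulary for the example

def tokenize(word: str) -> list[str]:
    """Split a word into the longest known pieces, with a per-character fallback."""
    tokens, i = [], 0
    while i < len(word):
        for j in range(len(word), i, -1):  # try the longest piece first
            if word[i:j] in VOCAB:
                tokens.append(word[i:j])
                i = j
                break
        else:
            tokens.append(word[i])  # unknown character becomes its own token
            i += 1
    return tokens

print(tokenize("hamburger"))  # → ['ham', 'bur', 'ger']
print(tokenize("pear"))       # → ['pear']
```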

We won't delve deeply into the machine-learning text classification technique in this article, because its complexity (text preparation, logistic regression, and so on) would take considerable time to cover.

LangChain

LangChain is an open-source framework for building applications based on large language models (LLMs). LLMs are large deep learning models, pre-trained on vast amounts of data, that can generate responses to user queries, such as answering questions or creating images from text-based prompts.

Intelligent Document Processing:

In our problem, we will focus on some of those steps. For the same function, we will "compare" solutions with and without generative AI: in this case, AWS Bedrock versus the equivalent Textract + Comprehend services. In either case, the documents arrive in S3 (on AWS), and an event is triggered when a new document lands. This event starts and executes the IDP steps, which are self-explanatory:
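The event trigger described above can be sketched as a small AWS Lambda handler subscribed to `s3:ObjectCreated` events; the bucket/key parsing follows the standard S3 event notification shape, while the pipeline steps themselves are stubbed placeholders.

```python
# Minimal sketch of the S3-triggered entry point for the IDP pipeline.
# The event shape is the standard S3 event notification; the IDP steps
# (classification, extraction, enrichment) are only stubbed here.

def extract_s3_objects(event: dict) -> list[tuple[str, str]]:
    """Return (bucket, key) pairs from an S3 event notification."""
    return [
        (r["s3"]["bucket"]["name"], r["s3"]["object"]["key"])
        for r in event.get("Records", [])
    ]

def handler(event: dict, context=None):
    for bucket, key in extract_s3_objects(event):
        # 1. classify the document, 2. extract fields, 3. enrich/redact
        print(f"New document: s3://{bucket}/{key}")
```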

  • Doc Classification Step: we need two functions at this step, classification training and classification inference, for any kind of document. In our architecture, these are the functions of this service that we need:

Using AWS ML services:

  1. Textract APIs: to extract data: structured, semi-structured, and unstructured, based on queries for invoices (NF-e) and receipts (for example) or identity documents
  2. Comprehend APIs: for entity identification: identity detection, training of custom entities, or detection of custom entities
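As a sketch of how the Comprehend side might be consumed, the helper below filters a `DetectEntities`-style response by confidence; the response shape mirrors Comprehend's documented output, the `boto3` call appears only as a comment, and the 0.9 score threshold is an assumption.

```python
# Filter entities from a Comprehend DetectEntities-style response.
# In a real deployment the response would come from:
#   boto3.client("comprehend").detect_entities(Text=text, LanguageCode="en")

def confident_entities(response: dict, min_score: float = 0.9) -> dict[str, list[str]]:
    """Group entity texts by type, keeping only high-confidence detections."""
    out: dict[str, list[str]] = {}
    for ent in response.get("Entities", []):
        if ent["Score"] >= min_score:
            out.setdefault(ent["Type"], []).append(ent["Text"])
    return out

sample = {"Entities": [
    {"Type": "PERSON", "Text": "Ricardo", "Score": 0.99},
    {"Type": "DATE", "Text": "yesterday", "Score": 0.42},
]}
print(confident_entities(sample))  # → {'PERSON': ['Ricardo']}
```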

Using AWS GenAI services:

  1. Bedrock APIs: we choose among all the available LLM options, such as Titan. We chose Anthropic's Claude as the better fit for our case, and we used LangChain and Python to access Claude.
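A sketch of that classification call follows. The prompt wording, label set, and model name are illustrative assumptions, not the article's exact setup, and the LangChain invocation (assuming the `langchain-anthropic` integration) is left as a comment since it needs credentials.

```python
# Build a zero-shot document-classification prompt for Claude.
# The labels and wording below are hypothetical examples.

LABELS = ["invoice", "receipt", "identity document", "other"]

def build_classification_prompt(doc_text: str, labels: list[str] = LABELS) -> str:
    options = ", ".join(labels)
    return (
        "Classify the following document into exactly one of these "
        f"categories: {options}.\n"
        "Answer with the category name only.\n\n"
        f"Document:\n{doc_text}"
    )

# With LangChain (assumed integration package, requires an Anthropic API key):
#   from langchain_anthropic import ChatAnthropic
#   llm = ChatAnthropic(model="claude-3-haiku-20240307")
#   label = llm.invoke(build_classification_prompt(text)).content.strip()
```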


  • Doc Extraction Step: the same applies at this step.

In our architecture, and following the function of this service, we need:

Using AWS ML services:

  1. Textract APIs: extract data from the document: structured, semi-structured, and unstructured, based on queries, invoices and receipts (where applicable), and identity documents
  2. Comprehend APIs: identification of entity names of interest from the extracted text: identity detection, training of custom entities, and detection of custom entities

Using AWS GenAI services:

  1. Bedrock APIs: same functions as above
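To illustrate the query-based extraction on the Textract side, the sketch below pairs QUERY blocks with their QUERY_RESULT answers. The block shape follows Textract's `AnalyzeDocument` response, the `boto3` call is shown only as a comment, and the sample values are made up.

```python
# Pair QUERY blocks with their QUERY_RESULT answers from a Textract
# AnalyzeDocument-style response. Real responses come from:
#   boto3.client("textract").analyze_document(..., QueriesConfig=...)

def query_answers(response: dict) -> dict[str, str]:
    """Map each query's text to the text of its answer block."""
    blocks = {b["Id"]: b for b in response.get("Blocks", [])}
    answers = {}
    for b in blocks.values():
        if b["BlockType"] != "QUERY":
            continue
        for rel in b.get("Relationships", []):
            if rel["Type"] == "ANSWER":
                for rid in rel["Ids"]:
                    answers[b["Query"]["Text"]] = blocks[rid]["Text"]
    return answers

sample = {"Blocks": [
    {"Id": "q1", "BlockType": "QUERY", "Query": {"Text": "What is the total?"},
     "Relationships": [{"Type": "ANSWER", "Ids": ["a1"]}]},
    {"Id": "a1", "BlockType": "QUERY_RESULT", "Text": "$42.00"},
]}
print(query_answers(sample))  # → {'What is the total?': '$42.00'}
```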


  • Doc Enrichment Step: the same applies at this step.

In our architecture, and following the function of this service, we need:


  • Using both the ML services and GenAI, the functions will be:

    • Redaction of PII (personally identifiable information) and PHI data (where applicable)
    • Tagging of the document
    • Enrichment of the document's metadata
    • Legal retention
    • With Bedrock, additionally: summarization, normalization, and Q&A
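A minimal sketch of the redaction function follows, using regular expressions for two illustrative PII patterns (e-mail addresses and Brazilian CPF numbers); in production this step would be driven by Comprehend's PII detection or an equivalent service rather than hand-written patterns.

```python
import re

# Hypothetical regex-based PII redaction for two example patterns.
# Comprehend's detect_pii_entities would normally drive this step.

PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "CPF": re.compile(r"\b\d{3}\.\d{3}\.\d{3}-\d{2}\b"),  # Brazilian tax ID
}

def redact(text: str) -> str:
    """Replace each detected PII span with a [TYPE] placeholder."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

print(redact("Contact ana@example.com, CPF 123.456.789-09"))
# → Contact [EMAIL], CPF [CPF]
```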

Comparing generative AI with traditional Machine Learning (ML)

When comparing Generative AI, such as models like GPT (Generative Pre-trained Transformer), with traditional Machine Learning (ML) approaches, there are several advantages:

  1. Versatility and Adaptability: Generative AI models are pre-trained on large datasets and can adapt to a wide range of tasks without task-specific training. Traditional ML models often require more customization and specific feature engineering for each task.
  2. Contextual Understanding: Generative models, especially language models, have a better grasp of contextual information. They can understand and generate human-like text based on context, making them suitable for natural language understanding and generation tasks.
  3. Few-shot and Zero-shot Learning: Generative models can perform few-shot and zero-shot learning, meaning they can make accurate predictions with very few examples or even without specific examples for a task. Traditional ML models may struggle with limited data.
  4. Continuous Learning: Generative models can be fine-tuned on new data to adapt to specific domains or tasks, allowing for continuous learning and improvement over time. Traditional ML models might require retraining from scratch.
  5. Creativity and Novelty: Generative models can be creative in generating new content or ideas. They are capable of producing novel outputs, making them valuable for creative tasks such as content generation, art, and brainstorming.
  6. Language Understanding: For language-related tasks, Generative AI excels in understanding and generating text. It can handle context, nuances, and varying sentence structures better than traditional ML models.
  7. Reduced Feature Engineering: Generative models often require less explicit feature engineering compared to traditional ML models. They can learn complex patterns and representations from data on their own.

While Generative AI has these advantages, it's important to note that traditional ML approaches still have their place in scenarios with well-defined tasks, large labeled datasets, and where interpretability or explainability is critical. The choice between Generative AI and traditional ML depends on the use case.

Price comparison:

Let's perform the comparison on a large set of documents: 18k docs × 53 pages on average = 954,000 pages per month. Working through the exercise:

  • Textract + Comprehend = $322,071 + $4,293 = $326,364 for the entire monthly cycle
  • Bedrock: 470 tokens per doc × 18k docs per month = 8,460,000 tokens × 1.5 output factor = 12,690,000 tokens × $0.00240 per token = US$30,456 per month
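The arithmetic above can be checked in a few lines; the figures are taken directly from the estimate, reading the rate as $0.00240 per token.

```python
# Reproduce the cost estimate from the comparison above.

ml_cost = 322_071 + 4_293            # Textract + Comprehend, monthly
tokens = 470 * 18_000                # tokens per doc x docs per month
tokens_with_output = tokens * 1.5    # 1.5x output factor
bedrock_cost = tokens_with_output * 0.00240

print(ml_cost)                           # → 326364
print(round(bedrock_cost))               # → 30456
print(round(ml_cost / bedrock_cost, 1))  # → 10.7
```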

In conclusion:

Bedrock comes out almost 10 times cheaper, on top of all the benefits described above. (Note that, at the time of writing, some of the services in both options are not yet available in the São Paulo region.)





