登录查看更多内容

Redacting Personally Identifiable Information in ChatGPT Content

Rand Morimoto

President - Convergent Computing

发布日期: 2024年4月23日

As Generative A.I. continues to find a place in enterprises for information knowledge sharing and processing, organizations want a way to process the content of data while redacting (hiding / eliminating) personally identifiable information (PII) for privacy reasons.

Most tools to date "block" PII processing or requires the organization to manually "sanitize" content before ingesting it into their GenAI system. Microsoft now has a tool (currently in beta/Preview) that organizations can apply for early access to clean-up sensitive PII information, while retaining the core data of the content for GenAI information processing!

Microsoft Native Document Support for AzureAI

The technology Microsoft has available is called "Native Document Support for AzureAI" that is documented at https://learn.microsoft.com/en-us/azure/ai-services/language-service/native-document-support/use-native-documents?tabs=pii

It currently allows you to process PDF, DOCX, and TXT files

Effectively, upon processing of documents, it can turn an initial document with PII:

Into a document that has the PII redacted:

Preprocess or Pipeline the Redaction

To process the files for redaction, they can be run through a scripted routine to "clean-up" files before they are uploaded to a ChatGPT environment, or an automated process (aka a Pipeline) can be created using Microsoft's new Fabric technology that can automate the redaction process.

Although the content is being redacted, there is likely still sensitive information within the content, and thus using a PRIVATE instance of ChatGPT deployed using Microsoft's AzureAI/ChatGPT will provide an organization an internal instance of ChatGPT the Microsoft maintains privacy and security of content being processed by its OpenAI platform.

I cover the deployment and operations of AzureAI/ChatGPT in several articles I've posted on LinkedIn including - https://www.dhirubhai.net/pulse/leveraging-chatgpt-against-internal-office-365-create-rand-morimoto

Wrap-up

Just an additional step to leverage the power and capabilities of ChatGPT, in a safe, security, and compliant manner.

Per Werngren

Partner Ecosystems Advisor | CEO at Group Accelerator-Idenxt-Awakish | IAMCP 5x WW President

5 个月

Great insights! Thanks Rand!

1 次回应

要查看或添加评论，请登录

Rand Morimoto的更多文章

Building Sophisticated A.I. Chat Models Using GraphRAG

2024年10月8日

Building Sophisticated A.I. Chat Models Using GraphRAG

The latest thing in A.I.
Improving A.I. Chats with Multi-Modal Integration

2024年10月1日

Improving A.I. Chats with Multi-Modal Integration

A.I.
Latest Toy - A Laser Cutter / Engraver - for Home Projects

2024年9月24日

Latest Toy - A Laser Cutter / Engraver - for Home Projects

A little while back I posted an article on the latest 3D printer (toy) we got to fiddle with, and while working on a 3D…

3 条评论
Addressing Key Business Topics - Audit Compliance, Business Continuity, and Data Governance

2024年9月18日

Addressing Key Business Topics - Audit Compliance, Business Continuity, and Data Governance

Beyond technical topics of A.I.

4 条评论
Azure A.I. Model Use, Token Use, and "Good Answers"

2024年9月10日

Azure A.I. Model Use, Token Use, and "Good Answers"

With multiple versions of ChatGPT available now, and an eye on Token consumption to answer questions, the latest focus…
A.I. Chat Tokens Drive Up Azure A.I. Costs

2024年9月3日

A.I. Chat Tokens Drive Up Azure A.I. Costs

There was a time not too long ago that I used to say you couldn't use up enough tokens in a private Azure A.I.

4 条评论
Pinnacle Point is now a UNESCO World Heritage Site!

2024年8月28日

Pinnacle Point is now a UNESCO World Heritage Site!

Just last month I wrote about a vacation we took this summer to South Africa and was toured around by Prof Curtis…

2 条评论
A.I. Chatting WITH Your Filenames - A ChatGPT Game Changer!

2024年8月21日

A.I. Chatting WITH Your Filenames - A ChatGPT Game Changer!

We perfected a process that has significantly improved the way we're able to Chat in our private ChatGPT instances that…

5 条评论
3D Printing - An Easier Go at it This Time Around...

2024年8月13日

3D Printing - An Easier Go at it This Time Around...

I jumped into 3D printing a few years ago, played with it for a few weeks, then gave up as the setup and process was…

6 条评论
CCO Created an A.I. Think Tank to Accelerate Innovation Development!

2024年8月6日

CCO Created an A.I. Think Tank to Accelerate Innovation Development!

We've had incredible success working with 150+ customers on their A.I.

4 条评论

See all articles

Redacting Personally Identifiable Information in ChatGPT Content

Rand Morimoto

President - Convergent Computing

Rand Morimoto的更多文章

社区洞察

其他会员也浏览了

50 Key Improvements: How OpenAI o1 Outperforms ChatGPT-4

How to choose the right version of ChatGPT to meet your needs

Breakthrough: Using OpenAPI Schemas to shape ChatGPT responses

What ChatGPT is. What it Isn't. And why that Matters.

"I Want My Own ChatGPT with My Company's Data"

I Have Some Concerns About ChatGPT

The Perfect ChatGPT Prompt

Does your company need its own ChatGPT or fine-tuned LLM? Probably not

Claude 3.5 vs ChatGPT 4o

Smart Work, Smart AI: 7 ChatGPT Tips for Efficiency

Rand Morimoto的更多文章

Building Sophisticated A.I. Chat Models Using GraphRAG

Improving A.I. Chats with Multi-Modal Integration

Latest Toy - A Laser Cutter / Engraver - for Home Projects

Addressing Key Business Topics - Audit Compliance, Business Continuity, and Data Governance

Azure A.I. Model Use, Token Use, and "Good Answers"

A.I. Chat Tokens Drive Up Azure A.I. Costs

Pinnacle Point is now a UNESCO World Heritage Site!

A.I. Chatting WITH Your Filenames - A ChatGPT Game Changer!

3D Printing - An Easier Go at it This Time Around...

CCO Created an A.I. Think Tank to Accelerate Innovation Development!

社区洞察

其他会员也浏览了

50 Key Improvements: How OpenAI o1 Outperforms ChatGPT-4

How to choose the right version of ChatGPT to meet your needs

Breakthrough: Using OpenAPI Schemas to shape ChatGPT responses

What ChatGPT is. What it Isn't. And why that Matters.

"I Want My Own ChatGPT with My Company's Data"

I Have Some Concerns About ChatGPT

The Perfect ChatGPT Prompt

Does your company need its own ChatGPT or fine-tuned LLM? Probably not

Claude 3.5 vs ChatGPT 4o

Smart Work, Smart AI: 7 ChatGPT Tips for Efficiency