登录查看更多内容

DATA CLASSIFICATION IN ADDITION TO DATA LOSS PREVENTION

Sanobar Khan

Product Manager | Identity and Access Management, Data Leakage Prevention, Data Migration, Data Classification

发布日期: 2024年4月13日

Governments and organizations need to adopt new guidelines for user safety in cyberspace as a result of the growing digitization of society. The need for software for the classification and protection of sensitive data is growing along with the limitations associated with its processing and storage. When working with financial data (accounting, finance), personal data (sales, HR), or data that gives them a competitive edge (projects, price lists), almost every employee creates sensitive data. There are several methods to handle this data: it may be copied, cut into restructuring paraphrased, and so on...

Data protection becomes much more difficult as a result. Data that has been organized into a table or document with an agreed-upon layout may be protected rather easily. Nevertheless, if the data is unstructured and consists of memos, emails, or non-standard contracts, it is far worse.

What steps can we take to secure the data?

The answer lies in three main areas: Encrypt them – data encryption. Classify them into a given category – data labeling. Enforce protection rules on them – data loss prevention.

Data classification in the DLP (Data Loss Prevention)

The system uses parameter-based scanning (personal information, credit card information, self-defined criteria) to find sensitive data, and it subsequently applies policies depending on the search findings. What constitutes sensitive data and how it is handled are centrally decided by the IT department with business cooperation. The DLP system is the sole option available for proactive implementation of policies because the IT security department is ultimately responsible for digital data. "Get the manager's confirmation if the file contains personal data and is to be sent outside the corporate network," for instance.

Classification in Data Labeling systems

Consider user is aware of the type of data he is dealing with and uses the appropriate label (such as "Public" or "Private") to categorize it. In this instance, the worker becomes a point of contact for the IT department by actively engaging in the data classification procedure, which reduces the overall number of false-positive outcomes while simultaneously enhancing information management knowledge. As an example, "I classify files related to important business initiatives as secret." Note that while data labeling solutions let you "tag" data, they do not support remedial actions like blocking or alerting.

领英推荐

How do you manage data transfers and sharing across…

Anil Patil ??"PrivacY ProdigY"?? 8 个月前

''Is Your Data Putting You At Risk? How To…

Keith Budden 5 个月前

Data Remediation and Its Role in Data Security and…

Javid Ur Rahaman 2 年前

Why is knowing who owns the data so crucial?

Because of the aforementioned variations, using a single method to identify the information we deem crucial to the business puts us at risk of missing something or classifying files incorrectly. This will increase the number of false positives or put us in immediate danger of legal action. Classification errors are mostly caused by complex internal connections (mostly involving unstructured data) and a high degree of subjectivity; for example, an employee's definition of "public" may differ from the company's definition of "secret." Here, data owner awareness is essential.

How do I classify data with the use of a DLP system?

To classify data, DLP systems have built-in classifiers:

dictionaries / keywords – searches based on comparison of strings;
regular expressions—regex—make it possible to match the classifier to the type of data, but remember that a “narrow” regex (too narrowly defined) will catch everything;
scripts – these are regexes plus a validating algorithm, e.g. not every 11 digits is a PESEL number;
machine learning – the system “learns” from the test data and then detects documents with a certain probability for the test sequence, here the key lies in choosing the right test data,
fingerprinting – a magnum of DLP systems – we create “hashes” of sensitive data, e.g. fragments of a commercial contract, and use them to compare with data detected elsewhere; resistant to file structure changes, detect unstructured data.

But what if anything is overlooked by the system?

Well, what if the system incorrectly or never qualifies something in our mountain of paperwork (and it is hard to find someone who would have complete control over the papers generated and handled in the organization)? This is where data labeling comes in handy, enabling the user to categorize the document that he is working on. The document's creator will be the most knowledgeable source when it comes to classification.

What are the advantages of combining data labeling with DLP, then?

Complete and Accurate Classification : A file is classed twice when a data labeling policy is added to a DLP system: once by the IT department and once by the user. Files that one of the parties ignored can be categorized with this help (the DLP system categorizes the data without a “label,” and the user can add a “label” to a file that DLP skips).
Double validation – What would happen if the employee marked the file as "Public" and attempted to transfer it via email to someone outside the company, and the DLP system discovered that the file included customer data? Security professionals may manage such scenarios and fix errors by utilizing both platforms.
Reduction of false-positives – the outcome of double validation, but it's important enough that it deserves its own discussion topic. Double validation allows us to greatly enhance our experts' job when the system is in use across the entire organization. Would you be able to imagine managing five thousand notifications a day effectively?

The next development in data protection is the combination of DLP and Data Labeling systems, which enable users to collaborate with security teams and foster a culture of data usage. We support the growth of knowledge and best practices for handling data within the organization in addition to adding another layer of security.

Shaikh Muzammil

Helping Colleges & Universities adopt AI | Automating Educational Institutions | ERP Consultant @ QualCampus ERP for Education |

11 个月

very well written.

2 次回应

查看更多评论

要查看或添加评论，请登录

Sanobar Khan的更多文章

Impacts of Adding Humour to Team Meetings

2023年12月11日

Impacts of Adding Humour to Team Meetings

One tool you may use to increase the productivity of your team meetings is humour at work. A useful strategy for…

3 条评论
Secure Software Development Life Cycle

2023年5月29日

Secure Software Development Life Cycle

People need secure products straight out of the box, thus security needs to be everyone's primary focus. But it is…

2 条评论
Born as a Girl Who Wears Many Hats!

2023年3月7日

Born as a Girl Who Wears Many Hats!

I dreamed of being the most renowned face. Well, that’s the thought I have been breeding since I was a kid.

2 条评论
Private/Protected Information

2023年1月4日

Private/Protected Information

What is Private/Protected Information when it comes to security? Protecting consumer information has to be one's…
OT Network Segmentation

2022年11月30日

OT Network Segmentation

Network Segmentation Network segmentation is a physical security layer that separates one network from another, such as…
Vulnerability

2022年8月22日

Vulnerability

What is a Vulnerability? As basic as this question may sound, a thousand-mile journey starts with one step…
Threat Modeling

2022年5月30日

Threat Modeling

Threat Modelling Threat modelling is a method of improving application, system, or business process security by…
Don’t Scan a Scam!

2022年5月17日

Don’t Scan a Scam!

What is QR Code ? A QR code (Quick Response Code) is a sort of matrix barcode (or two-dimensional barcode). A barcode…

2 条评论
Access Recertification - An IT Control for Preventing Unauthorized Access

2022年5月11日

Access Recertification - An IT Control for Preventing Unauthorized Access

Access Recertification Access recertification is an IT control that includes reviewing user access rights to verify if…

1 条评论
CyberArk

2022年5月9日

CyberArk

CyberArk is a security tool with a strong capability to address enterprises' cybersecurity demands. Organizations do…

See all articles

DATA CLASSIFICATION IN ADDITION TO DATA LOSS PREVENTION

Sanobar Khan

Product Manager | Identity and Access Management, Data Leakage Prevention, Data Migration, Data Classification

What steps can we take to secure the data?

Data classification in the DLP (Data Loss Prevention)

Classification in Data Labeling systems

领英推荐

Why is knowing who owns the data so crucial?

How do I classify data with the use of a DLP system?

But what if anything is overlooked by the system?

What are the advantages of combining data labeling with DLP, then?

Sanobar Khan的更多文章

社区洞察

其他会员也浏览了

Mastering Copilot with Purview DLP and Auto Labeling: Enhancing Data Security and Compliance

How SysTools Adheres to Data Wiping Standards to Ensure Complete Data Erasure

Data Validation for Row-Level Security (RLS)

Navigating the Data Maze: Top Challenges Executives Face Today

Dangerous Animals of Data Management

How MDM helps preparing for the Kenyan Data Protection Bill

Data Masking: Protecting Sensitive Information

The silent threat of APIs: Inadvertent transfer of excess data

Data Discovery and Classification: A Pillar of Data Security

Data Protection Audit Checklist

What steps can we take to secure the data?

Data classification in the DLP (Data Loss Prevention)

Classification in Data Labeling systems

领英推荐

Why is knowing who owns the data so crucial?

How do I classify data with the use of a DLP system?

But what if anything is overlooked by the system?

What are the advantages of combining data labeling with DLP, then?

Sanobar Khan的更多文章

Impacts of Adding Humour to Team Meetings

Secure Software Development Life Cycle

Born as a Girl Who Wears Many Hats!

Private/Protected Information

OT Network Segmentation

Vulnerability

Threat Modeling

Don’t Scan a Scam!

Access Recertification - An IT Control for Preventing Unauthorized Access

CyberArk

社区洞察

其他会员也浏览了

Mastering Copilot with Purview DLP and Auto Labeling: Enhancing Data Security and Compliance

How SysTools Adheres to Data Wiping Standards to Ensure Complete Data Erasure

Data Validation for Row-Level Security (RLS)

Navigating the Data Maze: Top Challenges Executives Face Today

Dangerous Animals of Data Management

How MDM helps preparing for the Kenyan Data Protection Bill

Data Masking: Protecting Sensitive Information

The silent threat of APIs: Inadvertent transfer of excess data

Data Discovery and Classification: A Pillar of Data Security

Data Protection Audit Checklist