Getting the Most out of Healthcare Data: De-Identification Methods at Your Service

Andrew Mazur

Senior Business Development Manager @ DataArt | Driving Technology Transformation

Published: April 21, 2023

Considerations on Healthcare Data Governance Under GDPR

The digitization of healthcare is in full swing, transforming every industry sector. The introduction of a multitude of digital solutions has led to an explosion of health data, which has quickly become the industry’s most valuable asset. Effective and secure collection, processing, storage, and analysis of health data is essential for healthcare companies, health professionals, and patients, ensuring better health outcomes through more targeted product offerings and data-driven decisions.

Indeed, the sheer volume of valuable health data is enormous. It is constantly being collected from multiple sources such as electronic patient health records, patient-generated data and patient-generated outcomes (e.g., surveys), healthcare applications (including DiGAs), IoT devices and wearables, clinical studies, prescriptions, and medication adherence data. This creates large and complex data sets with tremendous potential to provide valuable insights. However, the healthcare industry itself is complex, highly regulated, and subject to changing rules and constraints, making compliant data operations a challenging task. Addressing these challenges is key to unlocking the power of predictive analytics in healthcare.

Health data collected in electronic patient records is considered patient-related personal data, which under the General Data Protection Regulation (EU) 2016/679 (hereinafter “GDPR”) may, in general, be used for dedicated purposes only. Nevertheless, in certain cases, GDPR and the national data protection regulations of the member states allow such data to be used for specific further purposes. If use of the data outside the scope of the original purposes is permitted by national legislation and GDPR, healthcare companies will usually be required to remove all personally identifiable information (PII) so that the data can no longer be traced back to an individual. The anonymized data can then be used for specific new purposes and may be further analyzed, aggregated, and processed to generate new insights for healthcare entities. These insights can in turn be used to offer highly customized solutions to clients, whether they are patients, medical staff, or research professionals.

How to Securely Work with Personal Data?

The current European data protection law regulates data governance, protection, and security. The GDPR provides the basis to recognize a much more complete spectrum of de-identification.

To date, the GDPR has taken a binary approach to de-identification. Data is either personal data or anonymous. Recital 26 of the GDPR states the following: “The principles of data protection should apply to any information concerning an identified or identifiable natural person. Personal data which have undergone pseudonymization, which could be attributed to a natural person by the use of additional information, should be considered information on an identifiable natural person. To determine whether a natural person is identifiable, account should be taken of all the means reasonably likely to be used, such as singling out, either by the controller or another person to identify the natural person directly or indirectly.”

An effective anonymization solution prevents all parties from singling out an individual in a dataset, linking two records within a dataset (or between two separate datasets), and inferring any identifiable information from such a dataset. Consequently, it is necessary to remove more than just the directly identifiable information to ensure that a data subject can no longer be identified. Depending on the context and the purpose of the data processing, additional measures may be needed to prevent re-identification. Only when all three risks are addressed can the data be considered anonymous.
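One common way to reason about the singling-out risk is a k-anonymity style check: count how many records share the same combination of indirectly identifying attributes (quasi-identifiers) and flag combinations that occur fewer than k times. The source does not name this technique, so the following is only a minimal sketch under that assumption; the column names, the sample records, and the threshold are hypothetical.

```python
from collections import Counter


def group_sizes(records, quasi_identifiers):
    """Count how many records share each combination of quasi-identifier values."""
    return Counter(
        tuple(record[column] for column in quasi_identifiers)
        for record in records
    )


def smallest_group_size(records, quasi_identifiers):
    """Return the size of the smallest quasi-identifier group (0 for no records)."""
    return min(group_sizes(records, quasi_identifiers).values(), default=0)


def records_at_risk(records, quasi_identifiers, k):
    """Return the records whose quasi-identifier combination occurs fewer than k times."""
    sizes = group_sizes(records, quasi_identifiers)
    return [
        record
        for record in records
        if sizes[tuple(record[column] for column in quasi_identifiers)] < k
    ]


# Hypothetical sample data: no names or IDs, yet one record is still unique.
patients = [
    {"age_band": "30-39", "postcode_prefix": "101", "diagnosis": "asthma"},
    {"age_band": "30-39", "postcode_prefix": "101", "diagnosis": "diabetes"},
    {"age_band": "70-79", "postcode_prefix": "805", "diagnosis": "asthma"},
]
quasi = ["age_band", "postcode_prefix"]

print(smallest_group_size(patients, quasi))
print(records_at_risk(patients, quasi, k=2))
```

A record that is alone in its group can be singled out even though it carries no direct identifier, which is why removing names and IDs by itself is not sufficient.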

However, in specific data analytics cases, it is important to keep pieces of sensitive data, such as the user ID, that can potentially be traced back to the individual patient record. Strictly speaking, such data cannot be considered fully anonymous, and the term pseudo-anonymous (pseudonymized, in GDPR terminology) is used instead.
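As an illustration of keeping a stable but indirect identifier, a user ID can be replaced with a keyed hash. The sketch below uses HMAC-SHA256 from the Python standard library; this is one possible approach and not necessarily the one used in the projects described here. The key value and the ID format are placeholders, and in practice the key would be the "additional information" that has to be stored separately from the data.

```python
import hashlib
import hmac


def pseudonymize_user_id(user_id: str, secret_key: bytes) -> str:
    """Derive a stable pseudonym from a user ID with a keyed hash (HMAC-SHA256).

    The same ID and key always yield the same pseudonym, so records can still
    be joined per user, while the original ID is not stored in the output.
    """
    digest = hmac.new(secret_key, user_id.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()


# Placeholder key for illustration only; a real key should come from a
# secrets manager and be kept apart from the pseudonymized data.
SECRET_KEY = b"replace-with-a-key-from-a-secrets-manager"

pseudonym_a = pseudonymize_user_id("patient-0001", SECRET_KEY)
pseudonym_a_again = pseudonymize_user_id("patient-0001", SECRET_KEY)
pseudonym_b = pseudonymize_user_id("patient-0002", SECRET_KEY)

print(pseudonym_a == pseudonym_a_again)  # same user, same pseudonym
print(pseudonym_a == pseudonym_b)        # different users, different pseudonyms
```

Because anyone holding the key can recompute the mapping, data treated this way remains pseudonymized personal data, not anonymous data.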

Various de-identification techniques provide a wide range of valuable tools that can be used to protect individual privacy. These techniques range from relatively weak ones, which reduce privacy risks only to a modest degree, to powerful ones that can effectively eliminate most privacy risks.
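Two frequently used techniques are suppression (dropping a field entirely) and generalization (replacing a precise value with a coarser one). The sketch below is an assumption-laden illustration, not a method taken from this article: the field names, the ten-year age bands, and the three-character postcode prefix are all hypothetical choices.

```python
def generalize_age(age: int, band_width: int = 10) -> str:
    """Replace an exact age with a band such as '30-39'."""
    lower = (age // band_width) * band_width
    return f"{lower}-{lower + band_width - 1}"


def truncate_postcode(postcode: str, keep: int = 3) -> str:
    """Keep only the first characters of a postcode and mask the rest."""
    return postcode[:keep] + "*" * max(len(postcode) - keep, 0)


def generalize_record(record: dict) -> dict:
    """Suppress the name and generalize age and postcode; keep the diagnosis."""
    return {
        "age_band": generalize_age(record["age"]),
        "postcode": truncate_postcode(record["postcode"]),
        "diagnosis": record["diagnosis"],
    }


# Hypothetical input record.
raw_record = {
    "name": "Jane Doe",
    "age": 37,
    "postcode": "10115",
    "diagnosis": "asthma",
}
print(generalize_record(raw_record))
```

How coarse the bands need to be depends on the dataset: wider bands lower the re-identification risk but also reduce analytical value.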

At present, there is no universal method for making data pseudo-anonymous. In fact, every scenario demands a thorough evaluation and a uniquely customized blend of de-identification approaches. The table below enumerates several of these methods:

The data de-identification techniques and methods described in the table above still carry some risk that individual subjects can ultimately be identified. Thus, such data can only be considered pseudo-anonymized. An additional level of data protection can be achieved by employing a combination of technological, organizational, and legal measures. Proper data governance will include, but is not limited to, the following:

An access and permissions policy with appropriate authentication of individuals accessing the data
Definition of a retention policy
Mechanisms to delete the data upon request
Personnel training and education on data governance and protection principles
Collection of the users’ consent, which clearly describes to the subject the purpose of the data collection, how the data will be used, and other relevant details
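Two of the measures listed above, the retention policy and deletion upon request, lend themselves to a short sketch. The example below works on an in-memory list; the one-year retention period, the record layout, and the function names are assumptions made for illustration, and a real system would apply the same logic to its database and backups.

```python
from datetime import datetime, timedelta, timezone

# Assumed retention period; the actual value is a legal and business decision.
RETENTION_PERIOD = timedelta(days=365)


def apply_retention(records, now, max_age=RETENTION_PERIOD):
    """Keep only records collected within the retention period."""
    cutoff = now - max_age
    return [record for record in records if record["collected_at"] >= cutoff]


def delete_subject(records, subject_id):
    """Remove every record that belongs to the given data subject."""
    return [record for record in records if record["subject_id"] != subject_id]


# Hypothetical stored records.
now = datetime(2023, 4, 21, tzinfo=timezone.utc)
stored = [
    {"subject_id": "s-1", "collected_at": datetime(2023, 3, 1, tzinfo=timezone.utc)},
    {"subject_id": "s-2", "collected_at": datetime(2021, 1, 1, tzinfo=timezone.utc)},
    {"subject_id": "s-1", "collected_at": datetime(2022, 12, 24, tzinfo=timezone.utc)},
    {"subject_id": "s-3", "collected_at": datetime(2023, 4, 1, tzinfo=timezone.utc)},
]

after_retention = apply_retention(stored, now)              # drops the 2021 record
after_deletion = delete_subject(after_retention, "s-1")     # erasure request for s-1

print(len(after_retention), len(after_deletion))
```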

Thus, to ensure that data governance is performed correctly, both an experienced technology partner and a legal regulatory consultant with industry knowledge must be involved in all stages of the development of a digital health product.

What Compliant Data Solutions May Look Like

In partnership with NEXTEC medical, DataArt has worked, and continues to work, on solutions for medical device development for the EU market, where de-identified data is used for product analytics and to generate product insights.

These data solutions typically include the following pillars:

Data collection: Data collection is performed by client-side applications (e.g., a mobile app or web app), which monitor user activities. Collected data (events of user actions) is sent to the back-end system on a defined schedule or trigger (e.g., when the app is opened or a specific action is performed). The client ensures that all processing activities, including the use of data for the original purpose, the anonymization, and further analysis of anonymous data, are in accordance with the applicable data protection regulations.
De-identification transformation: Back-end services include de-identification techniques specifically tailored to each application. Usually, full data anonymization is not achieved due to the need to keep certain unique identifiers such as a user ID or member ID. As an organizational measure, during clinical studies, separate and optional consent is provided by the users for this kind of data processing.
Data storage: Raw data is stored securely in encrypted form either on EU-based on-premise infrastructure or on EU-based servers of local and international hosting providers. Raw and de-identified data are kept in separate storage, with a proper permissions schema and data access control applied to all employees. Retention policies are applied to data storage where applicable.
Data transfer and visualization: Pseudo-anonymized data is securely transferred via REST API to data visualization tools, such as Power BI, Looker, and Tableau, for further data visualization and data analytics. Correlation and multivariate analysis then inform product and treatment decisions for the relevant user cohorts.
Data security: All data processing happens in a safe environment with secure connections and authentication between the services. Other standard industry best practices are followed as well.
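The de-identification and storage pillars above can be sketched as a small transformation step. This is a hypothetical outline, not the actual DataArt or NEXTEC medical implementation: the event fields, the allow-list, the PseudonymTable class, and the two in-memory stores are all assumptions made for illustration.

```python
# Only fields on this allow-list are copied into the de-identified event.
ALLOWED_FIELDS = {"event_type", "app_version"}


class PseudonymTable:
    """Maps real user IDs to sequential pseudonyms.

    The mapping itself is sensitive and would be stored with the raw data,
    never alongside the de-identified events.
    """

    def __init__(self):
        self._mapping = {}

    def __call__(self, user_id: str) -> str:
        if user_id not in self._mapping:
            self._mapping[user_id] = f"subject-{len(self._mapping) + 1:04d}"
        return self._mapping[user_id]


def de_identify_event(event: dict, pseudonym_for) -> dict:
    """Drop non-allow-listed fields, swap the user ID, and coarsen the timestamp."""
    cleaned = {key: value for key, value in event.items() if key in ALLOWED_FIELDS}
    cleaned["subject"] = pseudonym_for(event["user_id"])
    cleaned["day"] = event["timestamp"][:10]  # keep only the ISO 8601 date part
    return cleaned


# Hypothetical incoming events from a client application.
incoming = [
    {"user_id": "u-17", "email": "a@example.com", "event_type": "app_opened",
     "app_version": "1.4.0", "timestamp": "2023-04-21T09:15:00Z"},
    {"user_id": "u-17", "email": "a@example.com", "event_type": "survey_completed",
     "app_version": "1.4.0", "timestamp": "2023-04-21T09:20:30Z"},
]

pseudonyms = PseudonymTable()
raw_store = list(incoming)  # stands in for the encrypted raw storage
analytics_store = [de_identify_event(event, pseudonyms) for event in incoming]

print(analytics_store)
```

An allow-list is used here instead of a block-list so that a newly added client field is excluded from analytics by default until someone reviews it.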

Data solutions delivered by DataArt and NEXTEC medical represent a powerful engine that is able to inform the product management teams about the benefits as well as potential weaknesses of their application at the very start of clinical trials. This is key to ensuring a fast decision-making process.

With the correct data solution, it is possible to comply with the strict requirements of the GDPR and still gain valuable insight from anonymized medical data. Insights can then be used to optimize product design and more accurately target offers to users, bringing benefits to all parties in the healthcare ecosystem: patients, care providers, researchers, and manufacturers.

Originally published here.
