”Big Data and Analytics” in Healthcare

After increasing importance in the financial industry, logistics and production, “big data and analytics” have rapidly started to gain momentum in healthcare as well. The potential of this emerging, IT-driven wave of innovation is huge. “Big data” is supposed to make medicine more affordable and better. Data transparency and analysis are the drivers towards value-based medicine where the basis of payment is not the resulting medical expense, but the success of treatment for the individual patient.

A lot of hope is associated with “big data” in medicine: treatment costs are to be reduced, hospital stays are to be shortened, individualized therapy plans are to be enabled or optimized and the submission of false claims is to be combatted. Accordingly, the market potential is huge. The “big data and analytics” market in the US is valued at US $ 80 billion for the year 2020. These prospects are increasingly drawing in companies such as Google, Apple, IBM or salesfore.com on top of medical technology companies native to the healthcare market.. It will be fascinating to see whether and how these new players assert themselves in the jungle of healthcare systems.

 

Drivers of “Big Data” in medicine

The explosion of “big data” in medicine is based on three factors:

  1. There is an exponential growth of digitized information. This is due to the increasing digitalization of formerly analog media (images, reports, lab results, etc.) as well as the continuous optimization of diagnostic laboratory and imaging sensors. Thus, cutting edge CT systems allow the recording of several thousand projections in only 1 second. Within 10 seconds, the entire body is depicted in 2 mm thin layers. Add to this new patient data such as imaging data or molecular or genetic information. Furthermore, growing amounts of data are generated by increased monitoring with sensors of all kinds. Such monitoring is by no means limited to patients any longer, but also encompasses “healthy” people in ever greater numbers. Smartphones, Fitbits and so on continuously produce huge floods of data. Correspondingly, structured, semi-structured and unstructured data from a variety of sources coexist today, which jointly constitute “big data”.

2. The biological processes underpinning modern medicine are increasingly better understood in their heterogeneity. The cause of this improved understanding is the deciphering of the human genome, through which genetics determines whether and how people respond to different therapies. The associated differentiation is the basis of “individualized medicine”, to which prognostic risk stratification is inherent. Diagnosis and treatment are thus growing exponentially complex and thus increasingly overwhelm any given physician. Big data analytics are needed to filter out relevant, differentiating information from the resulting flood of data and, based on this, to help make correct, individualized treatment decisions.

3. The technological development of cloud-based IT solutions has cleared important thresholds in several key areas. Cloud Computing allows data from different sources and of different quality, modality and structure to be unified information technically into a whole "Big Data". Transparency and networking coupled with mobility make data available independently of time and space. Moreover, available data are useful for the development of novel algorithms (e.g. closest neighbor analysis, deep machine learning). This in turn allows conclusions to be drawn for individual patients (decision support) as well as for whole populations (population health).

 

Big Data Challenges

Together, the above-described drivers are causing an explosion of “big data” in medicine. However, there are also significant challenges. Technically, it is important to master the three "Vs" in the domain of big data: huge amounts of data (volume), great variety of data (variety) and high speed of data generation and processing (velocity). In this regard, the technical development is making rapid progress.

Apart from the technical ones, there are other challenges that hinder the development of big data in medicine. Of these, the issue of data security certainly takes up the biggest space. Any health-cloud has to meet specific standards of safety and protection. There are two aspects to data security in cloud computing: first, the so-called security of personal data and secondly, the security of their own corporate data from unauthorized access or from loss. EU-wide or global standards need to be developed here.

 

The potential of Big Data

Big Data harbors enormous opportunities, which build upon one another in cascading fashion

1. Dashboard / transparency

Big data technologies combine different data sources and create transparency by sorting unstructured data. Thus, relationships are visualized, which improves the quality of medicine at the same time as reducing costs. This networking is not limited to hospital or physicians’ practice data. Rather, even patient-specific data from sensor watches such as pulse, blood pressure or blood sugar can be wirelessly transmitted and analyzed in real time. Without delay, abnormalities in the readings can be identified and corrective action taken in this way. Most applications of big data are currently at this stage.

2. CAD: Computer Assisted Detection

Computer-assisted detection (CAD) combines elements of artificial intelligence and of digital image processing with radiological imaging. The typical field of application is already tumor diagnosis, especially breast and lung diagnosis. Thus, CAD supports screening mammography, which has been employed for the early detection of breast cancer for many years. CAD, established mainly in the US and the Netherlands, serves diagnosticians as a second opinion to their own analysis. In lung cancer diagnostics, computed tomography has established itself as the gold standard, thanks to special 3-dimensional CAD. Here, a volumetric data set of up to 3,000 individual images is processed and analyzed. Nodules (outbreak, metastases and benign changes) can be detected from 1 mm in size. Thus, the strain on physicians is relieved and cognitive errors are reduced.

Big data-technologies go one step further. “Deep machine learning” can automate the evaluation of all image data. Deep learning arranges neural networks onto planes that use increasingly complex features, such as to recognize the content of an image. Thus, masses of data can be sorted into categories. These networks, consisting of electronic nerve cells and the connections between them, are not explicitly programmed but trained using examples. Yet, it is only recently that the technology has become capable of simulating really complicated networks on the computer. With vast amounts of data, they can be trained.

The fascinating results of Deep Machine Learning are known: The IBM computer Watson, a semantic search engine that captures question asked in natural language and finds appropriate facts and answers in a big-data database in a short time, has not only beaten the human champion of the quiz show Jeopardy!, but is now also making medical diagnoses. Now WATSON is learning how radiographs are diagnosed. The first step is to differentiate between normal and pathological findings. This is followed by the structured analysis of abnormal findings that lead to a diagnosis. This technology will fundamentally change the reporting of all image data in radiology, pathology, and all other medical fields. Although the "Google Brain" can already simulate around 1 million neurons and 1 billion connections (synapses), we are currently only at the very beginning of this exciting technological development.

3. Decision Support

Decision Support Systems (DSS) are software systems in medicine that are employed in hospital settings as instruments of knowledge management. They build on a knowledge base (of existing clinical pathways) maintained by experts and are able to autonomously reach conclusions, assessments and solutions of certain problems in complex treatment processes. They represent the next evolutionary stage of big data in medicine.

Expert systems are designed and implemented in such a way that they are integrated into existing hospital information systems in order to not only manage the information therein but also work independently with them. The need for this arises in situations where physicians have to rely on selecting and consultative end support in their daily medical decision-making processes, due to the increasing complexity of various treatment options. The set of rules for DSS systems is based on, e.g., guidelines adopted by professional bodies, so-called clinical pathways. Expert systems select and structure information along these pathways for specific treatment situations, thereby increasing the safety of patient treatment.

4. Analytics: Only "Data Mining turns Big Data into Smart Data"

After the capture, storage, visualization and analysis of data, big data allows further analysis as a next step, by using cohort data to correlate various therapies as “inputs” with various “outcomes”. What is crucial in this respect is the availability of validated data sets with well-documented patient histories. The right retrospective conclusions on the best therapeutic option can only be drawn with known outcomes, which require millions of records for statistical evaluation. The thus resulting correlative framework is enriched continuously through enrichment with new patient histories. The result is a self-optimizing learning system.

The therapy optimizations that are identified based on such analysis then find their way into DSS systems. Clinical pathways are constantly being adapted to the latest findings. Thus, it is possible to develop improved algorithms and modify existing clinical care pathways. Optimal treatment is identified for newly diagnosed patients based on all available data.

Simultaneously, the analysis provides evidence regarding cost-effectiveness, by correlating outcomes with the cost of therapies. This is the basis of population health.

5. Population Health

Population health is about the optimal use of financial resources for maximum health for a larger cohort. Building on networking, storage and analysis, big data analytics provides the instruments to control health care services with the aim of optimizing outcomes for an entire cohort. Lessons learnt in this way are fed back into the design of individual patient pathways, thus enabling a cost-effective use of medical therapies.

In Germany, population health evaluations are only just starting out. The "National Cohort" is a long-term study, funded by the Federal Ministry of Education and Research, which is running over a period of 20 to 30 years. 200,000 randomly selected participants aged 20-69 years from all over Germany will be medically examined and asked about lifestyle habits. In addition, blood samples are taken for all study participants and stored for later research projects in a centralized bio bank. The aim is to acquire profound knowledge in terms of prevention and early detection of typical widespread diseases.

In the US, however, population health studies are already being employed by insurers today. Cohort studies from Finland and the US have shown that the occurrence of type 2 diabetes in high-risk individuals (impaired glucose tolerance) could be lowered by more than half over an average period of three years through lifestyle interventions (e.g. reduction of excess weight, exercise). It will be crucial that, apart from costs, the “outcome” of treatment is adequately considered.

Summary

Big Data and Analytics has become a top issue in the healthcare industry. The trend in medicine is from reactive treatment towards predictive and preventive medicine. Through early intervention, disease is to be prevented, or at least treated early and in an individualized way. The paradigm shift from generalized to individual, personalized medicine will help patients and reduce costs in the healthcare system.

Big data and analytics can already point to impressive results in the medical field, but development is in its infancy. If it becomes possible to satisfactorily solve data protection issues in addition to technical challenges, broad societal acceptance of "big data and analytics in healthcare" can be expected.

Mathias I see a lot of Big Data technologies, and I think computer assisted methods to audit medical data will help. In the US....getting people skinny would be a huge win.

回复
Samir Abid

Motorsports race engineer | Making driving faster easier, not complex | Podcast host | Founder of Your Data Driven

7 年

The societal effects raise valid and fundamental questions for all stakeholders - for example, what value does (would) a Doctor then offer? Whilst we pursue the technological potential (ambition) the effect on people is key - afterall, people can significantly effect the quality of the data captured, and, are needed to identify the features (outcomes) of meaningful value. Our work delivering data science within Olympic and Professional sports teams shows often the challenge isn't (only) developing the technology systems, but also helping to reassure (and redefine the roles of) individuals. Whilst great for the organisation (and their customers) it can actually be extremely unsettling for people. We did one piece of work that improved the accuracy of a persons regular manual results analysis by an order of magnitude and reduced the time it took to them to do it from 4+ hours to less than 4 mins. Great (for the organisation) but is that person (resource) now needed? In this case yes, but soon I don't know.

回复
Bryan McGuinn

Marketing & Health Tech Enthusiast

7 年

Some great insights Mathias. I like how you structured the stages of development of Big Data in healthcare.

Manish Marotkar

Mendix ISV Business | INSEAD MBA | Economic Times India’s Young Leader

7 年

Very interesting and insightful Prof. Mathias, thanks for sharing.

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了