EXPLAINABLE ARTIFICIAL INTELLIGENCE (XAI) - ONE OF THE MAIN CHARACTERISTICS OF PETROLEUM DATA ANALYTICS (PDA); Section -1
Shahab D. Mohaghegh
Professor at West Virginia University and President of Intelligent Solutions, Inc.
Outline
Introduction (Section -1)
Explaining Traditional Engineering Models (Section -1)
Explaining Models of Physics Developed by AI & Machine Learning (XAI) (Section -2)
Part 1 of XAI: Key Performance Indicators (KPI) (Section -2)
Part 2 of XAI: Sensitivity Analysis (Section -2)
Step One: Single Parameter Sensitivity Analysis (Section -2)
Step Two: Double Parameters Sensitivity Analysis (Section -2)
Step Three: Multiple Parameters Sensitivity Analysis (Section -2)
Part 3 of XAI: Type Curves (Section -3)
Explainable AI Model for Unconventional Reservoirs (Shale Analytics) (Section -3)
Explainable AI Model for Conventional Reservoirs (Top-Down Modeling) (Section -3)
History of Explainable AI (XAI) in the Petroleum Data Analytics (Section -4)
Conclusions (Section -4)
Introduction
Petroleum Data Analytics is a solid Engineering Application of Data Science in Petroleum Engineering related problems. The Engineering Application of Data Science is defined as use of Artificial Intelligence and Machine Learning to model physical phenomena purely based on facts (field measurements, data). The main objective of this technology is the complete avoidance of assumptions, simplifications, preconceived notions, and biases.
One of the major characteristics of Petroleum Data Analytics is its incorporation of Explainable Artificial Intelligence (XAI). While using actual field measurements as the main building blocks of modeling physical phenomena, Petroleum Data Analytics incorporates several types of Machine Learning Algorithms including artificial neural networks, fuzzy set theory, and evolutionary computing. Predictive models of Petroleum Data Analytics (data-driven predictive models) are not represented through unexplainable “Black Boxâ€. Predictive models of Petroleum Data Analytics are reasonably explainable.
In early 1990s, when Artificial Intelligence & Machine Learning started to be used to solve engineering related problems, engineers and scientists started asking how this technology achieves its predictive objectives. What recently is being addressed as Explainable AI (XAI) for the non-engineering application of AI and Machine Learning, is not new in the context of engineering application of this technology. The main reason behind the fact that engineering application of AI and Machine learning to a large extend is quite explainable has to do with the historical problems of the application of traditional statistics in solving engineering related problems.
While traditional statistics’ main solutions are about identification of “correlations†in the data, engineers and scientists were always interested in “causations†that are capable of explaining “correlationsâ€. This has always been one of the main problems associated with the use of traditional statistics to solve problems. There are still many data-driven solutions that are referred to as “Black Boxâ€. Engineering application of AI and Machine Learning does not generate “Black Box†solutions.
Questions by engineers and scientists gave rise to research and development efforts in early 2000 and resulted in what today is called Explainable AI (XAI). In this article history of what today is call Explainable AI (XAI) is shown through 7 technical papers in the oil and gas industry that were published between 2001 to 2010.
Historically, prior to the development of Artificial Intelligence and Machine Learning, traditional statistics was used to analyze data. The major role of traditional statistics is to specify hypotheses that can fit the collected data. Therefore, the key behind the traditional statistics is “correlation†while engineers and scientists have always been interested in “causationâ€. It is a well-established fact that “correlation†does not necessarily determine and/or represent “causationâ€. Figure 1 and Figure 2 are good examples of how “correlation†does not have anything to do with “causationâ€. These figures demonstrate how data collected for different variables can be correlated to each other while absolutely having nothing to do with one another. Interesting examples of such approaches (correlation versus causation) are equally true when Artificial Intelligence and Machine Learning have been used in the oil and gas industry by those that have no petroleum engineering domain expertise.
Poor applications of AI and Machine Learning in the oil and gas industry (which I have heard a very large number of them) are hardly ever exposed. One exposed example of application of AI and Machine Learning in the petroleum industry is when an operating company in Bakken, North Dakota intended to use AI experts to provide them with data analytics. Based on the collected data, the AI expert’s concluded that hydrocarbon production from shale wells in Bakken is controlled by the North Dakota weather[1]. Historically, many major operating companies as well as the largest service companies in the petroleum industry have been using AI experts with no domain expertise as their data analytics leadership. Some details about our industry’s approach to AI and Machine Learning have been covered in an SPE Petro Talk[2].
The objective of this article is to cover and illustrate Explainable AI (XAI) and demonstrate its initiation as part of Petroleum Data Analytics almost two decades ago. Different sections of this article will cover how models that are developed through Petroleum Data Analytics can explain the physics of the models that are generated using purely data-driven AI-based algorithms that do not use any data generated by mathematical equations.
[1] https://community.hitachivantara.com/s/article/update-data-blending--spurious-correlations-and-a-rainy-day
[2] SPE PetroTalk: Shahab Mohaghegh – AI and Machine Learning - https://www.youtube.com/watch?v=Simc0Kd4sPY
Explaining Traditional Engineering Models
It is a well-known fact that models of physical phenomena that are generated through mathematical equations can be explained. This is one of the main reasons behind the expectation of engineers and scientists that any potential model of the physical phenomena should be explainable. Explainability of the traditional models of the physical phenomena is achieved through the solutions of the mathematical equations that are used to build the models. Explanation of such models are achieved through the analytical (for reasonably simple mathematical equations) or numerical (for complex mathematical equations) solutions of the mathematical equations. Solutions of the mathematical equations provide the opportunities to get answers to almost any questions that might be asked from the model of the physical phenomena.
Solutions of the mathematical equations are used to explain why and how certain results are generated by the model. It allows examination and explanation of the influence and impact of all the involved parameters (variables) on each other and on the models results (output parameters). In general, this is the definition of engineering approach to problem solving. Therefore, results of the physical phenomena that are based on mathematical equations (no matter how simple or complex) can be explained in detail.
Figure 3 shows the Fetkovich “Type Curveâ€. Including early time (driving from transient flow equations) and late time curves (boundary dominated flow, Arp’s Decline Curves). Following are (Equation 1 and Equation 2) the equations that are used for this series of “Type Curvesâ€:
Dimensionless rate (qD) and time (tD) are used in Equation 1 and Equation 2 are defined as shown below (Equation 3 and Equation 4) in the context of well-testing domain:
It can be clearly seen that the curves in Figure 3 are very “well-behaved†(continuous, non-linear, certain shape that changes in a similar fashion from curve to curve). The reason behind the “well-behaved†characteristics of these curves is the mathematical equations that were used to generate them. All “Type Curves†that have been historically generated in the petroleum industry are very “well-behaved†because they all are developed using the solutions of the mathematical equations. Figure 4 demonstrates two more examples of such “well-behaved†“Type Curvesâ€.
This means that traditional, mathematics-based, models of the physical phenomena are “explainableâ€. Physical phenomena that are developed using mathematical equations are “explainable†through performing several types of analyses of the mathematical equations. Here are four ways of explaining physical phenomena models that are developed and solved using mathematical equations:
1. Identification of the influence of each parameter (variable) in the mathematical equation, also known as Key Performance Indicators (KPI),
2. Single-Parameter Sensitivity Analysis,
3. Multiple-Parameters Sensitivity Analysis, and
4. “Type Curve†Generation.
Performance of such analyses can provide the required explanation of any question that might be asked regarding why, and how certain outcomes are generated or certain forecasts and predictions are made based on the solutions of the mathematical equations that have been used to model the physical phenomena. Figure 3 and Figure 4 demonstrate examples of “Type Curves†that are generated through the solutions of certain mathematical equations.