登录查看更多内容

Machine Learning, Geoscience, and Black Boxes

Scotty Salamoff

Geoscience Product Champion | Geoscience SME | Advocate for Responsible and Intelligent Use of AI in Geoscience

发布日期: 2021年8月31日

When a plane crashes, the black box is considered the holy grail by investigators, containing most of the important and relevant information regarding the event. However, in the context of Deep Learning & Machine Learning, black boxes are the opposite. Tightly locked, opaque, and somehow all-knowing, these types of black boxes regularly generate results that are used by the energy industry to make very expensive decisions, often without any sort of quantitative quality control (QC) of results or complete understanding of how exactly the black box generated them.

It doesn’t take an advanced degree to understand that applying results from these “magic” processes to actual drilling decisions is insanely dangerous for several reasons. The most obvious being that the network used in the black box was likely NOT trained on the data you are feeding it. Basic parameterization exposed to geoscientists in these tools give the illusion of network control without really ceding any significant control at all. Let’s not forget processing times can be on the order of days or weeks before you can supply feedback to the network…at which point you will need to run it again from scratch. Many of these black box tools and processes require you to generate a full result before modifying weights, biases, or the network training set, making them inefficient for long-term adaptation. Thus, we introduce the concept of Interactive Machine Learning (Interactive ML), wherein the geoscientist remains in the figurative driver’s seat and steers the network as it learns - the adjustment and optimization of training set information provides immediate feedback to the network, which adjusts weights and biases accordingly in real-time. Essentially, we remove the shrouding veil of the black box and expose the Deep Learning tools to the user up-front in a familiar, geoscientific context.

Fig. Conceptual representation of interactive Machine Learning. (istockphoto.com)

There are two types of approaches to deep learning: the overwhelmingly popular approach right now is what I refer to as the “black box”. The black box is a Deep Learning network that has been trained on unknown data and is offered as a sort of processing step - essentially telling the geoscientist to “put your seismic data in here, trust us, and we will create an answer for you”. I emphasize the “trust us” because that’s exactly what you’re doing with this approach - trusting a network that has been trained on data that you have not seen and therefore have no control over, created with a set of unknown biases - to work as you intend on your data. Given the inherent variability in seismic data quality due to vintage, geologic environment, or both, it is unreasonable to assume a network trained in one part (or parts) of the world by others with varying levels of skills and expertise will work where you need it to. These datasets may contain faulty labels which could translate to erroneous predictions within your area of interest. In fact, it can be quite dangerous to blindly trust the outputs of tools using a black box approach because the learning process is hidden from the expert (you).

The second type of approach is the Interactive ML approach. This approach, as mentioned earlier, removes the black box effect, and exposes the major controls for the deep learning network to the user. The interactive approach is a regional, data-specific approach that focuses on creating the best learning model for the data you are working with. The idea behind the interactive deep learning approach is to start with a clean, blank network and train it while labeling geologic features on seismic data.

领英推荐

You can Start Making a Difference in Every Field with…

aNumak & Company ? 2 年前

Text Mining and Sentiment Analysis for Business…

Maxwell E. Uduafemhe, Ph.D., CDA. 7 个月前

Deep Learning for High Resolution Images

Deep Block 1 年前

Fig. Two very different approaches to machine learning within the geosciences.

This labeling process differs from the interpretation process that we’re used to in one major way - labels should only be placed precisely where the features of interest are visible, and never where our geoscience minds want to extend or propagate them. Take for example the image below, which perfectly highlights the difference between a label set and an interpretation using an obvious fault interrupted by a thick shale sequence (NW Shelf Australia; data courtesy of Australian Government). While the interpretation of this feature would include the shale section itself, the correct labeling strategy for adding meaningful information to the training set would be to label the fault where it is clearly imaged and omit the shale section at the beginning of the training cycle. The reason for this is to keep our initial training set as clean and uncorrupted as possible - the network will predict the presence of the fault through this area, after which the prediction may (or may not) be reinforced by the geoscientist placing a label where the inference is.

Fig. The difference between labeling a fault for machine learning (left) and providing a fault interpretation (right). (Bonaventure 3D, NW Shelf Australia; courtesy of Australian Government and Bluware Corp.)

It is this simple application of live, iterative reinforcement learning that makes the Interactive ML approach unique. Interpreters and oilfinders are no longer restricted to what exists inside the black box. By exposing the figurative (and literal) knobs and buttons of the network to the geoscientist, an interactive deep learning product can be generated in a fraction of the time it takes more traditional black box approaches days or even weeks to process. Creating the training set interactively allows the geoscientist to exert control over what the network is looking for and where in the 3D seismic volume it looks for it. By modifying and building on the initial label set, the geoscientist is simultaneously perfecting the input information and generating a mental image of what the earth model will look like. Generating output objects from the trained network brings you to the doorstep of static geomodelling; because the interactive approach makes deep learning tools accessible to geoscientists in a context that they are familiar with. The time it takes to go from an uninterpreted 3D seismic volume to a working, coherent earth model has been cut from months to days…or in some case hours. If you wouldn’t trust a black box to put a spud location on a map, you shouldn’t trust one to provide “interpretations” that will ultimately carry heavy financial and operational risks. AI Won’t replace seismic interpreters, but seismic interpreters who use AI will replace those who do not. Take control of your deep learning tools by applying the interactive approach and stop putting all your faith in black boxes - we’re scientists, not airplanes.

Katya Casey

Value Network at U3 Explore | Founder and Entrepreneur

3 年

Just like any predictive model in geoscience - without an exact match to the local geology: the outcome of the network used in the black box falls short because it is always "NOT trained on the data you are feeding it"

1 次回应

Michele Isernia

Driving innovation in Energy

3 年

An ML “black box” approach is successful in many domains. But for subsurface interpretation where the gap between truth and the interpretation of the data representing the truth is still large, where the mistakes have a very high cost and high risk, the “AI augmented interpretation” guided and validated by the experts, is probably the fastest, safest and most accurate option at this time (assuming the geoscientist is willing to re-learn from their symbiosis with AI). Your approach can also be a perfect labeling front end to the best black box solutions (and they should pay attention to that part). Start applying IAI to a broader and more complex set of subsurface features, ideally on pre-stack data and multiple azimuth sets as this will make the difference with other approaches quite obvious, and will broaden the business impact.

1 次回应

查看更多评论

要查看或添加评论，请登录

Scotty Salamoff的更多文章

Is Ethical AI Possible?

2024年9月3日

Is Ethical AI Possible?

Artificial Intelligence (AI) is permeating every aspect of our lives, yet little consideration has been given to the…

2 条评论
Data-Centric, Interactive Deep Learning for Complex Geological Features: A Case Study

2023年5月19日

Data-Centric, Interactive Deep Learning for Complex Geological Features: A Case Study

Introduction Detailed interpretation of complex facies intervals within high-resolution 3D seismic data is a tedious…
Understanding the “I”: Artificial Intelligence and its Root in Human Neurology

2018年8月24日

Understanding the “I”: Artificial Intelligence and its Root in Human Neurology

Scotty Salamoff Geophysical Advisor | Data Scientist Actus Veritas Geoscience NOTE: The terms “AI”, “Neural Network” or…

1 条评论
How Deep is Too Deep? Reflections and Observations of how AI and Machine Learning are Perceived from AAPG ACE 2018 Annual Convention

2018年5月29日

How Deep is Too Deep? Reflections and Observations of how AI and Machine Learning are Perceived from AAPG ACE 2018 Annual Convention

May 29, 2018 Scotty Salamoff, Chief Geophysicist at GeoTerra Technologies, LLC Having just returned from a rather…

9 条评论
A Monster in the Closet? AI, Machine Learning, and Neural Networks in Seismic Interpretation

2018年5月11日

A Monster in the Closet? AI, Machine Learning, and Neural Networks in Seismic Interpretation

May 11, 2018 Scotty Salamoff, Chief Geophysicist at GeoTerra Technologies, LLC What mental images or real emotions are…

12 条评论

See all articles

Machine Learning, Geoscience, and Black Boxes

Scotty Salamoff

Geoscience Product Champion | Geoscience SME | Advocate for Responsible and Intelligent Use of AI in Geoscience

领英推荐

Scotty Salamoff的更多文章

社区洞察

其他会员也浏览了

From Data Mining to the Era of LLMs: A Journey

Deep Learning: Earth Science's Crystal Ball

Advanced Plant Disease Recognition: A Comprehensive Deep Learning Methodology

3D Fractal Dimension

Synthesizing Geospatial Data with A Simple NoGAN Technique

Advanced Text Mining with OpenAI

Segmentation of Water Bodies Using Satellite Images and Deep Learning

Semantic Segmentation (UNET) in QGIS

How to build a GAN from scratch with library, PyTorch

OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset

领英推荐

Scotty Salamoff的更多文章

Is Ethical AI Possible?

Data-Centric, Interactive Deep Learning for Complex Geological Features: A Case Study

Understanding the “I”: Artificial Intelligence and its Root in Human Neurology

How Deep is Too Deep? Reflections and Observations of how AI and Machine Learning are Perceived from AAPG ACE 2018 Annual Convention

A Monster in the Closet? AI, Machine Learning, and Neural Networks in Seismic Interpretation

社区洞察

其他会员也浏览了

From Data Mining to the Era of LLMs: A Journey

Deep Learning: Earth Science's Crystal Ball

Advanced Plant Disease Recognition: A Comprehensive Deep Learning Methodology

3D Fractal Dimension

Synthesizing Geospatial Data with A Simple NoGAN Technique

Advanced Text Mining with OpenAI

Segmentation of Water Bodies Using Satellite Images and Deep Learning

Semantic Segmentation (UNET) in QGIS

How to build a GAN from scratch with library, PyTorch

OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset