Some Machine Learning Applications in Seismic Interpretation

“Big data” and “data analytics” are the buzzwords these days. The oil and gas industry has always had large volumes of data to acquire, process and interpret, and since the introduction of 3-D and 4-D seismic data acquisition, the handling of large quantities of data has only become more challenging. As our industry moved from large mainframe computers coupled with array processors to scalable multiprocessors for crunching large volumes of data, seismic software, data storage and visualization capabilities have been able to keep pace.

But, in the last decade, our industry has grappled not only with ever-larger volumes of data, but also with increased data heterogeneity. Fortunately, advancements in handling such large, heterogeneous, “big data” volumes have come along. Recent developments in data analytic capabilities applied in other industries hold significant promise for those of us working in hydrocarbon exploration and development.

“Data analytics” refers to a special class of analytical tools or methods that are used to study complex systems, many of which are not amenable to traditional analysis techniques such as multivariate statistics.

Deductive versus Inductive

To better understand traditional interpretation versus data analytic workflows, we need to distinguish two terms: “deductive” and “inductive” reasoning. Using logic or reason to form a conclusion or opinion about something is deductive, whereas using examples to reach a general conclusion about something is inductive.

Interpreters routinely use deductive reasoning, analyzing the data using principles of geology, physics and petrophysics. Examples might be as simple as constructing synthetics to tie a well log to seismic data, or as complex as defining the environment of deposition using pattern recognition and modern analogues. There are two limitations to this approach. The first is that, try as we will, we may not be able to understand the physical reasons why one area of a survey is more productive, or alternatively completes better, than another. The second is that we may simply not have enough time to carefully correlate multiple attribute volumes using principles of physics and geology.

In contrast, data analytics uses inductive reasoning to find patterns between multidimensional data volumes. Petrophysical analysis tells us that there is a theoretical basis for porosity to correlate with P-impedance; this is an example of deductive, theory-based reasoning. In contrast, if there is a statistically significant correlation between TOC and P-impedance for multiple wells in a specific play, and if we can successfully validate this correlation on new wells, we have an example of transductive (good for a limited number of data sets only) or inductive (good for most data sets) data-based reasoning. Often we do not know the reason behind a good correlation, but, given significant validation, we can use it as a statistically valid prediction tool. In other cases, the correlations identify a feature that can be explained by an already established theory. In still other cases, the correlations allow us to formulate a hypothesis based on physics or geology that, with further validation, can lead to a new theory.
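To make this concrete, here is a minimal sketch in Python of the validation step described above: fit a TOC versus P-impedance relationship on a handful of wells, then test it on wells withheld from the fit. All numbers are synthetic placeholders, not measurements from any real play.

```python
# Hedged sketch: synthetic per-well values stand in for real log-derived
# P-impedance and laboratory TOC measurements.
import numpy as np
from scipy import stats

p_imp = np.array([9800., 9400., 9100., 8800., 8500., 8300., 8100., 7900.])  # P-impedance
toc = np.array([1.2, 1.8, 2.1, 2.6, 3.0, 3.3, 3.6, 3.9])                    # TOC (wt. %)

train, test = slice(0, 6), slice(6, 8)   # hold out the last two wells for validation

# Fit a simple linear relationship on the training wells
slope, intercept, r, p_val, _ = stats.linregress(p_imp[train], toc[train])
print(f"training correlation r = {r:.2f}, p-value = {p_val:.4f}")

# Validate on the held-out wells: if predictions track measurements,
# the correlation can be used as a statistically based prediction tool.
toc_pred = slope * p_imp[test] + intercept
print("predicted TOC:", np.round(toc_pred, 2), "measured TOC:", toc[test])
```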

Supervised and Unsupervised Learning

Machine learning algorithms can be broken into supervised and unsupervised learning subsets. Supervised learning is perhaps the easier subset to understand. Here, the interpreter provides training data, or “labels,” to the algorithm in addition to multiple seismic attribute volumes. Common labels include the names of seismic facies delineated by interpreter-constructed polygons, or the assignment of voxels along a well bore to measured lithology, geomechanical behavior or fracture intensity. Key to supervised learning is selecting attributes that differentiate the feature of interest from the background geology.

As in cross plotting electric log properties, a shortcoming of supervised learning is that it will only search for explicitly defined features, such as limestone versus dolomite versus shale. If there is also anhydrite in the system, it may be misclassified into one of the defined classes. Common machine learning techniques include the following (a minimal classification sketch in Python follows this list):

  • Decision trees
  • Multilinear feedforward neural networks
  • Probabilistic neural networks
  • Support vector machines
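As an illustration of the supervised workflow, the sketch below trains one of the listed techniques, a decision tree, on synthetic voxel-by-attribute data carrying interpreter-style facies labels. The attribute names, class names and numbers are assumptions made purely for illustration.

```python
# Hedged sketch: each row is one voxel described by four attributes
# (coherence, GLCM entropy, envelope, reflector parallelism); the label is
# the facies name an interpreter would assign. All values are synthetic.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
n = 600
salt = rng.normal([0.3, 0.8, 0.2, 0.2], 0.05, size=(n, 4))   # salt-like attribute pattern
sand = rng.normal([0.9, 0.2, 0.7, 0.9], 0.05, size=(n, 4))   # conformal sand/shale pattern
X = np.vstack([salt, sand])
y = np.array(["salt"] * n + ["sand/shale"] * n)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)
clf = DecisionTreeClassifier(max_depth=3).fit(X_tr, y_tr)     # train on labeled voxels
print("hold-out accuracy:", clf.score(X_te, y_te))            # validate on unseen voxels
```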

Unsupervised learning is slightly more difficult to understand. Here, the training data are a random set of voxels drawn from the multiple attribute volumes themselves. The objective is to find patterns that in some measure represent the bulk of the data. A point of confusion is that most interpreters think of patterns as reflectivity patterns seen on vertical, horizontal or horizon slices; such structural and spectral patterns are what seismic attributes measure. In unsupervised learning, by contrast, the “patterns” are measured across multiple attribute volumes at a given voxel. For example, a salt dome might be represented by the four-dimensional attribute pattern of low coherence, high entropy, low envelope and low reflector parallelism, while conformal sand/shale reflectors might be represented by high coherence, low entropy, moderate to high envelope and high reflector parallelism.
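A minimal sketch of the unsupervised case, assuming k-means clustering (one of many possible algorithms) and the same four hypothetical attributes used in the example above: no labels are supplied, and the algorithm recovers the attribute “patterns” on its own.

```python
# Hedged sketch: synthetic attribute vectors (coherence, GLCM entropy,
# envelope, reflector parallelism) drawn from two underlying "facies";
# k-means is asked to find the patterns without being told what they are.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(1)
salt_like = rng.normal([0.3, 0.8, 0.2, 0.2], 0.05, size=(500, 4))
sand_like = rng.normal([0.9, 0.2, 0.7, 0.9], 0.05, size=(500, 4))
voxels = np.vstack([salt_like, sand_like])      # random voxels from the attribute volumes

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(voxels)
print("cluster centers (one attribute pattern per row):")
print(np.round(km.cluster_centers_, 2))         # should recover the two patterns above
```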

Not all the tools in the data analytics toolbox are new. For example, principal component analysis, self-organizing maps, fuzzy logic, support vector machines, neural networks and others have been used in interpretation for some 20 years, but in relatively focused applications, such as multi-attribute analysis along a picked horizon or multi-log analysis across a small set of wells. The major limitation has been the “big” part of our modern data. The advent of multicore desktop machines, graphics processing units and interpreter access to supercomputers previously limited to seismic imaging and flow simulation, along with advances in software development, now allows the analysis of large data volumes.

Humans Still Needed

A common misconception is that machine learning will replace human interpreters.

The most common use of decision tree-based machine learning is the horizon autopicker. Autopickers have been in use for 20 years, yet each horizon still needs to be examined and usually modified by a human interpreter.

First-break pickers for statics corrections have used neural networks for at least 10 years. Here, the human processor needs to quality control the results and add additional control (or corrections) where needed. The role of the interpreter will change from mundane picking to evaluating alternative hypotheses and assessing the results. It is a pity that, these days, we see and hear about expert knowledge being phased out through chosen or forced retirements due to the economic downturn from which we are still recovering. Can we somehow capture this expertise as part of a rule-based machine learning application? If so, data analytics applications on big data, where the machine learns from the human quality control and where the interpreter poses new hypotheses, are the future for our industry.

There are several ways of combining multiple attributes, with visualization in red-green-blue (RGB) color space, coupled with transparency, being one of the more powerful. Unfortunately, such a color display is limited to three attributes, or four with transparency. One of the methods commonly used to reduce many attributes to a few is principal component analysis, and a more recent one is independent component analysis. Both methods ‘churn’ the different attributes and yield one, two or three volumes that represent the maximum variation in the input attributes. Such analysis reduces the redundancy in the input attributes. We present the results of our investigation into the application of both methods to a seismic data volume from central Alberta, Canada.
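As a rough illustration of the RGB co-rendering idea, the sketch below scales three component slices to the 0-1 range and stacks them into the red, green and blue channels of a single image. The percentile clipping and the random arrays standing in for real PC or IC slices are assumptions for illustration only.

```python
# Hedged sketch: three 2-D component slices (placeholders for PC1-PC3 or
# IC1-IC3 along a stratal slice) blended into one RGB image.
import numpy as np
import matplotlib.pyplot as plt

def normalize(slice_2d):
    """Scale a 2-D slice to 0-1, clipping extreme values for display."""
    lo, hi = np.percentile(slice_2d, [2, 98])
    return np.clip((slice_2d - lo) / (hi - lo), 0.0, 1.0)

pc1, pc2, pc3 = (np.random.default_rng(i).normal(size=(200, 200)) for i in range(3))
rgb = np.dstack([normalize(pc1), normalize(pc2), normalize(pc3)])  # three attributes, one image

plt.imshow(rgb)
plt.title("RGB co-render of three components")
plt.show()
```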

Machine Learning Tools

Principal component analysis is a useful statistical technique that has found many applications, including image compression and pattern recognition in data of high dimensionality. We are familiar with the usual statistical measures like mean, standard deviation and variance, which are essentially one-dimensional. Such measures are calculated one attribute at a time on the assumption that each attribute is independent of the others. In reality, many of our attributes are coupled through the underlying geology, such that a fault may give rise to lateral changes in waveform, dip, peak frequency and amplitude. Less desirably, many of our attributes are coupled mathematically, such as alternative measures of coherence or a suite of closely spaced spectral components. The amount of attribute redundancy is measured by the covariance matrix. The first step in multi-attribute analysis is to subtract the mean of each attribute from the corresponding attribute volume. If the attributes have radically different units of measure, such as frequency measured in Hertz, envelope measured in millivolts and coherence without dimension, a Z-score normalization is also required. Mathematically, the number of linearly uncorrelated attributes is defined by the eigenvalues and eigenvectors of the covariance matrix. The first eigenvector is the linear combination that represents the most variability in the scaled attributes, and the corresponding first eigenvalue measures the amount of variability it represents. Commonly, each eigenvalue is normalized by the sum of all the eigenvalues, giving the percentage of the total variability represented.

By convention, the first step is to order the eigenvalues from highest to lowest. The eigenvector with the highest eigenvalue is the principal component of the data set (PC1); it is the direction of maximum variance in the data and represents the bulk of the information common to the input attributes. The eigenvector with the second-highest eigenvalue, called the second principal component (PC2), exhibits lower variance and is orthogonal to PC1. Together, PC1 and PC2 define the plane that best represents the spread of the data points. Similarly, the third principal component (PC3) is orthogonal to the plane defined by the first two principal components. Since seismic attributes are correlated through the underlying geology and the band limitations of the source wavelet, the first two or three principal components will almost always represent the clear majority of the data variability.
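The recipe above can be written out in a few lines. The sketch below Z-scores a set of synthetic attributes, forms the covariance matrix, takes its eigen-decomposition, orders the eigenvalues from largest to smallest, and reports the fraction of variability each principal component represents. The attribute names and numbers are assumptions, not real data.

```python
# Hedged sketch of the PCA recipe: synthetic, correlated "attributes" with
# very different units (peak frequency in Hz, dimensionless coherence,
# envelope in mV) stand in for real attribute volumes.
import numpy as np

rng = np.random.default_rng(0)
n_voxels = 5000
base = rng.normal(size=n_voxels)                      # common geologic "signal"
attrs = np.column_stack([
    30.0 + 5.0 * base + rng.normal(size=n_voxels),            # peak frequency (Hz)
    0.8 - 0.1 * base + 0.02 * rng.normal(size=n_voxels),      # coherence
    100.0 * base + 10.0 * rng.normal(size=n_voxels),          # envelope (mV)
])

z = (attrs - attrs.mean(axis=0)) / attrs.std(axis=0)  # mean removal + Z-score
cov = np.cov(z, rowvar=False)                         # covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)                # eigenvalues / eigenvectors

order = np.argsort(eigvals)[::-1]                     # highest eigenvalue first
eigvals, eigvecs = eigvals[order], eigvecs[:, order]
explained = eigvals / eigvals.sum()                   # fraction of variability per PC
print("variability represented by PC1, PC2, PC3:", np.round(explained, 3))

principal_components = z @ eigvecs                    # project voxels onto the PCs
```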

PCA is based on the statistical assumption that the input multivariate data exhibit a Gaussian distribution.

Independent component analysis is an elegant machine learning technique that separates multivariate data into independent components, assuming that the data going into the analysis have a non-Gaussian distribution. The other differences between ICA and PCA are that the independent components are not orthogonal and that their order is not defined by the algorithm: the first, second and third ICs are ranked by visual examination rather than being mathematically ordered in the process, as they are in PCA.

Given a combination of different seismic attributes as input data, ICA attempts to find the “unmixing” matrix that recovers a set of independent components; the problem is cast mathematically as a matrix equation and solved using higher-order statistics. We demonstrate its application to multi-attribute seismic data, wherein the resulting independent components exhibit better resolution and separation of the geologic features.
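A minimal sketch of this unmixing step, assuming scikit-learn's FastICA implementation (one of several available) and a small synthetic mixture in place of real attribute volumes:

```python
# Hedged sketch: two non-Gaussian sources mixed into three synthetic
# "attributes"; FastICA estimates the unmixing matrix and recovers the
# independent components.
import numpy as np
from sklearn.decomposition import FastICA

rng = np.random.default_rng(0)
sources = np.column_stack([rng.laplace(size=4000),
                           rng.uniform(-1.0, 1.0, size=4000)])
mixing = np.array([[1.0, 0.5],
                   [0.4, 1.2],
                   [0.8, -0.7]])
attrs = sources @ mixing.T                          # three mixed "attribute" traces

ica = FastICA(n_components=2, random_state=0)
independent_components = ica.fit_transform(attrs)   # recovered (unordered) ICs
print("estimated unmixing matrix shape:", ica.components_.shape)
```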

Some of the seismic attributes commonly used for multi-attribute analysis are as follows:

  • Discontinuity attributes: Coherence (see Geophysical Corner in the July 2018 EXPLORER) and curvature attributes are commonly used for interpreting faults, fractures, reef edges, channel edges and similar features. The most commonly used discontinuity attributes are coherence and the most-positive and most-negative curvatures, at both long and short wavelengths.
  • GLCM texture attributes (energy, entropy and homogeneity): GLCM (grey-level co-occurrence matrix) texture attributes are useful for seismic facies analysis. GLCM energy is a measure of textural uniformity in an image, GLCM entropy is a measure of the disorder or complexity of the image, and GLCM homogeneity is a measure of the overall smoothness of the image (a minimal computational sketch of these measures follows this list). More information on these attributes can be found in the Geophysical Corner in the November 2013 and April 2014 issues of the EXPLORER.
  • Spectral decomposition frequency attributes (spectral magnitude components, peak frequency and peak magnitude): Spectral decomposition refers to the transformation of seismic data into individual frequency components within the seismic bandwidth. The derived frequency data have found application in the interpretation of bed thickness and discontinuities and in distinguishing fluids in reservoirs. Spectral decomposition has been described extensively in the Geophysical Corner columns of December 2013; January, February, March and August 2014; March 2015; and May 2016.
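As referenced in the GLCM bullet above, here is a minimal sketch of computing the three texture measures with scikit-image on a small patch. The random patch, window size and grey-level count are assumptions for illustration; entropy is computed by hand because graycoprops does not return it directly.

```python
# Hedged sketch: GLCM energy, homogeneity and entropy for one 32 x 32 patch
# of amplitudes rescaled to 16 grey levels (a stand-in for a real analysis window).
import numpy as np
from skimage.feature import graycomatrix, graycoprops

rng = np.random.default_rng(0)
patch = rng.integers(0, 16, size=(32, 32), dtype=np.uint8)   # 16 grey levels

glcm = graycomatrix(patch, distances=[1], angles=[0],
                    levels=16, symmetric=True, normed=True)

energy = graycoprops(glcm, "energy")[0, 0]                   # textural uniformity
homogeneity = graycoprops(glcm, "homogeneity")[0, 0]         # overall smoothness
p = glcm[:, :, 0, 0]
entropy = -np.sum(p[p > 0] * np.log2(p[p > 0]))              # disorder / complexity

print(f"energy = {energy:.3f}, homogeneity = {homogeneity:.3f}, entropy = {entropy:.2f}")
```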

Applications

The dataset chosen for this exercise is from central Alberta, Canada. We focus on the Mannville channels that are filled with interbedded units of shale and sandstone. On the 3-D seismic volume, these channels show up at the level indicated with a yellow arrow in figure 1.

The input attributes used for the principal component and independent component multivariate analysis are multispectral coherence, GLCM energy, GLCM entropy, GLCM homogeneity, spectral magnitudes at 30, 40 and 50 Hertz, and coherent energy. The stratal slices at the level of the yellow arrow in figure 1 are shown for the PCA and ICA in figures 2 and 3, respectively. The first, second and third components from both methods are depicted, as well as their co-rendered RGB displays. Notice that the second, third and co-rendered displays show crisper definition of the paleochannels for the independent components than for the principal components.

In conclusion, while the data reduction achieved by both principal component and independent component analysis is powerful, the latter has an edge over the former. This smaller number of components can then be used as input to more sophisticated machine learning tools such as self-organizing maps and generative topographic mapping. We will discuss the applications of these machine learning tools in a future article.
