Digitalization of Legacy Datasets and Machine Learning Regression Yields Insights for Reservoir Property Prediction and Submarine-Fan Evolution: A Subsurface Example From the Lewis Shale, Wyoming

Thomas Martin; Jared Tadla; Zane Jobe

doi:10.2110/001c.36638

Martin, T., Tadla, J., & Jobe, Z. (2022). Digitalization of Legacy Datasets and Machine Learning Regression Yields Insights for Reservoir Property Prediction and Submarine-Fan Evolution: A Subsurface Example From the Lewis Shale, Wyoming. The Sedimentary Record, 20(1). https://doi.org/10.2110/001c.36638

Download all (8)

Figure 1. Dad Sandstone submarine fan deposits, Wyoming.
Download
Figure 2. Stratigraphic framework developed by Carvjal (2006) based on maximum flooding surfaces (MFS).
Download
Figure 3. Pairplot of kernel density estimates of the complete well-log dataset (including imputed data) used in the study with three contours separating the data into quartiles.
Download
Figure 4. Subset of Data from F028, depth in feet.
Download
Figure 5. Core image data methodology.
Download
Figure 6. Comparison between random and blind-well test-train splits (median values from three model iterations).
Download
Figure 7. A) XRF Cross plots comparing calcium (Ca) to silicon (Si), and sulfur (S) to iron (Fe), grouped into Maximum Flooding Surface roups defined by Carvajal (2006) (Fig. 2).
Download
Supplementary datasets
Download

View more stats

Abstract

Machine-learning algorithms have long aided in geologic property prediction from well-log data, but are primarily used to classify lithology, facies, formation, and rock types. However, more detailed properties (e.g., porosity, grain size) that are important for evaluating hydrocarbon exploration and development activities, as well as subsurface geothermal, CO₂ sequestration, and hydrological studies have not been a focus of machine-learning predictions. This study focuses on improving machine-learning regression-based workflows for quantitative geological property prediction (porosity, grain size, XRF geochemistry), using a robust dataset from the Dad Sandstone Member of the Lewis Shale in the Green River Basin, Wyoming.

Twelve slabbed cores collected from wells targeting turbiditic sandstones and mudstones of the Dad Sandstone member provide 1212.2 ft. of well-log and core data to test the efficacy of five machine-learning models, ranging in complexity from multivariate linear regression to deep neural networks. Our results demonstrate that gradient-boosted decision-tree models (e.g., CatBoost, XGBoost) are flexible in terms of input data completeness, do not require scaled data, and are reliably accurate, with the lowest or second lowest root mean squared error (RMSE) for every test. Deep neural networks, while used commonly for these applications, never achieved lowest error for any of the testing. We also utilize newly collected XRF geochemistry and grain-size data to constrain spatiotemporal sediment routing, sand-mud partitioning, and paleo-oceanographic redox conditions in the Green River Basin.

Test-train dataset splitting traditionally uses randomized inter-well data, but a blind well testing strategy is more applicable to most geoscience applications that aim to predict properties of new, unseen well locations. We find that using inter-well training datasets are more optimistic when applied to blind wells, with a median difference of 0.58 RMSE when predicting grain size in phi units. Using these data and results, we establish a baseline workflow for applying machine-learning regression algorithms to core-based reservoir properties from well-log and core-image data. We hope that our findings and open-source code and datasets released with this paper will serve as a baseline for further research to improve geological property prediction for sustainable earth-resource modeling.

INTRODUCTION

Well-log analysis, formation evaluation, and subsurface property modeling are an integral part of subsurface characterization for oil and gas exploration, carbon sequestration, geothermal development, mineral exploration, and water-resource characterization (Maries et al., 2017; Stumm & Como, 2017; Wallis et al., 2009; Williams & Lane, 1998). These workflows are typically performed on proprietary datasets using closed-source commercial software (e.g., Eriavbe & Okene, 2019). However, machine learning (ML) models and open-source datasets have led to major advances in the geosciences (Dramsch, 2020) due to their ease of use in high-level programming languages (e.g., Python) that have thorough documentation and community support. Developing reproducible workflows (e.g., reservoir property prediction) using open-source ML tools will allow researchers to investigate relationships and predictive performance on their own datasets. However, the large number of choices of ML models and methodologies (e.g., classification, regression) makes it difficult to select a particular model that will be effective for a specific use case (Raschka, 2018).

This study compares five open-source ML regression models for subsurface formation property prediction, utilizing well-log data and derived core-image statistics. We chose a regression methodology (predicting a value) instead of classification (predicting a label) because (1) numerous studies have explored classification (Bormann et al., 2020; Hall & Hall, 2017) and (2) regression tends to be a better choice for predicting continuous reservoir property values. We demonstrate this workflow using an open-source dataset focusing on the Dad Sandstone Member within the Lewis Shale in south central Wyoming, which is analogous to many submarine channel-fan deposits that host significant oil and gas reserves (Pyles & Slatt, 2007). While these data and models are from a specific sedimentary depositional environment, the findings and workflows are transferable to other environments (e.g., fluvio-deltaic systems, carbonate platforms) and datasets (e.g., borehole image-log, hyperspectral core images) as long as there are trusted training and testing datasets from those environments.

GEOLOGIC SETTING

This study examines Upper Cretaceous submarine fan deposits within the Greater Green River Basin (GGRB) in south central Wyoming (Fig. 1). The GGRB is an active hydrocarbon-producing basin with exploration activity dating back to the early 1950’s (Hettinger & Roberts, 2005). The cores in this study were retrieved from legacy vertical wells in several producing gas fields (Fig. 1; Asquith, 1975; Hettinger & Roberts, 2005); currently, operators utilize horizontal drilling techniques in the basin for hydrocarbon extraction activities (Levon & Mazza, 2020). Digitalizing these legacy vertical cores is imperative to provide better parameterization of reservoir models for future horizontal hydrocarbon development and/or carbon sequestration activities. The north, south, and east of the GGRB is bound by Precambrian basement thrust faults and Sevier fold and thrust belt structures to the west (Johnson & Andersen, 2009). The subsidence of the Mesozoic foreland basin in the northern part of the basin was driven by changes in the subduction angle of the Farallon plate (Johnson & Andersen, 2009; Yonkee & Weil, 2015), while subsidence in the southern part of the basin was mainly controlled by the nearby Uinta uplift (Johnson & Andersen, 2009; Liu & Nummedal, 2004). Today, the Great-Divide and Washakie sub-basins are separated by the Wamsutter Arch, but during the time of deposition, they formed a continuous deepwater basin (Olariu et al., 2012; Yonkee & Weil, 2015). The core presented in this study was collected from wells located east of the Rock Springs Uplift and south of the Wind River Range within the Great-Divide and Washakie sub-basins of the GGRB (Fig. 1).

Figure 1.Dad Sandstone submarine fan deposits, Wyoming.

A) Conceptualized block and depositional model, modified from Van Horn and Shannon (1989). B) Stratigraphic column modified from Wyoming State Geological Survey. C) Basin map with well locations. Color contours represent subsurface depth to the top of the Lewis Shale. Grey line denotes approximate location of Figure 2.

The Dad Sandstone Member of the Lewis Shale was deposited during the Late Cretaceous (Fig. 1B), and consists of deep-water siliciclastic deposits, interpreted to be slope deposits and submarine fan deposits, with both channelized and lobate architectures (Fig. 1A, Asquith, 1970; Cain, 1986; Carvajal & Steel, 2012; Koo et al., 2016; Pyles & Slatt, 2007; van Horn & Shannon, 1989; Winn et al., 1987). Pyles (2000), Sapardina (2012), and Koo (2015) provide detailed core descriptions and interpretations of depositional processes of the Dad Sandstone. Common event-bed types in the Dad Sandstone are turbidites (Bouma, 1962; Lowe, 1982) and hybrid event beds (Haughton et al., 2009; Talling et al., 2012). The deep-marine Dad Sandstone member is coeval to the Fox Hills shallow-marine shoreface (Olariu et al., 2012), and both have rapid progradation rates of ~50km/My (Carvajal & Steel, 2012) that progressively fill the basin from north to south (Fig. 2). The deposition of the linked Fox Hills-Lewis/Dad depositional system occurred over ~2.2 My (Pyles et al., 2011).

Figure 2.Stratigraphic framework developed by Carvjal (2006) based on maximum flooding surfaces (MFS).

Blue, green, and pink outlines represent the stratigraphic intervals of the 12 cores used in this study (Table 1). Blue outline denotes the MFS 2-4 interval for wells F042, E945, E952, E974, F041, S179, S821 (;eft). Green outline denotes the MFS 5-8 interval for wells F028, E934, E997 (central). Red outline denotes the MFS 12-14 interval for wells CEPO and PDRMT (right). Figure modified from cross section NS2 of Carvajal (2007); see location in Figure 1.

DATASET

Overview

The dataset used in this study was collected from cored intervals from twelve wells in the GGRB (Fig. 1, Table 1) that have targeted the Dad Sandstone. Ten of the wells are available at the USGS Core Research Center (Hicks & Adrian, 2009) and two of them are available at the Colorado School of Mines core repository (PDRMT & CEPO). These wells were chosen for their accurate depth markings, spatial and stratigraphic location, presence of associated measurements (e.g., porosity), and general core quality and condition; many other Dad/Lewis cores are available in the public domain, but do not satisfy these criteria. This dataset and associated code are open-source and available on the supplementary material and GitHub (Martin, 2022).

Table 1.Overview of the 12 cored wells that form the dataset for this study. Grain-size and XRF measurements collected every 0.1 and 0.5 feet, respectively, by this study. Porosity measurements obtained from published reports. All well names except PDRMT and CEPO are the identifiers used by the USGS Core Research Center.

	Well Name	Core Thickness (ft)	Grain Size Measurements (n)	Porosity Measurements (n)	XRF Points (n)	Carvajal MFS (Fig. 2)
1	F042	76.5	765	71	150	2 to 4
2	E945	61.2	612	34	122	2 to 4
3	E952	57.2	572	46	112	2 to 4
4	E974	61.3	613	41	123	2 to 4
5	F041	62.1	621	60	124	2 to 4
6	S179	52.2	522	52	105	2 to 4
7	S821	25.1	251	25	49	2 to 4
8	F028	116.1	1161	24	231	5 to 6
9	E934	41.1	411	0	83	5 to 8
10	E997	576.8	5768	153	1133	5 to 8
11	PDRMT	48.5	485	27	97	12 to 14
12	CEPO	34.1	341	26	67	12 to 14

	Total	1212.2	12122	559	2396

Well-log data

All wells in the study area have well-log data in the cored interval, ranging from having a full suite of logs (Fig. 3) to one well only having gamma-ray (S821). In cases of missing well-log data, the curve was imputed by using the scikit-learn Python package (Pedregosa et al., 2011) with a decision tree ML model using other wells that had all the logs desired (Fig. 3). This is not a replacement to collecting the full suite of well logs but allows for the ML models to be compared using the same datasets, as some models require matching and complete data. After the data was imputed, each well had the following curves: Caliper, Gamma Ray, Sonic, Spontaneous Potential, Density, Photoelectric Factor, Deep Resistivity, Density Porosity (Fig. 3). Each individual well-log curve was pre-processed on a per-well basis to normalize some of the differences between wells arising from different tools, vendors, vintages, and subsurface conditions. The well-log data were interpolated to a 0.1 Ft depth step basis to match the grain-size data we collected from the cores as part of this study. Further description of the original well-log data and the specific implementation of the log imputation and pre-processing is described in the GitHub repository (Martin, 2022).

Figure 3.Pairplot of kernel density estimates of the complete well-log dataset (including imputed data) used in the study with three contours separating the data into quartiles.

Abbreviations are: CAL = Caliper, GR = Gamma Ray, DT= Sonic, SP = Spontaneous Potential, DENS = Density, PE = Photoelectric Factor, RESD = Deep Resistivity, PHID = Density Porosity. This plot is included to explore linearity between different well log data types, as additional available well-log data is available but was not used due to strong linear correlation.. Strongly correlated input variables can have downstream effects on machine learning because of duplicate information.

Grain size

Most ML studies utilize subjective geologic classifications determined by geoscientists (e.g., lithofacies, systems tracts) (Hall & Hall, 2017). Our goal was to minimize the interpretive error in lithologic description, and thus grain size was determined by the authors using a grain size card and physical inspection of the core itself (Compton, 2016). While there is still human error and bias in determining grain size visually (see discussion in Jobe et al., 2021), it is more objective than interpreting stratigraphic facies or other more subjective/qualitative classifications. The grain-size data were collected using a categorical guide (e.g., upper-fine, Wentworth, 1922) and digitized and converted to the phi scale to have a linear, numeric scale (Krumbein, 1938); finally, these data were interpolated onto a 0.1 Ft scale (Fig. 4).

Figure 4.Subset of Data from F028, depth in feet.

From right to left; RGB (Red-Green-Blue) Core photo; core description modified from Koo (2015); grain size (in Phi); Gamma ray (In API); porosity (percentage), Titanium (Ti) from XRF.

XRF Geochemistry

X-Ray Fluorescence (XRF) is a non-destructive measurement technique commonly used on core material to obtain efficient and accurate elemental composition (Young et al., 2016). For this study, we collected data using a Bruker Handheld Tracer 5G portable XRF every 0.5Ft on the core, using a 50kV energy level and 90 second collection time. Comparing results of the standards used before and after every day of data collection, most differences were below measurement error and therefore no corrections were needed. We used a helium purge to further enhance data quality of lighter elements such as magnesium, aluminum, and silicon. The five elements we will focus on for this study are aluminum (Al), titanium (Ti), silicon (Si), calcium (Ca), and magnesium (Mg). These five elements were chosen due to relatively higher proportion of total weight percentage, and use as a detrital indicator (e.g. Al, Ti, Si ratios). The entire elemental suite is available in the online supplementary material.

Porosity

Porosity measurements from core plugs were taken from legacy scanned PDF reports available on Wyoming Oil and Gas Conservation Commission and USGS Core Research Center websites. These data were not processed in any way to attempt to normalize for differing methodologies and standards between various laboratories, The largest potential differences are different laboratories, improved methodology with time, and the specific calculation methodology of porosity. The depths of porosity measurements were interpolated to the most reasonable decifoot (e.g., 11-12 Ft. became 11.5 feet).

Image Data

Image data were collected by the authors and by USGS core research center staff on the cores in this study (table 1). Images have various core-tray sizes, amount of core, lighting conditions, and resolution. Due to this variability, we did not employ an automated ML model to trim, edit, and stack core images into a depth registered core column (e.g., Meyer et al., 2020), but rather manually performed this task. To reduce errors from shadows and edge effects, we cropped the middle 60% of the core image for the analysis (Fig. 5). After the cropped image was manually depth registered, the standard (RGB format) images were converted to Hue-Saturation-Value values (HSV, Joblove & Greenberg, 1978; Smith, 1978) using the scikit-image package (Van Der Walt et al., 2014). This transformation reduces the differences caused due to lighting, shadows, and camera type/setup. We tested using Red-Green-Blue (RGB) channels from the original image as input features rather than HSV, and RGB did not improve results in our testing because the RGB channels are highly correlated to the value channel. The HSV values for each discretized depth step (0.1 ft, Fig. 5) are then further reduced from the entire image to median and mode of H, S, and V values, along with the inter-quartile range of saturation and value. This compresses the amount of data for each depth step by 2-3 orders of magnitude compared to full image data; similar transformations were used by Martin et al. (2021) for core image data, and in both cases allow for efficient model runtime and use of computational infrastructure.

Figure 5.Core image data methodology.

The RGB image was cropped to 60% of its original pixel width and converted to HSV channels, with a plot shown for each channel. Core image from F028, 7758 to 7760 Ft. depth. Colorbar is equal for H (hue), S (saturation), and V (value) channels. Line plot (column 6) shows the median Value (V) for every 0.1 Ft depth interval.

Core to Log offset

For studies that use both core and well-log derived properties for analysis, typically a core-to-log depth shift is performed to correct wireline stretch and core loss/breakage during drilling operations (Fontana et al., 2010; Jeong et al., 2020). In this study, we compared 3 different strategies: no offset, a qualitative offset, and a quantitative offset. The no offset is a base case, where no depth shifting was done to either dataset. The qualitative depth shift was done by visually inspecting log patterns, and matching patterns. The quantitative offset utilizes a simple linear regression method to compare gamma-ray log and the grain size to further fine tune the qualitative offset to the nearest 0.1Ft. This linear regression methodology assumes that mudstones generally have a higher gamma-ray value compared to sandstones and testing shows that this method matches these well-log peaks to core-derived grain size quite well.