using neural networks and the markov chain approach for
TRANSCRIPT
Accepted Manuscript
Using neural networks and the Markov Chain approach for facies analysis andprediction from well logs in the Precipice Sandstone and Evergreen Formation, SuratBasin, Australia
Jianhua He, Andrew D. La Croix, Jiahao Wang, Wenlong Ding, J.R. Underschultz
PII: S0264-8172(18)30552-X
DOI: https://doi.org/10.1016/j.marpetgeo.2018.12.022
Reference: JMPG 3641
To appear in: Marine and Petroleum Geology
Received Date: 2 July 2018
Revised Date: 2 December 2018
Accepted Date: 10 December 2018
Please cite this article as: He, J., La Croix, A.D., Wang, J., Ding, W., Underschultz, J.R., Using neuralnetworks and the Markov Chain approach for facies analysis and prediction from well logs in thePrecipice Sandstone and Evergreen Formation, Surat Basin, Australia, Marine and Petroleum Geology(2019), doi: https://doi.org/10.1016/j.marpetgeo.2018.12.022.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service toour customers we are providing this early version of the manuscript. The manuscript will undergocopyediting, typesetting, and review of the resulting proof before it is published in its final form. Pleasenote that during the production process errors may be discovered which could affect the content, and alllegal disclaimers that apply to the journal pertain.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Using Neural Networks and the Markov Chain Approach for Facies Analysis and Prediction from Well Logs in the Precipice Sandstone and Evergreen Formation, Surat Basin, Australia
Jianhua He a,b,c****, Andrew D. La Croixc, Jiahao Wangc,d, Wenlong Ding a,b, J.R. Underschultze
a. School of Energy Resources,China University of Geosciences,Beijing 100083,China
b. Key Laboratory of Marine Reservoir Evolution and Hydrocarbon Enrichment Mechanism,Ministry of
Education,China University of Geosciences,Beijing 100083,China
c. Energy Initiative, University of Queensland, Brisbane, Australia 4072
d. Faculty of Earth Resource, China University of Geosciences, Wuhan 430074, China
e. Centre for Coal Seam Gas, Brisbane, University of Queensland, Australia 4072
* Corresponding author.
E-mail address: [email protected]
Postal address: School of Energy Resources,
China University of Geosciences, Beijing 100083, China
Tel.: +86 10 82320629
Fax: +86 10 82326850
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Abstract: Facies analysis is crucial for reservoir evaluation because the
distribution of facies has significant impact on reservoir properties. Artificial
Neural Networks (ANN) are a powerful way to use facies interpretations from
core to determine equivalent facies from wireline logs in uncored wells.
However, ANN do not incorporate information that relates to facies
successions. This has limited the ability to effectively use facies information
derived from logs alone in reservoir modelling, especially at the regional scale
where data is often sparse, clustered, or incomplete. In this study, based on
observations of 8 cored wells with a total thickness of ~2000 m, 20 core facies
were defined that range from 0.22 m to 11.56 m thick. Facies were based on
grain size, sedimentary structures, and ichnological characteristics; Each facies
corresponds to a distinct depositional sub- environment within the broader
context of a large nearshore to shallow marine system. It was essential that these
facies were incorporated into reservoir models to accurately map the
distribution of reservoir and seal geobodies for CO2 storage assessment in the
Surat Basin, Australia. However, core data are few and far between in the Surat
Basin. To use core-defined facies in the absence of core, six wireline log
parameters – gamma ray, density, sonic, neutron, photoelectric factor, and deep
resistivity were plotted in multidimensional space and examined using Linear
Discriminator Analysis. Combined with model recognition and Fisher
Canonical Discriminance, the 20 core facies were simplified into 10
representative wireline log facies (WLF) with unique petrophysical parameters.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
We then used the Markov Chains Approach (MCA) to determine the
significance of vertical facies transitions, which supported the interpretation that
facies group into 5 distinct associations: (1) channel-levee complex; (2) lower
delta plain; (3) subaqueous delta; (4) shoreface and; (5) tidal flats and channels.
Based on the facies analysis and statistical classification, Multilayer Perceptron
Classifier, a type of neural network method was applied using a training set of
three cored wells that had all 6 wireline log data under the monitor of the facies
successions determined from the MCA. Results show that the accuracy of WLF
prediction ranges from 66% to 99% (ca. 83%). The accuracy of facies
recognition decreased step wise with a decreasing number of logs as input data,
such that when only gamma ray, density, deep resistivity, and sonic were used
to train neural networks the accuracy dropped to between 45- 98% (ca. 67%),
depending upon the facies. This was considered the lowest acceptable threshold
of accuracy for facies determination for input into reservoir models for carbon
capture and storage. The results of this study show that sedimentary facies can
be accurately predicted for uncored intervals in the Precipice Sandstone and
Evergreen Formation to improve facies mapping and static reservoir modelling.
Additionally, wireline log facies are helpful for interpreting Lower Jurassic
stratigraphy, depositional setting, and basin evolution in the Mesozoic of
Eastern Australia.
Keywords: Surat Basin; Precipice Sandstone; Evergreen Formation; Facies Analysis; Neural Networks; Markov Chain Analysis.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
1. Introduction
The Lower Jurassic Precipice Sandstone and Evergreen Formation are an
important prospective reservoir and seal target for potential future carbon
capture and storage (CCS) in the Surat Basin (Bradshaw, 2010). The geological
context in terms of depositional environment is poorly constrained especially in
the basin centre because of the fact that the strata are not thought to be
hydrocarbon bearing and therefore data is sparse. However, the basin centre is
also where carbon storage potential is highest. Depositional interpretations and
facies analysis have not been examined in detail, and this hinders the predictive
accuracy of reservoir performance and sealing potential. The level of detail of
the interpretation of depositional-facies and facies associations in the Precipice-
Evergreen succession is in need of an overhaul, and this will help establish more
realistic reservoir models for carbon-geostorage-site evaluation (Hodgkinson,
2013). This is because the sedimentary fabric, as well as grain size of different
facies will influence the hydraulic behaviour of strata in dynamic reservoir
simulation of CO2 injection. Capturing this detail may reduce uncertainty in the
prediction of plume migration and sealing potential of the top seal.
Sedimentary facies analysis is used to classify and map sedimentary
bodies, each which formed under unique depositional conditions. Facies are
typically assigned based on their physical or paleontogical characteristics
(Middleton, 1978; Dalrymple, 2010). However, facies differ in their intrinsic
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
textures and rock properties and this can greatly affect hydraulic and
mechanical properties (Chang et al., 2000, 2002; Burton and Wood, 2013; La
Croix et al., 2013; Baniak, 2014; He et al., 2016; La Croix et al., 2017).
Identification of sedimentary facies is based on both qualitative and quantitative
parameters, including mineral composition, texture and fabric, stratification,
sedimentary structures, bioturbation, grain-size distribution, and can be applied
in outcrop or core (Borer and Harris, 1991; Dill et al., 2005; Khalifa, 2005; Qi
and Carr, 2006; Qing and Nimegeers, 2008). However, geological datasets are
commonly limited in breath (e.g. outcrop) or due to cost (e.g. core), and thus
establishing facies relationships with regional perspective with limited control
data is often a challenge. Therefore, facies distributions based on well log data
are highly sought after (Berteig et al., 1985; Li and Anderson, 2006; Dubois et
al., 2007), as they represent the most abundant and widespread dataset in
subsurface studies. The prediction of facies from conventional wireline logs has
the potential to extend observations from the core scale (centimetres to metres)
to the well scale (meters or tens of metres), and ultimately to the regional scale
(> kilometers), allowing facies to be mapped. Nonetheless, the process of
quantitatively determining facies from well logs is currently being refined such
that it can be applied in a variety of sedimentary basins and in deposits from
different depositional environments (Tang et al., 2011; Wang and Timothy,
2013).
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
High precision sedimentary facies prediction is absolutely essential to
build large-scale, geologically reasonable, static reservoir models. Past studies
have focused on using statistical methods to analysze facies from well logs such
as discriminant analysis (Sakurai and Melvin, 1988; Avseth et al., 2001; Tang et
al., 2004), naïve Bayes classifier (Li and Anderson, 2006; He et al., 2016),
fuzzy logic (Cuddy, 2000; Saggaf and Nebrija, 2003), and support vector
machines (EI-Sebakhy et al., 2010; Wang et al., 2014; Deng et al., 2017). The
past decade has also seen successful application of Artificial Neural Networks
(ANN) (Derek et al., 1990; Wong et al., 1995; Siripitayananon et al., 2001;
Bhatt and Helle, 2002; Wang and Timothy, 2012) in the prediction of sandstone
and carbonate lithofacies because of the ability to unravel non-linear
relationships, quantify learning from training data, and work in conjunction with
other kinds of artificial intelligence (Bohling and Dubois 2003; Kordon 2010).
Multilayer perceptron classifier (MLPC) is a classifier based on feedforward
ANNs. MLPC is not a unique classifier for pattern recognition; however, the
merits of MLPC result in its broad application within various scientific and
academic fields (Micheli- Tzanakou 2000). MLPC is very flexible in the design
of learning algorithms, determining network architecture, selecting sensitive
input variables, and adapting codes for special issues (Wang and Carr, 2012b,
c). MLPC is a useful research tool because of its ability to solve complex
nonlinear problems stochastically, especially in shale lithofacies application
(Wang and Carr, 2012, 2013).
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Most previous facies from wireline logs methods have determined facies at
each data point in the well, but failed to account for vertical continuity in the
facies profile (Lindberg and Grana, 2015). Each sample in the well log was
recognized independently from the adjacent samples. Therefore, unrealistic
facies successions tend to occur in facies profiles determined this way. Markov
Chain Analysis (MCA) has long been applied to determine whether the
occurrence facies in a stratigraphic succession are dependent on the underlying
facies (Gingerich, 1969; Le Roux, 1994; Xu and Maccarthy, 1998; Bohling and
Dubois, 2003). MCA results reveal the presence of preferred vertical
occurrences of facies in a sedimentary succession and therefore, can serve as
independent evidence to support interpretations of facies associations (Miall,
1973; Powers and Easterling, 1982; Wells et al., 1989; Carle et al., 1999). This
improves facies associations and facies succession prediction in complex and
variable sedimentary systems (Weissmann, 2005). To use the most effective and
reliable facies prediction method and include information about the vertical
relationships between facies, MCA are applied in this study.
The main objectives of this paper are to (1) use detailed interpretations of
the sedimentary-facies and facies classification to identify electrofacies from
conventional well logs and relate these to facies observed in core; (2) apply
statistical methods to predict and analyse electrofacies based on MLPC and
MCA; and, (3) use the electrofacies and electrofacies associations to map the
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
distribution of sedimentary environments in a small case-study area to provide
guidance for carbon capture and storage.
2. Geological setting
The Surat Basin is a large Early Jurassic to Early Cretaceous intra-cratonic
basin in eastern Australia and covers some 327, 000 km2 of Queensland and
New South Wales, Australia, from latitudes 25° -33° S, and from longitudes
147° to 152° E (Fig. 1A). The basin is filled with ~2500 m of clastic
sedimentary rocks and coal. The Eromanga and Clarence-Moreton basins (Fig.
1) are broadly time equivalent to the Surat Basin, connected on the western and
southeastern parts of the basin, respectively (Power and Devine, 1970; Exon,
1976; Green et al., 1997).
The Surat Basin developed as a shallow platform depression following 30
Ma of uplift, exposure, and non-deposition that eroded the top of the Bowen and
Gunnedah basins (Exon, 1976; Green et al., 1997). The major stages of basin
development and driving processes are not well studied, however, thermal
subsidence (Korsch et al., 1989), dynamic platform tilting (Gallagher et al.,
1994; Korsch and Totterdell, 2009; Waschbusch et al., 2009), and intraplate
rifting (Fielding, 1996) have been suggested as possible mechanisms. The Surat
Basin has several important structural features, the most important of which is
the Mimosa Syncline that forms the north-south axis of the basin (Fig. 1; Exon,
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
1976; Fielding et al., 1990; Hoffmann et al., 2009; Fielding et al., 1990; Raza et
al., 2009).
The Surat Basin was filled with sediment in six major fining upward pulses
/ cycles (Exon and Burger, 1981). The first cycle encompasses the Precipice
Sandstone and Evergreen Formation (Fig. 2). The Precipice Sandstone
represents braided river deposits characterized by thick cross-bedding with only
a few thin muddy intervals that lack marine palynoflora (Sell et al., 1972; Exon,
1976; Exon and Burger, 1981; Martin, 1981). However recent evidence has
shown that the deposition of the Lower Precipice Sandstone also could be
influenced by marine processes due to flaser and wavy bedding, clay drapes,
rare marine trace fossils and ‘brackish’ palynomorphs (Martin et al., 2018). In
contrast, the Evergreen Formation has been interpreted to represent deposits laid
down in meandering rivers and freshwater lakes. Though the upper parts of the
Evergreen Formation, including the Westgrove Ironstone Member and the
Boxvale Sandstone Member, show possible indications of marine influence on
deposition (Mollan et al., 1972; Exon, 1976). The Precipice Sandstone has a
maximum thickness of ~ 150 m and dominantly consists of quartzose, fine- to
coarse-grained sandstone with common siltstone and shale laminae in the upper
part (Exon, 1976). The finer-grained Evergreen Formation can be as thick as
300 m in addition to being geographically more widespread than the Precipice
Sandstone. The unit is dominated by carbonaceous siltstone with some horizons
of sandstone, carbonaceous mudstone, oolitic ironstone, and coal (Green et al.,
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
1997). This study will help elucidate the palaeoenvironments recorded by the
Precipice Sandstone and Evergreen Formation, because at present these remain
poorly constrained in a regional context.
The study area is located in the northern part of the Surat Basin. It covers
more than 21, 000 km2 (Fig. 1). 2D seismic data coverage across most of study
area helps constrain the lateral distribution of the Precipice Sandstone beyond
well control. A total of 192 wells across the basin had stratigraphic tops picked
based on well log signatures tied to core. Additionally, there are 8 cored wells
located within or in close proximity to the study area. Therefore, the study area
is an ideal case-study in which to test our facies prediction and mapping
capabilities. We chose two important intervals within the study area to
showcase the prediction and mapping of facies from logs. These are the
lowstand systems tract- LST which is defined by the sequence stractigraphic
surface ‘J10’ and ‘TS1’, and the overlying transgressive surface defined by the
surface ‘TS1’ and ‘MFS1’ (Wang et al., 2018). These represent the main
reservoir intervals being investigated for CCS, and the overlying seal.
3. Data Set and Methods
3.1 Database
This study utilizes sedimentological and ichnological core observations in
addition to wireline log data. The cumulative cored section that this study is
based on is approximately 2000 m from 8 wells (Fig. 1C). Core facies were
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
defined based on their sedimentological and ichnological characteristics,
including grain size, the nature of bedding contacts, physical structures,
biogenic structures, bioturbation intensity and the distribution of burrowing.
Wireline logs that were used for facies identification and prediction included
gamma ray (GR), bulk density (DEN), compressional slowness (SONIC), deep
resistivity (LLD), neutron porosity (NEUTRON), and photoelectric factor
(PDPE) from 31 wells. The logs have a resolution of 0.15 m.
Dataset selection and calibration are important for obtaining successful
facies identification and prediction results. Several quality-assurance steps were
performed on wireline logs to improve the outputs including depth correction,
normalization, and the removal of outliers. Outliers were defined using the
following criteria (Wong et al., 1998; Tang et al., 2011): (1) Intervals with null
or missing values (core missing); (2) Intervals with obvious post-depositional
overprints (fractures observed in core or image logs; hot sandstone influenced
by hydrothermal fluids input); (3) Intervals characterized by caliper-indicated
washouts or bad-wellbore conditions; (4) Intervals with facies thickness less
than 1.0 m; (5) An interval surrounding the contact between different logging
facies to remove the “averaging” of properties between two adjacent facies.
After establishing a robust and representative training set, neural-network
training can be conducted on the key core wells.
3.2 Discriminant and Principal Component Analysis
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
To develop a more accurate facies prediction model, pre-processing of the
training dataset was undertaken to identify a representative set of wireline
logging facies (WLF) from the set of core facies (CF). The representative input
database is the most important factor for controlling the quality of classifiers,
because successful application of neural networks generally requires clear
petrophysical and geological classification (Wong et al., 1998).
Linear discriminant analysis (LDA) and principal component analysis
(PCA) are two methods we used to extract and determine the main components
of variation within our dataset. These improve the accuracy of classifiers by
removing non-distinctive and interrelated features (Jungmann et al. 2011). LDA
is a multivariate method for finding a linear combination of features that
characterizes or separates two or more classes of samples (i.e., grouping
samples into major categories). Fisher canonical discriminant (FCD) is a
slightly different discriminant method from LDA that does not make some of
the assumptions that LDA does, such as normally distributed classes or equal
class covariance. FCD was used to double check the CF classification results by
LDA. PCA was used to determine the sensitivity of different types of well logs
to WLFs and also to determine the contribution of each log type to facies
differentiation. All types of CF could not be identified using conventional well
logs alone because core scale observations of structure and texture do not
necessarily translate to petrophysical properties. Therefore, LDA, PCA, and
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
FCD methods were all applied in this study to collectively achieve a useful set
of WLF to be predicted in wells lacking core.
3.3 The Markov Chain Analysis
The Markov Chain Analysis (MCA) is a statistical means of determining
the probability of transition between two states that are not controlled by the
previous state – i.e., they are “memoryless” (Grinstead and Snell. 1997). The
transition probability (Markov Chain) method is a modified form of indicator
kriging. In geological application, the method assumes the type of sediment that
will be deposited in a stratigraphic succession depends solely upon what is
currently being deposited in the present environment and not on the rock types
deposited in past environments (Jones et al., 2002). For example, in a
prograding shoreface environment, a gradual upward-coarsening succession of
facies will occur if no significant depositional hiatus exists (Fig. 3). In terms of
a vertical facies distribution, the probability of the occurrence of one facies is
dependent on the nearest occurrence of another facies over a lag interval. The
Markov chain can be mathematically described as follows: There is a set of
facies, F = {F1, F2, … , Fr}, which pass sequentially from one to another in
succession. Let the indicator variable, Ij(x), for facies j be defined as Ij(x) = {1,
if j occurs at x; 0, otherwise}, where x is a location in one vertical facies
succession. In terms of indicator variables, the marginal (initial) probability, Pj,
can be defined by
pj=E{I j(x)} (Equation 1)
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
The joint probability, pjk(h), can be also define by
pjk(h)=E{I j(x)Ik(x+h)} (Equation 2)
where h represents the lag distance in one direction (Fig. 3).
Fundamentally, the joint probability is the purest bivariate measure of spatial
variability. However, the transition probability of facies 1 passing into facies 2,
tjk(h) is the most interpretable, defined here with respect to indicator variables as
tjk(h)=E{I j(x)Ik(x+h)}/E{I j(x)} (Equation 3)
It can also be defined in terms of a conditional probability as
tjk(h)=Pr(k at x+h | j at x} (Equation 4)
Probability law requires that the row sums of the transition probability
matrix,T(h), sum to one and that the column sums obey
∑j pjtjk(h)= pk (Equation 5)
In this study, the probability of each facies transitioning to another was
calculated using PAleontological Statistics Software (PAST Version 3.17;
Hammer, 1999). Vertical facies succession analysis used an interactive
algorithm for Embedded Markov Chains (EMC) (Davis, 1986) based on the
PAST platform. The algorithm calculates a transition count matrix, a transition
probability matrix, an independent trials probability matrix, and a difference
matrix. In the transition probability matrix, the self-transition curves start at a
probability of 1 (100%) and decrease with increasing lag distances, whereas the
off-diagonal curves start at a probability of 0.0 (0%) and increase with lag
distance (Fig. 3; Carle, 1999). In the difference matrix, high positive entries
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
serve to emphasize the Markov property by suggesting which transitions have
occurred with greater than random frequency. The Powers-Easterling method in
this paper was used to test the matrix as a whole for non-randomness. It
determines the significance level of each facies transition and produces
preferred facies trends. The Powers-Easterling method yields the chi-square
value, degrees of freedom and critical value (Powers and Easterling, 1982).
3.4 Multi-Layer Perceptron Classifiers
We used a Multi-Layer Perceptron Classifier (MLPC), a form of
feedforward ANN (Haykin, 1998), to take the conventional wireline log input
data and determine the most probable WLF. MLPC consists of multiple layers
of nodes. Each layer is fully connected to the next layer in the neural network.
Nodes in the input layer represent input data; in this case wireline log data
consisting of DEN, SONIC, LLD, GR, NEUTRON, and PDPE. All other nodes
map inputs to outputs by a linear combination of the inputs with the node’s
weights w and bias b and applying an activation function (Fig. 4). This can be
written in matrix form for the MLPC with k + 1 layers as follows:
y(x) = fk (…f2 (w2Tf1 (w1
Tx + b1) + b2)…+ bK) (Equation 6)
Nodes in recurrent layers use sigmoid (logistic) function:
izi eZf −+
=1
1)(
(Equation 7)
Nodes in the output layer use softmax function:
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
∑ =
= N
k
z
z
ik
i
e
ezf
1
)(
(Equation 8)
The recurrent layer included two parts. The first was the computed distance
from the input vector to the training vector. The second layer summed these
contributions for each class of inputs to produce a vector probability. The
number of nodes N in the output layer corresponds to the number of classes; in
this case the 10 different types of wireline logging facies. The MCA helped us
understand the relationship between different logging facies and helped limit the
probability of a facies determinations that were not supported by the transition
probability analysis.
3.5 Neural Network Performance Evaluation and Application
We cross-validated our neural network results to determine the degree to
which outcomes were consistent and reliable. This was done by withholding
wireline log data from the control set, log by log, and then assessing the ratio of
times that the neural network made correct facies predictions as determined by
core facies in the training set. In addition, to decrease uncertainty in our results,
a convergence error was calculated to test the accuracy of facies prediction by
MLPC (Eyi, 2012). The convergence error (ɛc) is defined as follows:
ɛc = t
1KMAX
1k
t
tε
KMAX
1ε
= ∑=
, ( )KMAX1 ε,...,εmaxε = (Equation 9)
ɛ = ji TT − (Equation 10)
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
The variable t is the number of lithofacies types. Solving this equation with
a grid size of 10X10 gives the number of state variables, KMAX, 100. The
number (i, j) under each lithofacies indicates the position of classes in the error
matrix. The value of Ti and Tj are equal to the code of the lithofacies in the
MLPC output. Small convergence error values represent higher prediction
accuracy.
After confirming adequate and accurate neural network results, prediction
was undertaken on 38 uncored wells across the northern portion of the study
area. Log facies were flagged according to confidence levels with different
prediction accuracy; yellow for high confidence (wells with 6 logs), green for
medium confidence (5 logs), and red for low confidence predictions (4 logs).
The proportion of each WLF occurring in the same stratigraphic interval was
calculated. We also calculated the dominant WLF occurring in each well. The
facies were then grouped into their corresponding associations, allocated to their
respective stratigraphic position in the succession and mapped with reference to
the shale content distribution of wells (calculated by using GR logs) and seismic
interpretation.
4. Results
4.1 Core-scale facies definition
From core observations, twenty CF were defined based on their
sedimentological and ichnological characteristics. The CF types included
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
conglomerates and breccias (Facies G1 and G2), sandstones (Facies S1, S2, S3,
S4, S5, and S6), mudstones (Facies M1, M2, M3, and M4), heterolithics (Facies
SM1, SM2, SM3, SM4, and SM5), as well as organic and miscellaneous facies
(Facies O1, O2 and O3; Table 1 and Fig. 5).
4.2 Wireline Log Facies Determination and Analysis
4.2.1 Wireline Log Facies Determination
The twenty CF were simplified into ten representative wireline log facies
(WLF) using the LDA method (Fig. 6A). Some CFs were not recognized
because they did not have discrete petrophysical properties that allowed their
differentiation, such as G1, G2, SM5 and O2. Other CFs were grouped together
into a single WLF (Fig. 6A; Table 3) because of their similar log response
characteristics (Table 3). For example, CFs S5 and S6, both have high neutron
porosity (> 35%) and sonic values (> 98 us/f), moderate GR (avg. 106.6 API)
and PDPE (avg. 2.98 B/E) values, and low LLD (< 4 ohmm) and DEN (< 2.42
g/cm3) (Fig. 7). These can only be differentiated in core based on their
sedimentological differences, but plot together based on their petrophysical
properties (Table 3). The results of PCA show that GR contributes the most
information to facies differentiation, demonstrated by a high eigen absolute
value (an indication of how well that variable differentiates the groups),
followed by PDPE and DEN. SONIC and NEUTRON have the same
contribution rate, while the log that provides the least information relation to
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
discriminating WLF is LLD (Fig. 6B and C). The FDA results are consistent
with the CF classification results from LDA and this demonstrates that WLF
determination from LDA is effective and credible.
4.2.2 Facies succession analysis using MCA
The transition frequency matrix, transition probability matrix, independent
trials probability matrix, and difference matrix for cored well data by MCA are
presented in Table 4 and Fig. 8. Only three facies transitions were determined to
be highly significant, larger than predicted for a random sequence at the 0.20
level of significance. Six facies transitions are moderately significant at levels
between 0.15 and 0.20. Four transitions were significant at levels from 0.10 to
0.15. Finally, nineteen facies transitions were slightly significant between the
levels of 0.01 to 0.10 (Fig. 8 and Table 4C). The Powers-Easterling method was
used to test the matrix results with a chi-square value of 186.37, 68 degrees of
freedom, and a critical value of 118.57. This means that facies transitions are
significant with a 96% confidence level indicating a strong rejection of the null
hypothesis which was random deposition.
The most significant facies transitions support our facies association
interpretations (Fig. 9). For example, the shoreface facies association is
supported by the transition from bioturbated sandy mudstone (MB) passing
upward into bioturbated muddy sandstone with wave-ripple to HCS interbeds
(SD). This transition is supported by a significance value of 0.17. Similarly, all
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
five major facies associations are identified from the embedded Markov Chains
method: channel-levee complex (Wireline Log Facies AssociationⅠ),
floodplain / lower delta plain (Wireline Log Facies AssociationⅡ), subaqueous
delta (Wireline Log Facies Association Ⅲ), shoreface (Wireline Log Facies
Association Ⅳ), and tidal flats and channels (Wireline Log Facies Association
Ⅴ; Fig. 9).
Wireline Log Facies AssociationⅠis characterized by a sandy facies
succession that includes facies SA, SB, and SC. This association consists of two
major facies transitions: coarse-grained planar-tabular to trough cross-stratified
sandstone (SA), transitioning into fine-grained planar-tabular grading into
current-ripple laminated sandstone (SB), to fine-grained planar-parallel
laminated sandstone (SC). This vertical transition sequence indicates a
transition from braided channel complex deposits to a lower-energy meandering
channel environment (Fig. 9).
Wireline Log Facies AssociationⅡ is dominated by muddy and organic
facies: OA, MA, and less commonly MB. This association shows frequent
transitions from thicker massive mudstone (MA) to coal (OA), capped with
coarse silt bioturbated sandy mudstone (MB). This mud-dominated succession
is interpreted to have mainly been deposited on a low-energy floodplain or delta
plain.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Wireline Log Facies Association Ⅲ consists of the WLF MA, SMB, SMA,
SC, SB, and less commonly OB (Fig. 9). These transitions form a coarsening-
upward succession, indicating deposition that grades from low-energy muddy
prodelta deposits to a wave-influenced sandy delta front environment.
Wireline Log Facies Association Ⅳis composed of only a single transition:
bioturbated sandy mudstone (MB) grading into bioturbated muddy sandstone
with wave-ripple and HCS interbeds (SD), arranged as a coarsening-upward
succession. This association is interpreted to represent the upper offshore to
shoreface transition (Fig. 9).
Wireline Log Facies Association Ⅴ comprises facies SD, OB, SMB, and
MB. These transitions construct a fining-upward succession and are interpreted
to reflect sandy lower tidal flats to mixed sandy and muddy tidal flats, capped
with mud dominated upper tidal flats and lagoons (Fig. 9).
4.3 Wireline log facies prediction using MLPC and facies associations
Three WLF training wells were selected - Condabri MB9-H, Woleebee
creek GW4 and Reedy Creek MB3-H - on the basis of their geographic location
within the basin and the availability of appropriate well logs and core data
(Table 5). From these, 12194 data points from the six logging parameters, DEN,
SONIC, LLD, GR, NEUTRON and PDPE, were used as inputs to the MLPC
model (Fig. 4).
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
The MLPC method shows geologically reasonable facies prediction results
(Fig. 10A). By increasing the number of training cycles, the convergence error
of prediction decreased sharply and became asymptotic at 0.53, corresponding
to 1300 cycles. At 1300 cycles the model was stable and had the best match
with the core-defined facies. The accuracy of facies prediction (Rs means the
ratio of the count of correctly identified facies sample to the number of core-
defined facies samples) to CF ranges from 66.48 % to 99.14 % with an average
value of 83.05% (Fig. 10A; Table 5). Moreover, the WLFs SA, SB and OB –
the most representative siliclastic facies – are for the most part correctly
classified with a prediction accuracy of > 92%. By contrast, the WLFs SC and
SMA were less accurately predicted by the MLPC method with Rs values of
less than 75%. These WLFs were commonly misidentified as one another or
SMB (Table 5). This is because the rescaled distance between petrophysical
properties among these facies was very small (Fig. 10B). We also preferentially
chose WLF predictions that resulted in thick intervals rather than those with
thinner intervals due to their improved mapping potential (Fig. 11). For
instance, the thickness intervals of SA, SB, SMB seemed to be predicted better
than other facies, because thicker facies intervals are less influenced by the
petrophyscial signature of their neighbouring facies.
4.4 Mapping of WLF in the Study Area
We applied the results from the robust MLPC model to the uncored
intervals in the wells with at least 4 wireline logs (Fig. 12 A) to develop a better
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
understanding of the facies distribution and depositional setting. In the lowstand
systems tract (LST; i.e., the Precipice Sandstone) (Fig. 12B), the WLFs were
dominated by SA and the proportion of WLFs other than SA were not
volumetrically (or spatially) important. However, from the analysis we
determined that the thickness of SA is greatest in Woleebee Creek GW4, with
the thickness decreasing sharply towards the margins of the basin. In the
southwestern part of the study area, SA is not widespread. In contrast, within
the transgressive systems tract (TST; broadly equivalent to the lower Evergreen
Formation) facies zonation is more prominent (Fig. 12 C). The TST shows
channel sandstone facies mainly located in the southwestern and northern part
of study area, whereas the central portion of the region is mudstone-dominated
(Fig. 12 D). This suggests that at these stratigraphic levels sediment input was
mainly from the southwest and northern portion of the basin with secondary
provenance being located on the northeastern and eastern margins.
5. Discussion
5.1 Effects of the scale of observation on WLF prediction
The accuracy of the MLPC prediction strongly depends on the input
provided by the training data. To ensure highly reliable WLF prediction
adequate training data are needed. However, more data does not always equate
with better prediction results. It is far more important to acquire representative
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
training samples and remove outliers on the basis of geological insights from
core.
The mismatch between the resolution of core logging and wireline log data
makes it challenging to obtain facies identification at an appropriate scale for
study. Wireline logs are recorded at the cm-scale, whereas facies are typically
assigned at the m-scale for utility in mapping of depositional environments.
Therefore, some balance is necessary and WLF may have to be upscaled to a
useful thickness for reservoir or regional studies. An example of this in our
dataset was the differentiation of the facies SMA and SMB. SMA and SMB are
relatively easily differentiated on the basis of sedimentological features
observed in core. However, in wireline logs, the distinction between these facies
by MLPC methods is more obscure. For example, in Condabri MB9-H between
the depths of 1475-1484.72 m a bad match between the CF and predicted WLF
occurs (Fig. 11). This shows how SMA can be misidentified as SMB, resulting
in a low prediction accuracy overall for these two particular WLFs. Therefore, it
is necessary to have a geologist ensure that the predicted WLF are geologically
sensible and not to rely solely on numerical results from the neural network.
5.2 Effects of Well-Log Input
The use of a full suite of wireline logs as input greatly increases the
prediction accuracy of WLF, as manifest in a decreased convergence error.
However, the full suite of logs seldom occurs in wells within the study area;
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
only 37.62 % of wells had all six types of logs available. Therefore, we had to
determine the minimum number of logs needed to produce an acceptable
accuracy of facies determination that would be useful for mapping depositional
environments on a regional scale. The accuracy of facies recognition decreases
step wise with decreasing log input data, such that when only gamma ray,
density, deep resistivity, and sonic are used to establish the MLPC structure the
accuracy drops to between 45% and 98% depending on the facies (avg. 67%;
Fig. 13) with a convergence error of 0.75. We considered this to be the bottom
threshold for facies prediction appropriate for regional paleogeography
determination and reservoir modelling.
In addition to the number of logs, the choice of wireline log input affects
the accuracy of WLF prediction results. For example, when using NEU and
DEN, the accuracy is quite high; however, the additional input of SONIC does
not greatly improve the accuracy. The reason for this is that different logs add
different levels of “new” information to the neural network. New independent
information will increase the identification ability significantly, while redundant
or even conflicting information may reduce the neural network recognition
ability. Different WLF have different sensitivity to the input log parameters. In
our case example, SA, SB and SMB facies consistently have a high prediction
accuracy no matter which type of well logging data are used for prediction.
However, a decrease of input log data exerts great influence on the
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
identification of MB, SD and SC. The lack of DEN strongly affects the
accuracy of OB facies prediction.
5.3 Cross Validation by Withholding Input Log Data
Cross validation is useful to evaluate the performance of MLPC. In this
study, the training dataset in the four key wells was divided into two groups: a
training group and a validation group. The training dataset accounted for
approximately 75% of the entire dataset and was used to calculate errors and
adjust connection weights and bias. The residual validation group was used to
avoid over-training or over-fitting by detecting the predicted results in the
validation group. In practice, the training dataset was randomly collected from
three-quarters of the dataset and the remaining part was applied as a validation
group. During validation the jackknife statistical approach (Wang et al., 2012)
was run ten times using subsets of available data. From cross-validation we
were able to understand the range of prediction accuracy for each facies (Fig.
14). The cross-validation results suggest the prediction accuracy ranges from
48% to 97% depending on the facies with an average value of 71%.
Additionally, there is 70-97% prediction accuracy for common facies but
significantly lower accuracy for less-common and improminent facies. It also
means that the established MLPC mode actually gives some satisfying
performance.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
5.4 Geographic and Stratigraphic Distribution of Facies: Implications
for the Depositional History of the Precipice-Evergreen Succession in
the Northern Surat Basin
Facies interpretation of uncored wells gives detailed information about the
regional depositional environment especially when used in conjunction with
Vshale maps. The Precipice Sandstone was dominated by SA facies deposited
in braided fluvial systems. The coarse-grained sandstone was deposited over an
increasingly large area through time as paleovalleys in the underlying
unconformity were filled (Exon, 1976). But the southwestern part of the study
area lacks these thick sandstone deposits (Fig. 15A) because elevated basement
blocks to the west and southwest provided the main sediment input source for
the area and this was an area of sediment bypass or non-deposition (Exon, 1976;
Green et al., 1997). During deposition of the Lower Evergreen Formation, sea
level rise occurred rapidly (Wang et al., 2018). Many deltas building out into
the central basin (near wells West Wandoan 1 and Trelinga 1) from the west and
southwestern margin extending into the basin for a distance of at least 53 km.
West Wandoan 1, Woleebee Creek GW4, and Trelinga 1 are interpreted to be
located near the locus of deposition– representing the “basin centre”. Younger
fluvio-deltaic systems cut into older strata with complex cross-cutting
relationships (Fig. 15B). It is also possible that minor delta complexes could
have been sourced from the north and eastern parts of the basin, but did not
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
extended as far from their provenance areas (Auburn Arch and Yarraman
Block) potentially due to the fact that accommodation space was being filled by
these nearshore and shallow marine systems (Bianchi et al., 2018). The stratal
stacking patterns are indicative of progressive backstepping of depositional
environments up-section and towards the basin margins within the Precipice
Sandstone and Lower Evergreen Formation.
5.5 Application of WLF prediction to CCS in the Surat Basin
The WLF distribution is essential for modelling reservoir flow units in the
Precipice Sandstone, as well as evaluating the sealing capacity of the overlying
Evergreen Formation. From the facies maps of the LST and TST, the greatest
stratigraphic sealing potential occurs in the central and northwestern part of
study area because of more laterally continuous and thicker muddy intervals. By
contrast, in the southwestern part, channel sandstones in the Lower Evergreen
Formation are widely distributed. As porosity and permeability are influenced
by the stacking patterns of facies, a realistic 3-dimensional facies distribution
has the potential to significantly improve modelling of facies and reservoir
properties in static reservoir models.
6. Conclusion
In this paper, a robust workflow is introduced to predict siliciclastic
sedimentary facies from wireline logs by integrating Multi-Layer Perceptron
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Classifier (MLPC) and Markov Chain Analysis (MCA) techniques. To
summarize the key finding of this research:
1) Twenty core facies were defined from the Precipice Sandstone and
Evergreen Formation that were distinguished based on sedimentary texture,
physical sedimentary structures, and bioturbation.
2) Using statistical means, the 20 core facies were simplified into 10
recurring wireline log facies with common wireline log responses and
petrophysical distributions. Using MCA methods, 44 significant facies
transitions were observed, and these were used to group the WLF into five
wireline log facies associations – fluvial channel belt, lower delta plain,
subaqueous delta, shoreface, and tidal flats and channels.
3) After establishing a representative training set of WLF artificial neural
networks were trained using key cored wells to predict WLF in wells where
core was not present. The average overall prediction accuracy was > 83% for
the most common facies.
4) We used the WLF to map depositional environments across a large area
in the north-central portion of the Surat Basin to better understand the
paleogeography and improve reservoir modelling efforts for CO2 storage
application. This is one of the most detailed attempts at understanding the
paleogeography in this stratigraphic interval and should be investigated further
for the rest of the basin.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Acknowledgments
We thank the Australian Government, through the CCS RD&D
programme, ACA Low Emissions Technology (ACALET), and the University
of Queensland for financial support of this project. We also thank APLNG,
CTSCo, and QGC for data access. We also appreciate Ahmed Harfoush and Iain
Rodger for providing constructive suggestions on facies prediction and
petrophysical analysis. Staff at the DNRM Exploration Data Centre in Zillmere
are acknowledged for access to core data. JH’s research was supported by the
China Scholarship Council (201706400014), Fundamental Research Funds for
the Central Universities (2652017309), National Science and Technology Major
Project of China (2016ZX05046-003-001 and 2016ZX05034-004-003), and the
National Natural Science Foundation Projects (GrantNos 41372139 and
41072098). We thank Associate Editor Sergio G. Longhitano and two
anonymous reviewers for their constructive revisions and comments that greatly
improve this manuscript.
References
Avseth, P., Mukerji, T., Jorstad, A., Mavko, G., Veggeland, T., 2001. Seismic
reservoir mapping from 3-D AVO in a North Sea turbidite system.
Geophysics. 66 (4), 1157-1176.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Baniak, G.M., La Croix, A.D., Polo, C.A., Playter, T., Pemberton, S.G., and
Gingras, M.K., 2014. Associating x-ray microtomography with
permeability contrasts in bioturbated media: Ichnos, v. 4, 234-250.
Berteig, V., Helgeland, J., Mohn, E., 1985. Lithofacies prediction from well
data. In: Proceedings of SPWLA Twenty-Sixth Annual Logging
Symposium.
Bhatt, A., Helle, H., 2002. Determination of facies from well logs using
modular neural networks. Pet. Geosci. 8 (3), 217-228.
Bianchi, V., Zhou, F., Pistellato, D., Martin, M., Boccardo, S., Esterle, J., 2018.
Mapping a coastal transition in braided systems: an example from the
Precipice Sandstone, Surat Basin. Australian Journal of Earth Sciences.
65(4), 483-502.
Bohling, G.C., Dubois, M.K., 2003. An integrated application of neural network
andMarkov chain techniques to prediction of lithofacies from well logs.
Kansas geological survey open-file report. 50, 6.
Borer, J.M., Harris, P.M., 1991. Lithofacies and cyclicity of the Yates
Formation; Permian Basin: implications for reservoir heterogeneity. AAPG
Bulletin. 75 (4), 726-779.
Bradshaw, J., 2010. Regional Scale Assessment Results & Methodology
Queensland CO2 Storage Atlas. 2nd EAGE Workshop on CO2 Geological
Storage, Berlin, Germany.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Burton, D. and Wood, L.J. (2013) Geologically-based permeability anisotropy
estimates for tidally-influenced reservoirs using quantitative shale data.
Petrol. Geosci., 19, 3–20.
Carle, S.F., 1999. T-PROGS: Transition Probability Geostatistical Software.
Version 2.1 User's Guide. University of California, Davis, CA.
Chang, H.C., Kopaska-Merkel, D.C., Chen, H.C., 2002. Identification of
lithofacies using Kohonen selforganizing maps. Comput. Geosci. 28, 223-
229.
Chang, H.C., Kopaska-Merkel, D.C., Chen, H.C., Durrans, S.R., 2000.
Lithofacies identification using multiple adaptive resonance theory neural
networks and group decision expert system. Comput. Geosci. 26, 591-601.
Cuddy, S., 2000. Litho-facies and permeability prediction from electrical logs
using fuzzy logic. SPE Reserv. Evalu. Eng. 3 (4), 319-324.
Dalrymple, R.W., 2010. Interpreting sedimentary successions, in James, N.P.,
and Dalrymple, R.W., eds., Facies Models 4: St. Johns, Newfoundland,
Geological Association of Canada, p. 3-18.
Davis, J.C., 1986. Statistics and Data Analysis in Geology, 2nd ed.: New York,
Wiley, 646.
Deng, C.X., Pan, H.P., Fang, S.N., Konaté, A.A., Qin, R.D., 2017. Support
vector machine as an alternative method for lithology classification of
crystalline rocks. Journal of Geophysics and Engineering, 14(2), 341.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Derek, H., Johns, R., Pasternack, E., 1990. Comparative study of a
backpropagation neural network and statistical pattern recognition
techniques in identifying sandstone lithofacies. Proceedings 1990
Conference on Artificial Intelligence in Petroleum Exploration and
Production. Texas A and M University, College Station, TX, 41-49.
Dill, H., Ludwig, R.R., Kathewera, A., Mwenelupembe, J., 2005. A lithofacies
terrain model for the Blantyre Region: implications for the interpretation of
palaeosavanna depositional systems and for environmental geology and
economic geology in southern Malawi. Journal of African Earth Sciences.
41 (5), 341-393.
Dubois, M.K., Bohling, G.C., Chakrabarti, S., 2007. Comparison of four
approaches to a rock facies classification problem. Computers &
Geosciences 33 (5), 599–617.
El-Sebakhy, E.A., Asparouhov, O., Abdulraheem, A., Wu, D., Latinski, K.,
Spries, W., 2010. Data mining in identifying carbonate lithofacies from
well logs based from extreme learning and support vector machines. In:
Proceeding of AAPG GEO 2010 Middle East geoscience conference &
exhibition, 1-17.
Exon, N.F., 1976. Geology of the surat basin in Queensland. Bureau of Mineral
Resouirces, Geology and Geophysics, Canberra, Australia, 160.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Exon, N.F., Burger, D., 1981. Sedimentary cycles in the Surat Basin and global
changes in sea level. BMR Journal of Australian Geology and Geophysics.
6, 153-159.
Eyi, S., 2012. Convergence Error Estimation and Convengence Acceleration in
Iteratively Soved Problem. Seventh International Conference on
Computational Fluid Dynamics, ICCFD7 Papers 2012- 1802, July 9-13,
Big Island, Hawaii.
Fielding, C.R., 1996. Mesozoic sedimentary basins and resources in eastern
Australia – a review of current understanding, Mesozoic Geology of the
Eastern Australia Plate Conference. Geological Society of Australia,
Brisbane, Queensland. 180-185.
Fielding, C.R., Gray, A.R.G., Harris, G.I., Saloman, J.A., 1990. The Bowen
Basin and overlying Surat Basin, in: Finlayson, D.M. (Ed.), The
Eromanga–Brisbane Geoscience Transect: A Guide to Basin Development
Across Phanerozoic Australia in Southern Queensland. Australian
Government Publishing Service, Canberra, ACT.
Gallagher, K., Dumitru, T.A., Gleadow, A.J.W., 1994. Constraints on the
verticle motion of eastern Australia during the Mesozoic. Basin Research.
6, 77-94.
Gingerich, P.D., 1969. Markov analysis of cyclic alluvial sediments. Journal of
Sedimentary Petrology. 39(1), 330-332.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Green, P.M., Hoffmann, K.L., Brain, T.J., Gray, A.R.G., 1997. The Surat and
Bowen Basins, south-east Queensland, Queensland Minerals and Energy
Review Series. Queensland Department of Mines and Energy, Brisbane,
Queensland, 244.
Grinstead, C.M., Snell, J.L., 1997. Introduction to Probability. American
Institute of Mathematics, Palo Alto, 405-413.
Hammer, O., 1999. PAST: PAIeontological Statistics. Version 3.18 User’s
Guide. University of Oslo, Natrual History Museum, Norway.
Haykin, Simon, 1998. Neural Network: A Comprehensive Foundation (2 ed.).
Prentice Hall. ISBN 0-13-273350-1.
He, J.H., Ding, W.L., Jiang, Z.X., Li, A., Wang, R.Y., Sun, Y.X., 2016.
Logging identification and characteristic analysis of the lacustrine organic-
rich shale lithofacies: A case study from the Es3l shale in the Jiyang
Depression, Bohai Bay Basin, Eastern China. Journal of Petroleum Science
and Engineering. 145, 238-255.
Hodgkinson, J., Grigorescu, M., 2013. Background researchj for selection of
potential geostorage targets – case studies from the Surat Basin,
Queensland. Australian Journal of Earth Sciences. 60, 71-89.
Hoffmann, K.L., Totterdell, J.M., Dixon, O., Simpson, G.A., Brakel, A.T.,
Wells, A.T., Mckeller, J.L., 2009. Sequence stratigraphy of Jurassic strata
in the lower Surat Basin succession, Queensland. Australian Journal of
Earth Sciences. 56, 461-476.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Hsieh, A.I., Allen, D.M., MacEachern, J.A., 2015. Statistical modeling of
biogenically enhanced permeability in tight reservoir rock. Marine and
Petroleum Geology. 65, 114-125.
Jones, N.L., Walker, J.R., Carle, S.F., 2002. Using Transition Probability
Geostatistics with MODFLOW. IAHS Publication .277, 359-364.
Jungmann, M., Kopal, M., Clauser, C., Berlage, T., 2011. Multi-class
supervised classification of electrical borehole wall images using texture
features. Comput. Geosci. 37, 541-553.
Khalifa, M.A., 2005. Lithofacies, diagenesis and cyclicity of the ‘‘Lower
Member’’ of the Khuff Formation (Late Permian), Al Qasim Province,
Saudi Arabia. Journal of Asian Earth Sciences. 25 (5), 719-734.
Kordon, A.K., 2010. Applying Computational Intelligence: How to Create
Value. The Dow Chemical Company, Freeport, TX, U.S.A, 459.
Korsch, R.J., O’Brien, P.E., Sexton, M.J., Wake-Dyster, K.D., Wells, A.T.,
1989. Development of Mesozoic transtensional basins in easternmost
Australia. Australian Journal of Earth Sciences. 36, 13-28.
Korsch, R.J., Totterdell, J.M., 2009. Subsidence history and basin phases of the
Bowen, Gunnedah and Surat Basins, eastern Australia. Australian Journal
of Earth Sciences. 56, 335-353.
La Croix, A.D., Gingras, M.K., Pemberton, S.G., Mendoza, C.A., MacEachern,
J.A., and Lemiski, R.T., 2013. Biogenically enhanced reservoir properties
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
in the Medicine Hat gas field, Alberta, Canada: Journal of Marine and
Petroleum Geology, v. 43, 464-477
La Croix, A.D., MacEachern, JA., Ayranci, K., Hsieh, A., and Dashtgard, S.E.,
2017, An ichnological-assemblage approach to reservoir heterogeneity
assessment in bioturbated strata: insights from the Cretaceous Viking
Formation, Alberta, Canada. Joournal of Marine and Petroleum Geology,
v.86, 636-654.
Le Roux, J.P., 1994 Spreadsheet procedure for modified first-order embedded
Markov analysis of cyclicity insediments. Computers & Geosciences.
20(1), 17-22.
Li. Y., Anderson-Sprecher. R., 2006. Facies identification from well logs: a
comparison of discriminant analysis and naive Bayes classifier. Journal of
Petroleum Science and Engineering. 53(3–4), 149–157.
Lindberg, D.V., Grana, D., 2015. Petro-elastic log-facies classification using the
expectation–maximization algorithm and hidden Markov
models. Mathematical Geosciences. 47(6), 719-752.
Martin, K.R., 1981. Deposition of the Precipice Sandstone and the evolution of
the Surat Basin in the Early Jurassic. The APEA Journal. 21, 16-23.
Martin, M.A., Wakefield, M., MacPhail, M.K., Pearce, T., Edwards, H. E.,
2013. Sedimentology and stratigraphy of an intra-cratonic basin coal seam
gas play: Walloon Subgroup of the Surat Basin, eastern
Australia. Petroleum Geoscience, 19(1), 21-38.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Martin, M., Wakefield, M., Bianchi, V., Esterle, J., Zhou, F., 2018. Evidence for
marine influence in the Lower Jurassic Precipice Sandstone, Surat Basin,
eastern Australia. Australian Journal of Earth Sciences. 65(1), 67-91.
Miall, A.D., 1973. Markov chain analysis applied to an ancient alluvial palin
succession. Sedimentology. 20, 347-364.
Micheli-Tzanakou, E., 2000. Supervised and unsupervised pattern recognition:
feature extraction and computational intelligence. CRC Press, Boca Raton,
371.
Middleton, G.V., 1978, Facies, in Fairbridge, R.W., and Bourgeois, J., eds.,
Encyclopedia of Sedimentology: Stroudsbury, Pennsylvania, Dowden,
Huchison and Ross, P. 323-325.
Mollan, R.G., Forbes, V.R., Jensen, A.R., Exon, N.F., Gregory, C.M., 1972.
Geology of the Eddystone, Taroom and western part of the Munduberra
Sheet areas, Queensland. Bureau of Mineral Resources, Geology and
Geophysics, Australia.
Power, P.E., Devine, S.B., 1970. Surat Basin, Australia – subsurface
stratigraphy, history, and petroleum. American Association of Petroleum
Geologists Bulletin. 54, 2410-2437.
Powers, D.W., and Easterling, R.G., 1982. Improved methodology for using
embedded Markov chains to describe cyclical sediments. Journal of
Sedimentary Research. 52, 3.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Qi, L., Carr, T.R., 2006. Neural network prediction of carbonate lithofacies
from well logs, Big Bow and Sand Arroyo Creek fields, southwest Kansas.
Comput. Geosci. 32, 947-964.
Qing, H., Nimegeers, A.R., 2008. Lithofacies and depositional history of Midale
carbonate-evaporite cycles in a Mississippian ramp setting, Steelman-
Bienfait area, southeastern Saskatchewan, Canada. Bulletin of Canadian
Petroleum Geology. 56 (3), 209–234.
Raza, A., Hill, K.C., Korsch, R.J., 2009. Mid-Cretaceous uplife and denudation
of the Bowen and Surat Basins, eastern Australia: relationship to Tasman
Sea rifting from apatite and fission-track and vitrinite-reflectance data.
Australian Journal of Earth Sciences. 56, 501-531.
Saggaf, M.M., Nebrija, E.L., 2003. A fuzzy logic approach for the estimation of
facies from wire-line logs. AAPG Bull. 87(7), 1223-1240.
Sakurai, S., Melvin, J., 1988. Facies discrimination and permeability estimation
from well logs for the Endicott field. 29th Annual APWLA Symposium.
San Antonio, Texas.
Sell, B.H., Brown, L.N., Groves, R.D., 1972. Basal Jurassic sands of the Roma
area. Queensland Government Mining Journal. 73, 309-321.
Siripitayananon, P., Chen, H., Hart, B., 2001. A new technique for lithofacies
prediction: back-propagation neural network. Proceedings of the 39th
Annual ACM-SE Conference.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Tang, H., Meddaugh, W.S., Toomey, N., 2011. Using an artificial-neural-
network method to predict carbonate well log facies successfully. SPE
Reservoir Evaluation & Engineering. 14(01), 35-44.
Tang, H., White, C., Zeng, X., Gani, M., Bhattacharya, J., 2004. Comparison of
multivariate statistical algorithms for wireline log facies classification.
AAPG AnnualMeeting Abstract. 88, 13.
Wang, G., Carr, T.R., Ju, Y., Li, C., 2014. Identifying organic-rich Marcellus
Shale lithofacies by support vector machine classifier in the Appalachian
basin. Comput. Geosci. 64, 52-60.
Wang, G., and T. R. Carr, 2012b. Methodology of organicrich shale lithofacies
identification and prediction: A case study from Marcellus Shale in the
Appalachian Basin. Computers & Geosciences. 49, 151-163.
Wang, G., and T. R. Carr, 2012c. Marcellus Shale lithofacies prediction by
multiclass neural network classification in the Appalachian Basin.
Mathematical Geosciences, 44, 975-1004.
Wang, G.C., Timothy, R.C., 2012. Marcellus shale lithofacies prediction by
multiclass neural network classification in the Appalachian Basin. Comput.
Geosci. 44, 975-1004.
Wang, G.C., Timothy, R.C., 2013. Organic-rich Marcellus Shale lithofacies
modelling and distribution pattern analysis in the Appalachian Basin.
AAPG Bull. 97, 2173-2205.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Wang, J.H., La Croix, A.D., Gonzalez, S., He, J.H., Underschutlz, J., in press.
Sequence Stratigraphic Analysis of the Precipice Sandstone and Evergreen
Formation in the Surat Basin: implications for reservoir and seal
architecture for CO2 Storage. Marine and Petroleum Geology.
Waschbusch, P., Korsch, R.J., C., B., 2009. Geodynamic modelling of aspects
of the Bowen, Gunnedah, Surat and Eromanga Basins from the perspective
of convergent margin processes. Australian Journal of Earth Sciences. 56,
309-334.
Weissmann, G.S., 2005. Application of transition probability geostatistics in a
detailed stratigraphic framework. In: Workshop for GSA Annual Meeting,
Three-dimensional Geologic Mapping for Groundwater Applications.
University of New Mexico, USA, 105-108.
Wells, N.A., 1989. A program in BASIC for facies-by-facies Markov chain
analysis. Computers & Geosciences. 15(1), 143-155.
Wong, P., Jian, F., Taggart, I., 1995. A critical comparison of neural networks
and discriminant analysis in lithofacies, porosity and permeability
predictions. J. Pet. Geol. 18 (2), 191-206.
Wong, P.M., Henderson, D.J., and Brooks, L.J., 1998. Permeability
Determination Using Neural Network in the Ravva Field, Offshore India.
SPE Res Eval & Eng. 1(2), 90-104.
Xu, H., MacCarthy, I.A.J., 1998. Markov chain analysis of vertical facies
sequences using a computer software package (SAVFS): Courtmacsherry
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Formation (Tournaisian), Southern Ireland. Computers &
Geosciences. 24(2), 131-139.
List of Figures and Tables
Fig. 1. Structure contour map of the base-Surat unconformity within the study
area.
Fig. 2. Stratigraphic column for the Lower Jurassic in the Surat Basin (after
Hoffmann et al., 2009).
Fig. 3. An example of a Markov chain transiogram (modified from Carle, 1999
and Hsieh at al., 2015). The transition probability of F1 passing upward
into F2. The point at which the Markov chain levels out is the “sill” and the
lag distance at which the Markov chain reaches the sill is the “range”. The
transition rate is defined by the slope of the tangent line, and the mean lens
length is the lag distance at which the tangent line
Fig. 4. Schematic diagram showing the architecture of the Multilayer perceptron
classifier. Facies MA, MB, SMB, SMA, SD, SC, SB, SA, OA and OB are
the different types of logging facies; MA, MB: Mudstone facies; SMB,
SMA: Heterolithic facies; SD, SC, SB, SA: Sandstone facies; OA, OB:
Organics and Miscellaneous facies. See table 1 for facies descriptions.
Fig. 5. Comparison between the core facies and gamma ray log signature for the
20 core facies we observed in the Precipice Sandstone and Evergreen
Formation.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 6. (A) Results of liner discriminant analysis in key cored wells with all six
well logs that comprise a “full suite”. This shows how the twenty core
facies were grouped into ten wireline log facies (indicated by the coloured
circles). (B and C) Principal component analysis showing the relative
importance of the various input logs to facies differentiation (e.g. GR,
PDPE and SONIC). Note: LLD: deep resistivity; PDPE: Photoelectric
factor; GR: gamma ray; DEN: bulk density; SONIC: compressional
slowness; NEUTRON: neutron porosity.
Fig. 7. Rider chart showing the recognition models for the ten types of wireline
log facies. The average value of each log in the different wireline log facies
is displayed.
Fig. 8. Results of the Markov Chain Analysis of the various wireline log facies:
(A) Transition frequency matrix; (B) transition probability matrix; (C)
independent trials probability matrix; and, (D) difference matrix.
Statistically significant transitions have been marked with a black
rectangle.
Fig. 9. Conceptual models of the five facies associations determined by MCA.
An approximate scale indicates the thickness of individual facies. The
arrow shows the transitions from one facies to another. The number on the
arrow represents the level of significance of facies transitions (based on the
difference matrix). (A) Channel Complex Association. (B) Delta Plain
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Association. (C) Subaqeous Delta Association. (D) Shoreface Association.
(E) Tidal Flats and Channels Association.
Fig. 10. (A) Relative proportion of the ten wireline log facies predicted by
MLPC method with the prediction accuracy. (B) Rescaled distance
between different wireline log facies calculated from input variable space.
Fig. 11. An example of the wireline log facies prediction using the Multilayer
Perceptron Classification method compared to core-defined lithofacies in
the Condabri MB9-H well. J10: base-Surat unconformity; TS1: the
transgressive surface at the top of the Precipice Sandstone; MFS1:
maximum flooding surface; SB2: sequence boundary within the Evergreen
Formation; J20: unconformity at the approximate base of the Boxvale
Sandstone Member; TS3: transgressive surface underlying the Westgrove
Ironstone Member; MFS3: maximum flooding surface; J30: top of the
Evergreen Formation (Wang et al., in press). Wireline log facies colour
codes are the same as Figure 4.
Fig. 12. Wireline log facies prediction results for wells within the study area.
(A) well location map of study area with colours representing the number
of logs used in the neural network facies prediction; (B) Pie chart map
showing wireline log facies proportions for the interval between J10 and
TS1. The number in the pie charts indicates the cumulative thickness of
sandstone in meters; (C) Pie chart map showing wireline log facies
proportions for the interval between TS1 and MFS1. The number in the pie
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
charts indicates the cumulative thickness of sandstone in meters; (D) The
dominant (thickest) wireline log facies in their interval from TS1 to MFS1,
showing different facies zones and potential sediment provenance regions.
Fig. 13. (A) Final convergence error. (B) Prediction accuracy based on the input
log types used for wireline log facies prediction.
Fig. 14. Cross-validation results of the ten wireline log facies from the three key
wells: Condabri MB9-H, Reedy Creek MB3-H, and Woleebee Creek
GW4.
Fig. 15. Facies distribution maps based on wireline log facies determined from
neural networks. (A) The Precipice Sandstone. The zero-thickness
boundary shown in red is based on seismic interpretation (B) The Lower
Evergreen Formation.
Table 1 Description of the 20 core facies observed in the Precipice Sandstone
and Evergreen Formation.
Table 2 The relationship between core facies and wireline log facies from the
five key core wells.
Table 3 A summary of wireline log characteristics for the ten wireline log facies.
Table 4 Statistical data from the Markvo Chain Analysis: (A) Transition count
matrix; (B) Transition probability matrix; and, (C) Difference matrix.
Table 5 Plot of the wireline log facies from the MLPC, compared with the core-
defined facies, showing the proportion of correctly predicted facies on the
diagonal and miss-classified facies in the off diagonal.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 1a Description of the 20 core facies observed in the Precipice Sandstone and Evergreen Formation.
Facies Name Compositon Facies Description Interpretation
Conglomerate and
Breccia Facies
G1 >30%; granules
And coarser
Dark-grey coloured, structrureless interbedded conglomerate and sandstone
with a grain size of medium to very coarse sand and rarely exceeds granules
Lag deposit or
Channel base
G2 Breccia with structrureless, medium to very coarse-grained sandstone. The nature of
the lower contact is sharp or scouring
Channel base or
Channel bank collapse
Sandstone Facies
S1
>90% sand
Coarse-grained planar-tabular to trough cross-strtified sandstone, and some
Angular rip-up clasts, pebbles and pebble lags occur in the bottom
Braided channel or
Braided delta
distribution channel
S2 Fine-to-medium-grained, planar-tabular to current ripple laminated sandstone Meandering channel or
Distributary channel
S3 Wave to combined-flow ripple laminated sandstone with grain size ranging from
very fine sand to fine sand, and accessories consists of rootlets, carbonaceous
detritus and siderite horizons
Mouth bar
S4 Light grey-coloured, planar-parallel laminated sandstone with grain size ranging
from fine sand to medium sand
Channel levee
S5 Dark-grey, bioturbated sandstone and muddy sandstone with grain size ranging from
very fine sand to fine sand
Marine bay deposits
S6 Interbedded bioturbated muddy sandstone displaying wave-ripple lamination to
hummocky cross-stratification with a grain size of fine to medium sand
Shoreface
Mudstone Facies
M1
>90% mud;
Silt and clay
Dark grey or black plannar-laminated mudstone with thin sandstone laminae Prodelta
M2 Dark coloured structureless mudstone with a grain size of fine to coarse silt Floodplain
M3 Black coloured, rare horizontal planar parallel laminated, wavy or lenticular bedded,
bioturbated sandy mudstone
Brackish-bayfill or
Lagoon
M4 Coarse silt containing interstitial very fine to fine grained sand interbedded with rare
wave-ripple laminated to hummocky cross-stratification fine grained sandstone
Offshore/Shelf
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 1b Description of the 20 core facies observed in the Precipice Sandstone and Evergreen Formation.
Facies Name Compositon Facies Description Interpretation
Heterolithics Facies
SM1
>10% and <90%
sand
Light-grey to dark-grey coloured, medium to coarse silt interbedded with very fine
to fine grained sand and described as sand-diminated heterolithics
Proximal delta front
SM2 Less sand than mud (70%>sand>30%), heterolithics with current to combined flow
ripple lamination, wave ripple lamination and synaereses cracks
Distal delta front
SM3 A black or dark-grey colour, wave-influenced mud-dominated heterolithics with
medium to coarse silt and very fine to fine grained sand
Prosimal prodelta
SM4 Tide-influenced mixed heterolithics with more intense bioturbation accessories
consist of carbonaceous detritus, rootlets and rare sideritized horizons
Tidal flats
SM5 Grey colour, inclined heterolithics stratification with current ripples flashers, wavy,
and lenticular bedding and rare synaereses cracks
Tide-fluvial channel
Organic and
Miscellaneous Facies
O1
Bituminous to sub-bituminous coal Peat mire or
Interdistributary bay
O2 Grey colour, very fine silt to fine-grained sand, carbonaceous sandstone and siltstone Floodplain or
Interdistributary bay
O3 Reddish brown colour ironstone (oolithic or cemented) Lagoon or Restricted
embayment
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 2 The relationship between core facies and wireline log facies from the five key core wells. Core facies Key well thickness Porosity/% Permeability/ mD N Logging
Facies Feature of GR curve shape
G1 3 0.25-1.2/0.83 \ \ 6 Not Find Smooth concave bell shape G2 2 0.17-0.8/0.41 \ \ 5 Not Find Smooth concave bell shape S1 5 0.2-77.2/12.59 17.7-21.3/19.89 3.18-2500/2100 19 SA Smooth cylindrical shape S2 5 0.35-18.7/4.54 6.4-12.2/10.7 0.004-0.8/0.61 32 SB Smooth concave bell shape S3 5 0.2-10.33/2.26 5.9-11.2/9.05 0.002-0.23/0.038 64
SC Smooth concave funnel shape
S4 2 0.18-4.85/1.30 \ \ 19 Erratic concave funnel shape S5 3 0.1-3.2/0.98 8.1-9.1/8.5 0.085-0.26/0.18 11
SD Smooth concave funnel shape
S6 1 1.30-7.12/3.29 \ \ 4 Erratic concave funnel shape M1 5 0.14-3.65/1.43 \ \ 29
MA Erratic line shape
M2 4 0.12-7.60/1.15 \ \ 50 Smooth line shape M3 3 0.23-1.90/0.80 \ \ 10
MB Erratic line shape
M4 1 0.53-4.13/2.33 \ \ 2 Smooth line shape SM1 5 0.18-4.84/1.53 4.6-11.4/9.3 0.013-0.069/0.03 53
SMA Smooth concave egg shape
SM2 4 0.25-5.70/2.08 5.2-9.4/8.15 0.002-0.051/0.028 31 Erratic concave egg shape SM3 5 0.16-10.6/1.49 5.1-7.2/7.38 0.001-0.032/0.023 70
SMB Erratic line shape
SM4 5 0.11-5.50/1.54 2.8-6.8/5.7 0.001-0.029/0.021 94 Erratic line shape SM5 1 0.4-0.6/0.5 \ \ 2 Not Find Erratic concave egg shape O1 4 0.05-0.91/0.22 \ \ 21 OA Smooth convex egg shape O2 4 0.1-1.3/0.32 6.3-6.9/6.4 0.001-0.005/0.003 15 Not Find Erratic concave funnel shape O3 5 0.05-1.35/0.44 6.8-7.9/7.2 <0.001 33 OB Smooth convex egg shape
Note: 0.25-1.2/0.83: the minimum –maximum thickness/ the average thickness; N: the frequency of each core facies; 17.7-21.3/19.89, 3.18-2500/2100: the value reach the 25 th and 75 th percentiles of cumulative percentage/ average value
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 3 A summary of wireline log characteristics for the ten wireline log facies.
Logging facies Core Description Features of conventional logs GR(API) DEN(g/cm3) NEU(%) SONIC(us/f) LLD(ohmm) PDPE(B/E)
SA >90 sand 19.4-27.4/26.4 2.26-2.34/2.32 0.26-0.32/0.30 76.1-79.1/78.2 5.3-89.3/23.5 1.69-2.04/1.78 SB >90 sand 75.3-97.6/85.9 2.33-2.41/2.37 0.25-0.29/0.26 77.3-81.2/78.9 16.6-27.6/17.6 2.08-2.48/2.29 SC >90 sand 96.4-118.6/113.7 2.45-2.53/2.50 0.20-0.27/0.23 78.8-85.8/82.8 2.8-16.1/10.7 2.45-2.83/2.68 SD >90 sand 88.4-116.4/106.6 2.39-2.45/2.42 0.39-0.48/0.42 97.8-107.4/101.2 2.28-3.06/2.58 2.80-3.36/2.98
SMA 70%>sand>30% 91.9-118.8/98.4 2.40-2.48/2.46 0.26-0.32/0.26 77.1-91.2/79.32 2.28-24.7/6.13 2.39-2.84/2.41 SMB 30%>sand>10% 120.4-144.4/129.1 2.39-2.49/2.43 0.29-0.36-0.33 89.5-97.3/93.1 1.96-7.36/3.63 2.23-2.83/2.56 MA >90% mud;
Silt and clay 136.1-154.1/142.7 2.45-2.52/2.47 0.24-0.31/0.28 79.2-86.4/85.3 3.26-6.26/4.15 2.58-2.89/2.76
MB >90% mud; Silt and clay
112.4-127.2/120.4 2.40-2.45/2.44 0.41-0.49/0.45 99.6-107.7/103.4 2.36-3.02/2.65 2.68-3.12/2.89
OA COAL 88.9-119.9/108.7 2.12-2.28/2.19 0.31-0.48/0.40 87.5-104.3/95.9 3.26-25.5/11.26 2.03-2.18/2.11 OB Oolitic ironstone 30.4-87.4/66.5 2.71-2.98/2.91 0.04-0.24/0.13 68.8-87.8/84.2 3.06-29.06/7.86 5.72-8.72/6.98
Note: 19.4-27.4/26.4: the value reach the 25th and 75th percentiles of cumulative percentage/ average value
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 4 Statistical data from the Markvo Chain Analysis: (A) Transition count matrix; (B) Transition probability matrix; and, (C) Difference matrix.
(A) Transition count matrix
MA MB SMB SMA SD SC SB SA OA OB MA 0 2 36 5 3 11 4 2 10 15 MB 4 0 5 0 4 1 0 0 2 3
SMB 35 7 0 45 2 49 13 4 17 9 SMA 12 0 30 0 3 17 10 8 5 3 SD 1 3 5 2 0 0 0 0 3 3 SC 10 1 53 11 1 0 5 4 6 6 SB 3 0 13 13 1 5 0 1 1 2 SA 0 0 6 9 0 3 3 0 0 0 OA 7 2 21 1 0 9 2 1 0 1 OB 16 4 12 2 3 3 2 0 0 0
(B) Transition probability matrix
MA MB SMB SMA SD SC SB SA OA OB MA 0 0.02273 0.4091 0.05682 0.03409 0.125 0.04545 0.02273 0.1136 0.1705 MB 0.2105 0 0.1632 0 0.2105 0.01263 0 0 0.1053 0.1579
SMB 0.1934 0.03867 0 0.2486 0.01105 0.2707 0.07182 0.0221 0.09392 0.04972 SMA 0.1364 0 0.3409 0 0.03409 0.1932 0.1136 0.09091 0.05682 0.03409 SD 0.03882 0.1765 0.2941 0.0176 0 0 0 0 0.1765 0.1765 SC 0.1031 0.01031 0.5464 0.1134 0.01031 0 0.05155 0.04124 0.06186 0.06186 SB 0.07692 0 0.3333 0.3333 0.02564 0.1282 0 0.02564 0.02564 0.02128 SA 0 0 0.2857 0.4286 0 0.1429 0.1229 0 0 0 OA 0.1591 0.04545 0.4773 0.02273 0 0.2045 0.04545 0.02273 0 0.02273 OB 0.381 0.09524 0.2857 0.04762 0.07143 0.04143 0.04762 0 0 0
(C) difference matrix
MA MB SMB SMA SD SC SB SA OA OB MA 0 -0.0119 0.0788 -0.1038 0.0031 -0.0538 -0.0257 -0.0138 0.0333 0.0939 MB 0.0687 0 -0.0302 -0.1426 0.1829 -0.1062 -0.0632 -0.0324 0.0339 0.0898
SMB 0.0335 -0.0031 0 0.0552 -0.0263 0.0553 -0.0139 -0.0219 -0.0028 -0.0426 SMA -0.0241 -0.0347 0.0106 0 0.0031 0.0144 0.0424 0.0544 -0.0235 -0.0425 SD -0.0833 0.1758 0.0017 -0.0246 0 -0.1583 -0.0630 -0.0323 0.1054 0.1086 SC -0.0602 -0.0249 0.2106 0.0099 -0.0212 0 0.1208 0.0141 -0.0198 -0.0161 SB -0.0705 -0.0318 0.0301 0.1858 -0.0028 0.0359 0 0.0786 -0.0481 -0.0191 SA -0.1431 -0.0309 -0.0086 0.2855 -0.0276 -0.0165 -0.0795 0 -0.0715 -0.0683 OA 0.0105 0.0134 0.1716 -0.1259 -0.0287 0.0389 -0.0204 -0.0111 0 -0.0482 OB 0.2329 0.0633 0.0191 -0.1005 0.0428 -0.0936 -0.0180 -0.0337 -0.0741 0
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Table 5 Plot of the wireline log facies from the MLPC, compared with the core-defined facies, showing the proportion of correctly predicted facies on the diagonal and miss-classified facies in the off diagonal. Core-defiend
facies Predicted litofacies Core
total Proportion of correct
predictions MA MB SMB SMA SD SC SB SA OA OB
MA 690 212 3 8 913 75.57503 MB 8 171 18 7 204 83.82353
SMB 197 2531 4 50 2782 90.97771 SMA 292 710 66 1068 66.47940 SD 12 35 143 190 75.26316 SC 73 84 468 30 655 71.45038 SB 33 114 1922 5 2074 92.67117 SA 35 1 4152 4188 99.14040 OA 5 18 23 78.26087 OB 2 1 94 97 96.90722
Predicted total 895 183 3282 869 150 708 1954 4157 18 94 12194 83.0549 Predicted/core 0.980 0.897 1.179 0.814 0.789 1.081 0.942 0.993 0.783 0.969
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 1. Structure contour map of the base-Surat unconformity within the study area.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 10. (A) Relative proportion of the ten wireline log facies predicted by MLPC method with the prediction accuracy. (B) Rescaled distance between different wireline log facies calculated from input variable space.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 11. An example of the wireline log facies prediction using the Multilayer Perceptron Classification method compared to core-defined lithofacies in the Condabri MB9-H well. J10: base-Surat unconformity; TS1: the transgressive surface at the top of the Precipice Sandstone; MFS1: maximum flooding surface; SB2: sequence boundary within the Evergreen Formation; J20: unconformity at the approximate base of the Boxvale Sandstone Member; TS3: transgressive surface underlying the Westgrove Ironstone Member; MFS3: maximum flooding surface; J30: top of the Evergreen Formation (Wang et al., in press). Wireline log facies colour codes are the same as Figure 4.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 12. Wireline log facies prediction results for wells within the study area. (A) well location map of study area with colours representing the number of logs used in the neural network facies prediction; (B) Pie chart map showing wireline log facies proportions for the interval between J10 and TS1. The number in the pie charts indicates the cumulative thickness of sandstone in meters; (C) Pie chart map showing wireline log facies proportions for the interval between TS1 and MFS1. The number in the pie charts indicates the cumulative thickness of sandstone in meters; (D) The dominant (thickest) wireline log facies in their interval from TS1 to MFS1, showing different facies zones and potential sediment provenance regions.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 13. (A) Final convergence error. (B) Prediction accuracy based on the input log types used for wireline log facies prediction.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 14. Cross-validation results of the ten wireline log facies from the three key wells: Condabri MB9-H, Reedy Creek MB3-H, and Woleebee Creek GW4.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 15. Facies distribution maps based on wireline log facies determined from neural networks. (A) The Precipice Sandstone. The zero-thickness boundary shown in red is based on seismic interpretation (B) The Lower Evergreen Formation.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 2. Stratigraphic column for the Lower Jurassic in the Surat Basin (after Hoffmann et al., 2009).
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 3. An example of a Markov chain transiogram (modified from Carle, 1999 and Hsieh at al., 2015). The transition probability of F1 passing upward into F2. The point at which the Markov chain levels out is the “sill” and the lag distance at which the Markov chain reaches the sill is the “range”. The transition rate is defined by the slope of the tangent line, and the mean lens length is the lag distance at which the tangent line.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 4. Schematic diagram showing the architecture of the Multilayer perceptron classifier. Facies MA, MB, SMB, SMA, SD, SC, SB, SA, OA and OB are the different types of logging facies; MA, MB: Mudstone facies; SMB, SMA: Heterolithic facies; SD, SC, SB, SA: Sandstone facies; OA, OB: Organics and Miscellaneous facies. See table 1 for facies descriptions.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Note: Ro: Roots; Pa: Palaeophycos; PI: Planolites; Te: Teichichnus; Tm: tempestite (storm bed); Ph:phycosiphon
Fig. 5. Comparison between the core facies and gamma ray log signature for the 20 core facies we observed in the Precipice Sandstone and Evergreen Formation.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 6. (A) Results of liner discriminant analysis in key cored wells with all six well logs that comprise a “full suite”. This shows how the twenty core facies were grouped into ten wireline log facies (indicated by the coloured circles). (B and C) Principal component analysis showing the relative importance of the various input logs to facies differentiation (e.g. GR, PDPE and SONIC). Note: LLD: deep resistivity; PDPE: Photoelectric factor; GR: gamma ray; DEN: bulk density; SONIC: compressional slowness; NEUTRON: neutron porosity.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 7. Rider chart showing the recognition models for the ten types of wireline log facies. The average value of each log in the different wireline log facies is displayed.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 8. Results of the Markov Chain Analysis of the various wireline log facies: (A) Transition frequency matrix; (B) transition probability matrix; (C) independent trials probability matrix; and, (D) difference matrix. Statistically significant transitions have been marked with a black rectangle.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Fig. 9. Conceptual models of the five facies associations determined by MCA. An approximate scale indicates the thickness of individual facies. The arrow shows the transitions from one facies to another. The number on the arrow represents the level of significance of facies transitions (based on the difference matrix). (A) Channel Complex Association. (B) Delta Plain Association. (C) Subaqeous Delta Association. (D) Shoreface Association. (E) Tidal Flats and Channels Association.
MANUSCRIP
T
ACCEPTED
ACCEPTED MANUSCRIPT
Highlights
• Facies analysis of the Precipice Sandstone and Evergreen Formation • Core facies simplified into a set of wireline log facies (WLF) with unique
petrophysical properties using statistical methods. • Facies transition significance determined from Markov Chain Analysis used to
group the WLF into facies associations. • Neural networks trained with core and Markov Chain results for facies prediction
in wells with wireline logs but no core. • Facies prediction from logs demonstrate a high accuracy, and thus provide
important input for reservoir modelling for CO2 storage assessment.