3D Geological Modeling with Graph Neural Networks
3D Geological Modeling with Graph Neural Networks
[Link]
* Michael Hillier
[Link]@[Link]
1
Geological Survey of Canada, 601 Booth Street, Ottawa, ON K1A 0E8, Canada
2
Computational Geoscience and Reservoir Engineering (CGRE) RWTH Aachen University,
Wüllnerstrasse 2, 52062 Aachen, Germany
13
Vol.:(0123456789)
Math Geosci
1 Introduction
13
Math Geosci
13
Math Geosci
Notations and formal definitions are given for the graphs used throughout this paper.
First, the notations for scalar, vector, and matrix quantities are as follows: low-
ercase, bold lowercase, and bold uppercase characters, respectively.
Graphs are represented by G = (V, E), where V is the set of nodes (vertices)
and E is the set of edges (Fig. 1). The set V contains N nodes vi ∈ V and an edge
eij = (vi , vj ) ∈ E connecting from vi to vj . An edge, eij , can be associated with a
weight wij for weighted graphs and wij = 1 for unweighted graphs. The direct
neighborhood of node v, also known as the one-hop neighborhood, is defined as
N(v) = {u ∈ V|(u, v) ∈ E}. The global connectivity structure of a graph can be rep-
resented by an adjacency matrix A ∈ ℝN×N , where Aij = wij if eij ∈ E and Aij = 0
if eij ∉ E . Graphs can be categorized as either directed or undirected. As the name
implies, directed graphs have directions associated with the edges. In contrast, all
edges in undirected graphs are bidirectional—information can go in both directions.
Graphs in our work are undirected and have associated node features X ∈ ℝN×d with
each node xv ∈ ℝd containing relevant data associated with the node.
Fig. 1 Simple undirected graph with an associated adjacency matrix A and node feature matrix X
13
Math Geosci
2.2 GNNs
GNNs are deep learning models using graph structured data for various learn-
ing tasks such as regression, classification, and link prediction performed in an
end-to-end manner. There are four kinds of GNNs: recurrent (Scarselli et al.
2008), convolutional (Kipf and Welling 2016a; Gilmer et al. 2017; Hamilton
et al. 2017b), graph autoencoders (Kipf and Welling 2016b), and spatial–tempo-
ral (Yan et al. 2018). GNNs generate embeddings h which are low-dimensional
vectors that encode high-level representations of various graph elements such as
nodes and edges, as well as graph themselves, encapsulating topological struc-
ture and other relevant associated property information in a compressed form.
Embeddings are used to make predictions for relevant learning task(s), and the
information (e.g., their meaning) encoded within them is task-dependent. During
the training process, the network learns how to encode better embeddings that
capture domain-specific meaning and latent data characteristics. Furthermore,
embeddings are encoded such that similar things are positioned closer together
within the continuous embedding space. This provides a means for measuring
errors during training that can be minimized, producing higher-quality embed-
dings. The higher the quality of the embeddings, the more accurate the resultant
predictions.
The proposed GNN architecture used for three-dimensional structural geologi-
cal modeling is illustrated in Fig. 2. Spatial-based convolutional GNNs are used
to generate graph node embeddings by stacking multiple neural network layers,
each of which computes a convolution at every graph node v . Node embeddings
hv are initialized to node features xv and used as input to the network’s first layer.
Spatial context is provided to the GNN by setting each graph node’s features xv
to the normalized spatial coordinates of the node positioned in three-dimensional
space xv = (̂ yv ,̂zv ). Each normalized Cartesian coordinate is computed by cen-
xv , ̂
tering the data on 𝟎 and rescaling each coordinate by the maximum coordinate
range. For example, the normalized x-coordinate is computed as
Fig. 2 Proposed GNN architecture for three-dimensional structural geological modeling composed of
two GNN modules which generate scalar field predictions and rock unit predictions, respectively. Green
arrows indicate GNN input and red arrows GNN output. Training data (bottom) are associated which
each module. Iso-contours (dashed curves) represent interfaces between stratigraphic layers
13
Math Geosci
( )
xv − xc
̂
xv = ( ) , (1)
max xmax − xmin , ymax − ymin , zmax − zmin ∕2
2.3 Spatial Convolutions
13
Math Geosci
∑ � �
u∈Nvwuv hu W + b
(W ⋆ h)v = ∑ , (2)
u∈Nv wuv
where W ∈ ℝdin ×dout and b ∈ ℝ1×dout are respectively learnable matrices and vectors
used to linearly transform neighbor embeddings hu ∈ ℝ1×din with dimension din into
dimension dout . The edge weights wuv for this convolution are leveraged to incor-
porate additional geometric information (e.g., neighbor distances, tetrahedron vol-
ume) of the tetrahedral mesh into the graph structure. The geometry-dependent edge
weights derived from discrete Laplacian operators (Alexa et al. 2020) are computed
from
1∑
wij = l cot 𝜃kl . (3)
6 ijkl kl
Here the sum is taken over all tetrahedra containing edge eij , lkl is the length of
the edge ekl belonging to tetrahedron indexed by ijkl , and 𝜃kl is the dihedral angle
between the two faces of the tetrahedron containing ekl (Fig. 3).
The second convolution uses a Gaussian mixture model (GMM) (Monti et al.
2017) composed of K parameterized Gaussian kernels
K
1 ∑ 1∑ k ( ) ( )
(W ⋆ h)v = hv W node + b + wuv puv hu W kneigh , wkuv puv
|Nv | K
| | u∈Nv k=1 (4)
( ( )T ( ))
1
= exp − puv − 𝝁k Σ−1 k
p uv − 𝝁 k ,
2
13
Math Geosci
where (W node , W kneigh ) ∈ ℝdin ×dout and b ∈ ℝ1×dout are learnable matrices and vec-
tors to linearly transform the node v and their neighbor u ∈ Nv embeddings
(hv , hu ) ∈ ℝ1×din into dimension dout . GMM-based convolutions associate each edge
connecting two nodes u and v with K weights depending on the edge’s pseudo-coor-
dinate puv ∈ ℝd , and learnable d × 1 vector 𝜇k and d × d matrix Σ−1 k
correspond-
ing to the mean and covariance, respectively, of a Gaussian kernel. Note that our
implementations of GMM-based convolutions do not restrict the covariance Σk to
have diagonal form (Monti et al. 2017)—to ensure the learnable covariance matrix
( definite, we simply)use Σk = Σk Σk . Here we use the pseudo-coordinates
is positive � T
puv = ̂ xu − ̂
xv , ̂ yv ,̂zu − ̂zv ; however, this vectorial variable can also be lever-
yu − ̂
aged for other geometrical information that can be associated to the edge.
Fig. 4 Neighborhood aggregation of target node (red) in a two-layer GNN neural network
13
Math Geosci
dimension dout performed by the spatial convolutions (Eqs. 2, 4). For embeddings,
dout = dembed , while for predictions, dout = dpred.
2.5 Training
Training the neural network involves optimizing associated network variables such that
prediction accuracy is maximized. Since there are very few data observations in com-
parison to the number of nodes in our graphs, we utilize GNNs primarily in a semi-
supervised context where only errors on graph nodes associated with data observations
are computed via loss functions. However, when anisotropy information is available to
describe directions of structural continuity, local errors in modeled orientation can be
efficiently computed on all graph nodes—useful for both global and local anisotropy
modeling. At every iteration of the learning algorithm, errors are minimized through
the process of backpropagation where neural network variables are updated. As itera-
tions increase, node embeddings encode more and more useful high-level information
suitable for the learning objective, thereby maximizing prediction accuracy. Depending
on the type of network prediction (e.g., continuous variable, discrete classes) and learn-
ing objective, loss functions will vary (Sects. 2.6 and 2.7). The resulting loss is simply
the sum of the individual loss functions for corresponding available data constraints.
13
Math Geosci
scalar field s(x) and are obtained by iso-surface extraction methods (Treece et al.
1999). The subset of graph nodes v collocated with the set of sampled interface
points I are given the scalar field constraint
{ }
I = v ∈ V|sv = fv . (6)
The associated loss function for sample interface points measures the squared
difference between the network’s predictions zscalar
v
and the scalar field constraints
sv and sums them
|I|
∑ ( )2
Linterface = zscalar
v
− sv . (7)
v∈I
PTv Pv ∇zscalar
v
= PTv Sv , (8)
( )
where ∇zscalar
v = 𝜕zscalar
v ∕𝜕x, 𝜕zscalar
v ∕𝜕y, 𝜕zscalar
v ∕𝜕z is the estimated gradient of the
scalar field at node v, and matrices Pv and Sv are defined by
[ ]
̂
x −̂ xv ̂ yu − ̂ yv ̂zu − ̂zv
Pv = u
⋮
[ scalar ⋮ scalar ]⋮
z − zv
, ∀u ∈ Nv , (9)
Sv = u
⋮
where x̂,̂
y,̂z are the normalized spatial coordinates of a given node. The estimated
angle between the predicted scalar field gradient at node v and a collocated orienta-
tion vector 𝜶 v
𝜶 v ∙ ∇zscalar
v
cos𝜃vpred = (10)
‖𝜶 v ‖‖∇zscalar
v ‖
is used to measure the angular error compared with the observed angular con-
straint 𝜃vobs . In general, for any angular constraint 𝜃vobs , the loss function
|O|
∑ | |
LOrientation = |cos𝜃vobs − cos𝜃vpred | (11)
| |
v∈O
can be used. For normal data (e.g., nv) 𝜃vobs = 0◦, the resulting loss function
13
Math Geosci
�O�
� nv ∙ ∇zscalar
(12)
v
Lnormal = 1−
v∈O
‖nv ‖‖∇zscalar
v
‖
can be used, and for tangent data (e.g., tv) 𝜃vobs = 90◦,
�O� � scalar �
� � tv ∙ ∇zv �
Ltangent = � �.
� ‖tv ‖‖∇zscalar ‖ � (13)
v∈O � v �
It is important to note that for modeling multiple geological interfaces simulta-
neously, the polarity normal data nv is required to be consistent with the younging
direction (e.g., direction of younger stratigraphy). Tangent data tv can be used to
describe foliation data where planar polarity has no geological meaning as well
as for anisotropic characteristics of structural features such as local plunge vec-
tors (Hillier et al. 2014). For foliation data, a dip vector and strike vector are used
as tangent constraints to describe the planar orientation. Meanwhile, for local or
global anisotropy, a tangent vector is used to describe the direction of the plunge
axis—the loss function is independent of the polarity of the vector due to the
absolute value.
Our GNN architecture can also incorporate rock unit observations commonly
available from drill core and outcrop datasets. This is particularly advantageous
for geological datasets absent of interface observations and where there is an
abundance of rock unit observations. Since rock units are discrete properties this
GNN module is associated with a classification learning task. Given a set rock
unit observations R containing a total of Nclasses possible classes collocated with
a subset of graph nodes, each rock unit observation is assigned a one-hot vector
yv indicating the class to which it belongs. One-hot vectors are vectors whose
elements are all zero except for a single element which is given a value of 1. For
example, for a dataset containing Nclasses = 3, one-hot vectors corresponding to
classes a , b , c are respectively
yav = (1, 0, 0), (14)
The loss function associated with these data is the cross-entropy loss
∑( ( ))
LRockUnit = − yv ∙ log ypred
v
, (15)
v∈R
13
Math Geosci
where ypred
v
is the predicted class probabilities obtained from
� � zunit
unit e vi
ypred
vi = Softmax zvi = ∑ zunit , (16)
vj
j e
vi is the -th element of vector zv , which is the output of the rock unit
where zunit i unit
GNN module. The softmax function is routinely used in machine learning to nor-
malize network outputs to a probability distribution over the number of classes
Nclasses.
3 Case Studies
• Edges eij are associated with geometry-dependent weights wij (Eq. 3).
13
Math Geosci
3.1 Synthetic Example
The first case study is of a conformally folded stratigraphic layered volume where
folds are cylindrical (Fig. 5a). Constraint data composed of 67 interface points and
50 planar orientations are randomly sampled from synthetic interface surfaces. In
addition, 44 rock unit observations sampling six distinct rock units from simulated
drill-hole data are used as constraints for the GNN architecture. A tetrahedral mesh
(Fig. 5b) containing 239,784 tetrahedrons generated from these point constraints was
converted into a graph-structured dataset and used as the architecture’s input. Cou-
pled implicit and geological unit modeling results after 300 epochs of training using
weighted mean convolution (Eq. 2) are presented in Fig. 5c, d, respectively. Note
that “epoch” is the term to indicate one complete feed-forward pass (Algorithm 1)
and one backpropagation pass of all training nodes. Average computational time for
each epoch was approximately 0.05 s. Model performance as measured by loss func-
tions for each constraint type and the average nearest distance between synthetic and
modeled surfaces is tabulated in Table 2. The fact that this distance is Δd = 1.564 m
and that the average stratigraphic thickness is 15 m between interfaces clearly indi-
cates that the proposed GNN architecture is beneficial for three-dimensional struc-
tural geological modeling. By visual comparison, the GNN is nearly identical to the
synthetic model (Fig. 5e). It even maintains the cylindricity of the fold even though
no global plunge information has been used to constrain the model.
3.2 Real‑World Dataset
The second case study utilizes a real-world dataset from the Purcell Anticlino-
rium in the east Kootenay region of Southeastern British Columbia (de Kemp et al.
2016). A subset of the large regional-scale dataset is taken surrounding the Sullivan
mine area where a world-class sedimentary exhalative deposit (SEDEX) is located
along with several deep drill holes sampling several stratigraphic interfaces. Field
observations taken from outcrops sampling these interfaces, as well as bedding ori-
entation (S0), are also available. In total, the dataset contains 103 interface points
sampling six distinct interfaces and 187 bedding observations as shown in Fig. 6a.
Point constraints are heterogeneously sampled with dense drill-hole sampling in a
small portion of the model and a strong horizontal sampling bias (e.g., small vari-
ance in elevation) for the outcrop observations—typical of geological survey and
mineral exploration datasets. Furthermore, bedding observations exhibit large local
13
Math Geosci
Fig. 5 Three-dimensional structural geological modeling results using weighted mean spatial convo-
lution on a synthetic conformally folded (cylindrically) stratigraphic layered model. a Point data con-
straints including interface (spheres), normal orientation (white arrows), and rock unit observations
(triangles) sampled throughout the volume, b tetrahedral mesh constrained by point data constraints illus-
trating constraints are collocated with mesh vertices (inset), c implicit scalar field and d rock unit predic-
tions outputted by corresponding GNN modules after 300 epochs of training, e comparison of synthetic
(white) and modeled (red) surfaces viewed along the global plunge axis
13
Math Geosci
variations sampling multiple scales not necessarily important to the scale of the
region of interest in the model. These orientation data were used to fit a Kent distri-
bution (Kent 1982) and clearly indicate the presence of a folded structure. The fitted
distribution (𝜅 = 8, 𝛽 = 4) has strong ellipticity (e = 2𝛽∕𝜅 = 1) suggesting a more
cylindrically folded structure.
The resulting model presented in Fig. 6b was obtained using GMM spatial con-
volutions (Eq. 4) with one kernel (e.g., K = 1) for each layer of the Scalar Field
GNN module after 150 epochs of training. The average computational time for each
epoch was approximately 0.12 s using a mesh containing 555,396 tetrahedrons. Iso-
surfaces extracted from the scalar field model for the six stratigraphic interfaces are
shown in Fig. 6c, d from two viewing perspectives. Both cross sections (Fig. 6e,
f) highlight the GNN’s fitting characteristics on nearby structural geological data.
For this dataset, the GNN was trained for only 150 epochs to avoid overtraining—
also known as early stopping. With more training the GNN will attempt to transition
from capturing the global features of the model to minimizing the errors associated
to large localized structural variations (typical of real-world noisy datasets). If there
are structural contradictions or hard-to-reconcile features with the given data, as
with this dataset, more training can negatively impact the resulting model. This will
occur in later phases of training when loss function values begin to fluctuate signifi-
cantly from epoch to epoch. Table 3 summarizes the resulting model performance
metrics on this dataset.
4 Discussion
The presented case studies show that the GNN architecture can generate three-
dimensional structural geological models constrained by point data sampling geo-
logical units, stratigraphic interfaces, and planar structural orientation. The first case
study demonstrates that the architecture can perform both implicit and geological
unit modeling that is effective in producing a model that is nearly identical to the
synthetic model. In addition, the coupled scalar field and geological unit GNN mod-
ules of the architecture highlight the ability of the GNN to cope with continuous
and discrete properties simultaneously. Furthermore, its capacity to capture global
anisotropy of the fold without being provided this information from orientation
analysis is notable. In the second case study, we show that the architecture can pro-
duce a geologically reasonable structural model given noisy and hard-to-reconcile
13
Math Geosci
Fig. 6 Structural modeling results using GMM spatial convolutions. a Heterogeneously distributed point
constraints accompanied by fitted Kent distribution (right) of the bedding orientation, b predicted scalar
field outputted by GNN—red rectangular regions indicate cross sections shown in e and f, c and d iso-
surfaces extracted from mesh from two viewing perspectives—large red arrow indicates global plunge
direction determined from orientation analysis
13
Math Geosci
real-world data, and provides promise for general application for constrained three-
dimensional structural geological modeling. The developed architecture is a new
way to model geological structures and is, to the best of the authors’ knowledge, the
first deep learning approach to geological implicit modeling.
The proposed GNN approach opens new opportunities for enhancing three-
dimensional structural geological modeling. This is provided by the approach’s
expressive framework for incorporating interpolation constraints using loss func-
tions and the ability to consider both continuous and discrete properties. Loss func-
tions provide an intuitive mechanism for constraining resulting models by comput-
ing errors between the GNN’s predictions and constraints, which does not require
solving large systems of equations. Any model property (e.g., interfaces, orienta-
tion, rock units) derived from GNN’s predictions representing a structural model can
be compared with data or model constraints using loss functions. Discrete proper-
ties, like rock units, can be handled by GNNs by treating them as a classification
problem. Resulting GNN predictions of these properties are expressed as a vector
containing continuous class probabilities in a range between 0 and 1. The class
of a node is simply determined by the vector element with the largest probability.
Alternatively, rock unit observations can also be expressed as inequality constraints
(Kervadec et al. 2019) using scalar field values like previous implicit approaches
not requiring convex optimization methods. However, treating them as a classifica-
tion problem can be advantageous because it provides the lithostratigraphic model
directly from which additional model metrics can be potentially derived (e.g., area,
volume, thickness, spatial relationships between classes) for other model-based con-
straints. Furthermore, treating discrete properties this way permits integration of
other discrete geological properties that cannot be represented as continuous scalar
values.
Although not demonstrated here, GNNs have been shown to be massively scal-
able by application to graphs containing billions of nodes for web-scale recom-
mender systems (Ying et al. 2018). This is made possible by partitioning a graph
into batches of sampled sub-graphs (Hamilton et al. 2017b) describing k-hop neigh-
borhoods which can be distributed across cloud computing infrastructure for train-
ing and inference. This is a promising direction for massive-scale three-dimensional
geological modeling.
One modeling assumption is imposed by the presented GNN approach. This
assumption, also known as inductive bias, asserts that nearby information is more
useful than distant information for making geologically related predictions. This bias
is provided by the graph’s structure, which for the presented approach is a tetrahe-
dral mesh. Since predictions on graph nodes are derived from recursion aggregation
of neighboring node embeddings, and these neighboring nodes are spatially close,
13
Math Geosci
local information has a greater influence on these predictions than distant informa-
tion. For geological applications, this inductive bias is strongly supported and origi-
nates from Tobler’s first law of geography, which states that closer things are more
related to each other than things that are further away (Tobler 1970). Furthermore,
this law is the foundation for spatial interpolation methods such as kriging.
One potential limitation of the presented GNN approach are challenges with
reproducibility—an aspect of all deep learning methods. Since the network’s learn-
able parameters are initialized randomly, rerunning the algorithm under the same
conditions will not yield the exact same geological model. Normally, resultant mod-
els are quite similar, with small shifts in modeled geological surfaces. However, in
some circumstances the differences can be large if (1) the modeling capacity of the
GNN model is not sufficient (e.g., too few learnable parameters) or (2) a diverse
set of plausible structural geological solutions exist given the supplied constraints.
In these circumstances, there are many local minima in the neural network’s solu-
tion space. In contrast, existing implicit approaches achieve perfect reproducibility
because they are deterministic methods that return the global minimum solution
obtained from a linear system of equations or convex optimization problem. How-
ever, since three-dimensional geological modeling applications suffer from limited
data compared to the volumes they aim to resolve, there is an infinite set of plausible
geological models given the sampled data. Therefore, because the GNN can gener-
ate a family of solutions given the same modeling parameters, it provides a means
for exploring a larger set of plausible geological structures (Jessell et al. 2010).
Challenges with reproducibility of geological models obtained from GNNs could be
addressed by fully characterizing the space of solutions as well as generating model
uncertainty through Bayesian deep learning techniques on graphs (Ryu et al. 2019).
Modeling assumptions imposed by existing implicit approaches can in some
circumstances negatively impact the resultant structural models. All existing
implicit interpolants impose explicit smoothness criteria to obtain a unique inter-
polation. For meshless methods, this is the minimum norm interpolant criterion
that corresponds to the smoothest possible interpolation given the chosen ker-
nel function. On the other hand, for discrete methods, this is the constant scalar
gradient criterion between neighboring tetrahedra. Both smoothness criteria can
lead to poor geological modeling of highly variant structures due to the compro-
mising influence of local variations on the geological reasonableness (de Kemp
et al. 2017) of the model. An example of this behavior is shown in Fig. 7, in
which critical points (vanishing derivatives) within the scalar field create topo-
logical features (Ni et al. 2004) in the modeled interface which are not geologi-
cally reasonable. Although, this issue can be mitigated to varying degrees of
success by approximating solutions (e.g., nugget effect, regression smoothing)
instead of exact interpolation. For GNNs, these explicit smoothness criteria are
not imposed. Instead, smoothness of resulting predictions is implicitly driven by
the recursive spatial convolutions of node embeddings and loss functions associ-
ated with various constraints. Spatial convolutions in GNNs can be viewed as a
generalization of kernel functions in classical implicit interpolation (Gong et al.
2020) whose parameterization is optimized through training on given constraints.
In addition to the smoothness criterion, existing implicit approaches also impose
13
Math Geosci
Fig. 7 Example of a radial basis function (RBF) interpolation yielding a geologically unreasonable
model of an interface (black curve) using interface (blue) and normal constraints (red) sampling a highly
variable structure. For reference, a geologically reasonable interpretation is shown (dotted white curve)
13
Math Geosci
Experience with GNN modeling has led to some practical observations regard-
ing the effect of GNN model parameterization, including the types of graph con-
volution, edge weights, embedding dimension, regularization, and network depth.
The effect of changing the type of convolution, edge weights, and embedding
dimension on representing an overturned fold can be found in the “Appendix”. In
order to represent geological structures, the network requires sufficient modeling
capacity, with complex structures requiring more modeling capacity than simpler
structures. Increasing the network’s modeling capacity can be achieved by using a
spatial convolution that has sufficient parameters as well as increasing the embed-
ding dimension. Using a GMM convolution (Eq. 4), the modeling capacity can be
improved by simply increasing the number of Gaussian kernels K used. It should
be noted that we tested other graph-based convolutions (Fey et al. 2018; Verma
et al. 2018; Thekumparampil et al. 2018) using the PyTorch Geometric library
(Fey and Lenssen 2019), and preliminary results indicate that they are useful for
three-dimensional structural geological modeling. More work, beyond the scope
of this paper, would be required to fully assess their general use for geological
modeling. The use of edge weights, either geometrically derived or dynamically
learned by the network via GMM convolutions, consistently produces better rep-
resentations of geological structures. For regularization, various methods can be
used to avoid overfitting GNN models given noisy or structurally contradictory
data, including early stopping (as per the second case study) as well as L1 and L2
vector space norm regularization of network parameters and dropout (Srivastava
et al. 2014). With respect to network depth, we found no significant improvement
in geological modeling when increased beyond three layers, which is attributed to
a bottlenecking issue in current GNNs (Alon and Yahav 2020). This issue results
from information loss during successive spatial convolutions, each of which
compresses information into fixed-size embedding vectors. As network depth is
increased, the amount of information that needs to be compressed grows expo-
nentially, resulting in diminishing returns in prediction accuracy.
5 Conclusions
13
Math Geosci
Acknowledgements This research is part of and funded by the Canada 3D initiative at the Geological
Survey of Canada. We gratefully acknowledge and appreciate the productive discussions with colleagues
from RWTH Aachen University and the Loop 3D project (Australian Research Council: LP170100985).
We thank two anonymous reviewers for their comments and suggestions which greatly improved the
manuscript. NRCan Contribution Number 20200743.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License,
which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as
you give appropriate credit to the original author(s) and the source, provide a link to the Creative Com-
mons licence, and indicate if changes were made. The images or other third party material in this article
are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is
not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission
directly from the copyright holder. To view a copy of this licence, visit [Link]
ses/by/4.0/.
To demonstrate the effect of using different model parameters within the pre-
sented graph neural network architecture, eight three-dimensional structural geo-
logical models are produced of a synthetic overturned fold (Fig. 8) using param-
eterization summarized in Table 4. Each structural model was produced after 150
epochs of training using a network depth of 3. Results show that GMM convolu-
tions (Fig. 8f–i) better reproduce the overturned fold as compared to mean-based
convolutions (Fig. 8b–e) due to increased modeling capacity and the ability to
dynamically learn optimal edge weights for modeling. When no edge weights are
used (Fig. 8c), the resulting iso-surfaces are not as smooth as when the geometri-
cally derived weights are used (Fig. 8b). In both cases, the use of a low embedding
13
Math Geosci
13
Math Geosci
Table 4 Modeling parameters used for each of the subfigures in Fig. 8 and associated error metrics
Linterface and Lnormal from each modeling result
Subfigure Graph convolution type Edge weights Embedding Linterface Lnormal
dimension
dimension (Fig. 8d, h) results in a modeled structure that is far from optimal, with
notable creases throughout. On the other hand, when a high embedding dimension
(Fig. 8e, i) is used, the resultant modeled structure is much smoother. Because of
an increase in learnable parameters when using a high embedding dimension, 150
epochs of training is insufficient to minimize the errors on the interface constraints
when using a mean-based convolution (Fig. 8e). Interestingly, this is not the case for
GMM-based convolution (Fig. 8i).
References
Alexa M, Herholz P, Kohlbrenner M, Sorkine-Hornung O (2020) Properties of laplace operators for tetra-
hedral meshes. Comput Graph Forum 39(5):55–68. [Link]
Allamanis M, Brockschmidt M, Khademi M (2017) Learning to represent programs with graphs. arXiv
preprint [Link]
Alon U, Yahav E (2020) On the bottleneck of graph neural networks and its practical implications. arXiv
preprint [Link]
Bronstein MM, Bruna J, LeCun Y, Szlam A, Vandergheynst P (2017) Geometric deep learning: going
beyond Euclidean data. IEEE Signal Process Mag 34(4):18–42
Calcagno P, Chiles JP, Courrioux G, Guillen A (2008) Geological modelling from field data and geologi-
cal knowledge Part I. Modelling method coupling 3D potential-field interpolation and geological
rules. Phys Earth Planet Inter 171(1–4):147–157
Caumon G, Gray G, Antoine C, Titeux MO (2013) Three-dimensional implicit stratigraphic model build-
ing from remote sensing data on tetrahedral meshes: theory and application to a regional model of
La Popa basin, NE Mexico. IEEE Trans Geosci Remote Sens 51(3):1613–1621
Correa CD, Hero R, Ma KL (2009) A comparison of gradient estimation methods for volume rendering
on unstructured meshes. IEEE Trans Vis Comp Graph 17(3):305–319
de Kemp EA, Schetselaar EM, Hillier MJ, Lydon JW, Ransom PW (2016) Assessing the workflow for
regional-scale 3D geologic modeling: an example from the Sullivan time horizon, Purcell Anticlino-
rium East Kootenay region, southeastern British Columbia. Interpretation 4(3):SM33–SM50
de Kemp EA, Jessell MW, Aillères L, Schetselaar EM, Hillier M, Lindsay MD, Brodaric B (2017)
Earth model construction in challenging geologic terrain: designing workflows and algorithms
that make sense. In: Proceedings of exploration 17: sixth decennial international conference on
mineral exploration, pp 419–439
de la Varga M, Wellmann F (2016) Structural geologic modeling as an inference problem: a Bayesian
perspective. Interpretation 4(3):SM1–SM16
13
Math Geosci
Dubrule O, Kostov C (1986) An interpolation method taking into account inequality constraints: I.
Methodology. Math Geol 18(1):33–51
Duvenaud DK, Maclaurin D, Iparraguirre J, Bombarell R, Hirzel T, Aspuru-Guzik A, Adams RP
(2015) Convolutional networks on graphs for learning molecular fingerprints. In: Adv Neural Inf
Process Syst, pp 2224–2232
Fey M, Lenssen JE, Weichert F, Müller H (2018) SplineCNN: fast geometric deep learning with con-
tinuous B-spline kernels. In: Proceedings of the CVPR, pp 869–877
Fey M, Lenssen JE (2019) Fast graph representation learning with PyTorch Geometric. arXiv preprint
[Link]
Frank T, Tertois A-L, Mallet JL (2007) 3D-reconstruction of complex geological interfaces from
irregularly distributed and noisy point data. Comput Geosci 33(7):932–338
Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE (2017) Neural message passing for quantum
chemistry. arXiv preprint [Link]
Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks.
In: Proceedings of the thirteenth international conference on artificial intelligence and statistics,
pp 249–256
Gong S, Bahri M, Bronstein M, Zafeiriou S (2020) Geometrically principled connections in graph
neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern
recognition, pp 11415–11424
Grose L, Ailleres L, Laurent G, Armit R, Jessell M (2019) Inversion of geological knowledge for fold
geometry. J Struct Geol 119:1–14
Hamilton WL, Ying R, Leskovec J (2017a) Representation learning on graphs: Methods and applica-
tions. arXiv preprint [Link]
Hamilton WL, Ying R, Leskovec J (2017b) Inductive representation learning on large graphs. In: Adv
Neural Inf Process Syst, pp 1024–1034
Hang S (2015) TetGen, a Delaunay-based quality tetrahedral mesh generator. ACM Trans Math Softw
41(2):1–36
Hillier MJ, Schetselaar EM, de Kemp EA, Perron G (2014) Three-dimensional modelling of geo-
logical surfaces using generalized interpolation with radial basis functions. Math Geosci
46(8):931–953
Jessell MW, Ailleres L, de Kemp EA (2010) Towards an integrated inversion of geoscientific data:
what price of geology? Tectonophysics 490(3–4):294–306
Kent JT (1982) The Fisher-Bingham distribution on the sphere. J R Stat Soc Ser B Methodol 44(1):71–80
Kervadec H, Dolz J, Tang M, Granger E, Boykov Y, Ayed IB (2019) Constrained-CNN losses for weakly
supervised segmentation. Med Image Anal 54:88–99
Kipf TN, Welling M (2016a) Semi-supervised classification with graph convolutional networks. arXiv
preprint [Link]
Kipf TN, Welling M (2016b) Variational graph auto-encoders. arXiv preprint [Link]
07308
Lajaunie C, Courrioux G, Manuel L (1997) Foliation fields and 3D cartography in geology: principles of
a method based on potential interpolation. Math Geol 29(4):571–584
Laurent G, Ailleres L, Grose L, Caumon G, Jessell M, Armit R (2016) Implicit modeling of folds and
overprinting deformation. Earth Planet Sci Lett 456:26–38
Loshchilov I, Hutter F (2017) Decoupled weight decay regularization. arXiv preprint [Link]
abs/1711.05101
Monti F, Boscaini D, Masci J, Rodolà E, Svoboda J, Bronstein MM (2017) Geometric deep learning on
graphs and manifolds using mixture model cnns. In: Proceedings of the CVPR, pp 5115–5124
Ni X, Garland M, Hart J (2004) Fair Morse functions for extracting the topological structure of a surface
mesh. ACM Trans Graph 23(3):613–622
Qi X, Liao R, Jia J, Fidler S, Urtasun R (2017) 3d graph neural networks for rgbd semantic segmentation.
In: Proceedings of the ICCV, pp 5199–5208
Ryu S, Kwon Y, Kim WY (2019) A Bayesian graph convolutional network for reliable prediction of
molecular properties with uncertainty quantification. Chem Sci 10(36):8438–8446
Scarselli F, Gori M, Tsoi AC, Hagenbuchner M, Monfardini G (2008) The graph neural network model.
IEEE Trans Neural Netw 20(1):61–80
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to
prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958
13
Math Geosci
Thekumparampil KK, Wang C, Oh S, Li LJ (2018) Attention-based Graph Neural Network for Semi-
supervised Learning. arXiv preprint [Link]
Thiele ST, Jessel MW, Lindsay M, Ogarko V, Wellmann F, Pakyuz-Charrier E (2016) The topology of
geology 1: topological analysis. J Struct Geo 91:27–38
Tobler WR (1970) A computer movie simulating urban growth in the Detroit region. Econ Geogr
46:234–240
Treece GM, Prager RW, Gee AH (1999) Regularized marching tetrahedral: improved iso-surface extrac-
tion. Comput Graph 23(4):583–598
Verma N, Boyer E, Verbeek J (2018) FeaStNet: feature-steered graph convolutions for 3D shape analysis.
In: Proceedings of the CVPR, pp 2598–2606
Wellmann F, Caumon G (2018) 3-D Structural geological models: Concepts, methods, and uncertainties.
Adv Geophys 59:1–121
Wu Z, Pan S, Chen F, Long G, Zhang C, Philip SY (2020) A comprehensive survey on graph neural net-
works. arXiv preprint [Link]
Yan S, Xiong Y, Lin D (2018) Spatial temporal graph convolutional networks for skeleton-based action
recognition. arXiv preprint [Link]
Ying R, He R, Chen K, Eksombatchai P, Hamilton WL, Leskovec J (2018) Graph convolutional neural
networks for web-scale recommender systems. In: Proceedings of the 24th ACM SIGKDD interna-
tional conference on knowledge discovery & data mining, pp 974–983
13
GNNs benefit geological modeling by handling complex structural information and relationships better than CNNs . They allow representation of data in graph form, which is ideal for geological applications that require flexibility in topology and constraint handling . GNNs use interpolation constraints and classification for discrete properties, enhancing modeling of stratigraphic relationships without the need for large-scale optimization .
Angular constraints allow for orientation relationships that go beyond simple normal and tangent data, accommodating non-orthogonal angular relationships, such as bedding-cleavage configurations . This expands the capability for geological modeling to represent complex orientational controls without being restricted to unit normals or equal norm values for stratigraphic features, as in traditional methods .
GNNs are scalable due to their capability to partition graphs into manageable sub-graphs based on k-hop neighborhoods, facilitating distributed processing across cloud infrastructure . This reduces computational demand and allows GNNs to handle datasets with billions of nodes efficiently, which is beneficial for extensive geological modeling tasks .
Highly deformed settings pose challenges for GNNs because variations in stratigraphic thicknesses can be difficult to model when scalar gradient constraints always equal a constant normal vector . Additionally, existing approaches fail to incorporate angular controls without arbitrarily chosen norms, which might restrict accurate modeling in such complex settings .
The inductive bias in GNNs assumes that nearby information is more useful than distant information for predictions, benefiting geological models by focusing on local topological constraints . However, this may limit the ability to accurately predict features influenced by distant data, potentially restricting model applicability in certain complex geological settings .
Loss functions in GNNs measure the error between predictions and constraints, eliminating the need for solving large systems of equations typical in traditional models . They enable the integration of both continuous and discrete properties into the model, allowing for a more expressive and flexible representation of geological features .
GNNs omit explicit smoothness criteria, using recursive spatial convolutions and constraint-driven loss functions to naturally achieve smooth predictions . This implicit handling allows more flexibility in adapting the model to available data without enforcing potentially unrealistic constraints, improving the quality of geological interpretations .
Tetrahedral meshes allow GNNs to represent geological data in three dimensions effectively, by aligning with the natural stratification and orientation of geological features . They can adaptively vary in granular resolution, accommodating heterogeneous datasets and optimizing data storage while maintaining high resolution where necessary .
GNNs handle discrete geological properties as classification problems, where properties are expressed as vectors containing continuous class probabilities. This direct approach enables GNNs to derive metrics like area and volume from predictions without reliance on convex optimization methods, contrasting previous methods using inequality constraints on scalar values .
Graph-based topology allows the representation of relationships between geological features such as stratigraphy and faults within the graph's structure. This can be exploited to incorporate topological constraints, enhancing the models' realism and fidelity to geological data . The ability to model these relationships in graph form is one of the primary advantages of using GNNs over traditional methods .