Professional Documents
Culture Documents
ARMA-IGS-2022-184 - Well Log Prediction Using Deep Sequence Learning
ARMA-IGS-2022-184 - Well Log Prediction Using Deep Sequence Learning
ABSTRACT: Sonic logs including compressional and shear travel time logs (DTC and DTS, respectively) are important
measurements for subsurface elastic and geomechanic property characterization. However, these log types are not always measured
in practice or incomplete in many oil and gas wells for economic and other practical reasons. We propose to accurately predict these
types of log data from the traditional common types of well log measurement data using deep sequence learning methods. After
anomalous data removal and augmenting the ratio DTC/DTS as a new lithology differentiating feature, the preprocessed inputs were
fed into a bidirectional Long Short-Term Memory (LSTM) model for training to minimizing the error in predicting the sonic logs.
Once trained, the model is then applied at the target wells to predict the sonic logs. The predicted sonic logs match the actual values
with good accuracy. This entirely data-driven approach significantly reduces dependency on prior domain knowledge, providing a
generalization advantage that enables automated large-scale well log prediction across fields.
2.4 Workflow
The process taking raw well log data from training wells
to predict sonic logs at testing wells using machine
learning models consists of four stages, as shown in
Figure 4. It is worth noting that before training and
applying the trained models to predict sonic logs for new
wells, there are several important steps for data
preprocessing (bad data removal), exploration (additional
features and clustering etc.), and sample preparation (data
scaling and normalization, forming sequence samples and
Fig. 2. Bidirectional long short-term memory (LSTM) based subset partition for training, validation and testing). These
well log prediction. steps ensure the quality of the data samples subsequently
fed into the machine learning models to achieve good
The basic architecture of bidirectional LSTM consists of quality prediction.
two hidden layers of opposite order directions with the
same output, as shown in Figure 2, before they are
combined and fed through an activation layer and a dense
layer to generate sonic log predictions. Structures such as
this aim to learn bidirectional order dependence in
sequence prediction problems. As shown in Figure 3, an
LSTM cell can learn to recognize an important input with
an input gate, store it in the long-term state, and preserve
it for as long as needed, ensure that it is maintained in the
forget gate, and learn to extract it when needed. This
greatly helps the model capture long-term sequence
patterns and has been quite successful in learning time
series, text sequences, and audio clips. It is well suited to
tasks like prediction with sequence data. Because the
properties measured by various well-log techniques are
influenced by properties from the adjacent depth of both
upside and downside, this form of sequence deep-learning Fig. 4. Workflow for deep sequence learning based sonic log
model can be a good fit for the well-log prediction prediction.
problem. Before feeding the log data into the LSTM
model, significant effort was made to remove anomalies The well log data were then formed into sequences of
and choose a good combination of features, as well as samples of the same short length sliding down the depth
experiment with different hyperparameters during model direction, with padding at the boundaries. For data from
training, including input sequence length, number of training wells, we split them into 75% for training and
hidden layers, number of nodes, batch size, dropout 25% for validation, with random partition.
ration, regularization weighting, learning rate, and early
stop (number of epochs). The LSTM model was trained
3. RESULTS AND CONCLUSIONS 𝑅𝑀𝑆𝐸 =
The dataset we use comes from the Equinor Volve field 3.3 Results
data, we applied the machine-learning algorithm and used
Following the workflow in Figure 4, we removed samples
the “easy-to-acquire” conventional logs to predict the
with log data values that are either NaN, out of range or
DTC and DTS logs. A total number of 20,525 data points
detected as anomalous, then applied normalization to
(corresponding to distinct depths) collected from three
scale the data values within the same dynamic range.
wells were used to train the models using machine-
After that we clustered samples using minibatch-K-
learning techniques. Each of the data points has seven
Means, based only on the log data that will be used as
features, which are the conventional “easy-to-acquire”
inputs for prediction, excluding the log data to be
Fig. 5. Well logs from the training set (left to right): Gamma
ray, density, porosity and sonic logs (DTC and DTS)
3.2 Performance metrics
Fig. 7. Training results: predicting sonic logs DTC, DTS from
The prediction performance of the model was evaluated gamma ray, bulk density and neutron porosity.
using root mean squared error (RMSE) as the metric,
shown in the equation below:
REFERENCES