Forecasting Rare Earth Stock Prices With Machine Learning

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

Resources Policy 86 (2023) 104248

Contents lists available at ScienceDirect

Resources Policy
journal homepage: www.elsevier.com/locate/resourpol

Forecasting rare earth stock prices with machine learning


Irene Henriques, Perry Sadorsky *
Schulich School of Business, York University, 4700 Keele Street, Toronto, Ontario, M3J 1P3, Canada

A R T I C L E I N F O A B S T R A C T

Keywords: Rare earth elements (REEs) are indispensable for producing green technologies and electronics. Demand for REEs
Machine learning in clean energy technologies in 2040 are projected to be three to seven times higher than today and will be
Random forests critical to the clean technology transition needed to stave off catastrophic climate change. Forecasting rare earth
Forecasting
stock prices is critical for making well informed investment decisions concerning this important asset class.
Rare earth elements
Despite the latter, the literature on forecasting rare earth stock prices is scarce. We use machine learning
techniques to forecast daily rare earth stock price direction. The analysis reveals that random forests, extremely
randomized trees, RNN, and support vector machine have higher prediction accuracy than Lasso or Naïve Bayes.
We find that the 10- to 20-day forecasts using random forests, extremely randomized trees, and support vector
machine achieve prediction accuracies greater than 85% with some prediction accuracy reaching 90%. Lasso
prediction accuracy is higher than Naïve Bayes but never greater than 67%. The MA200, MA50, on balance
volume, VIX, and WAD are the most important predictive features of rare earth stock price direction. A switching
portfolio that uses trading signals from an Extra Trees model impressively outperforms a buy and hold portfolio.
Our results reveal the high prediction accuracy of using machine learning methods in forecasting rare earth stock
price direction which should be useful to investors, policy makers and venture capitalists.

1. Introduction transition needed to stave off catastrophic climate change.


Henriques and Böhm (2022) provide a historical analysis of the rare
The modern-day green technologies needed for the transition away earth industry. From 1920 to 1949, the rare-earth industry began as a
from fossil fuels such as clean energy, semiconductors and batteries military race for rare earths, followed by the United States dominating,
would not be viable without rare earths (Jyothi et al., 2020). So, what then losing the rare-earth market to China (1950–1990) and where from
are some of the uses of rare earths? Rare earth elements (REEs) include 1990 to 2009 China’s dominance continued nearly unchallenged.
Neodymium (Nd) – used in wind turbines, cell phones, computer hard Today, a worldwide clean-technology race for rare-earths propelled by
drives and hybrid/electric vehicles; Samarium (Sm) – used in guidance countries accelerating their investments in clean energy systems and by
systems and high-powered magnets; Cerium (Ce) – used in cell phones, the increased demand for electric cars, solar cells, wind turbines, and
rechargeable batteries, cameras; Scandium (Sc) – used in aluminum al­ battery power all requiring various REEs has begun (Bo et al., 2019;
loys and aerospace components; Yttrium (Y) – used in efficient light International Energy Agency, 2021). To ensure a smooth energy tran­
bulbs, and LCD and plasma screens; Europium (Eu) – used in LCD and sition to clean technologies, sufficient investment in diversifying rare
plasma screens, and optical electronics; Terbium (Tb) – used in cordless earth suppliers and boosting supply chain market resilience and trans­
power tools and guidance systems; Lanthanum (La) – used in catalytic parency are critical (International Energy Agency, 2021).
converters, hybrid and electric-car batteries, and cameras; and Gado­ The International Energy Agency (2021) emphasizes the importance
linium (Gd) – used in MRI contrast agents and cancer therapy (Henri­ of government policy in increasing investment in rare earths. Govern­
ques and Böhm, 2022; Hsu, 2019; Kanazawa and Kamitani, 2006). The ments seeking to increase investment in rare earths have an interest in
IEA (International Energy Agency, 2021, p. 50) has estimated that, rare earth stock prices for two main reasons. First, governments seeking
relative to 2021, the demand for REEs in clean energy technologies to increase rare earth mining need leading indicators of the industry in
(primarily for electric motor vehicles and wind turbines) in 2040 may be order to craft their resources policy. Rare earth stock prices are a useful
three to seven times higher and will be critical to the clean technology leading indicator of the rare earth industry. Rare earth stock price

* Corresponding author.
E-mail addresses: ihenriques@schulich.yorku.ca (I. Henriques), psadorsky@schulich.yorku.ca (P. Sadorsky).

https://doi.org/10.1016/j.resourpol.2023.104248
Received 1 August 2022; Received in revised form 2 May 2023; Accepted 1 October 2023
Available online 10 October 2023
0301-4207/© 2023 Elsevier Ltd. All rights reserved.
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

forecasts are an important component to government mining policy that between the nodes. Rather than relying on just one machine learning
determines mining royalty payments and licensing fees. method, six methods are used. This provides a comparison between
Second, governments may also play an active role investing in rare methods and adds an element of robustness to the analysis.
earths through a Sovereign Wealth Fund (SWF). A SWF is a state-owned Third, we include technical indicators and market volatility as fea­
investment fund financed, in many cases, by natural resource rents. For tures in our analysis. The literature on predicting asset prices has
example, in 2020 Angola’s SWF made an investment in Pensana Rare identified the importance of technical indicators (Bustos and
Earths to help develop their rare earths mining project.1 SWFs have Pomares-Quimbaya, 2020; Neely et al., 2014; Y. Wang et al., 2020b; Yin
specific rules of governance which are set up by their sponsoring gov­ et al., 2017; Yin and Yang, 2016) and stock price direction (Basak et al.,
ernment. For example, the risk and return profile of SWF investments 2019; Sadorsky, 2021a, 2021b, 2022). Asset market volatility can affect
may be determined by government policy. Setting and monitoring risk asset prices and the decision to invest. Higher volatility is associated
and return objectives require accurate forecasts of the underlying assets. with higher uncertainty which can increase the cost of capital and create
Rare earths do not have traded futures. As an alternative, investing and incentives to postpone investment. This lowers growth forecasts, ex­
risk management can be conducted using the stocks of companies pected profitability and equity returns.
engaged in the extraction and refining of rare earths.2 For example, a Fourth, we calculate multistep forecasts for one-day to twenty-days.
direct investment in rare earths cannot be hedged with futures but can A five-day forecast represents a week of trading days while a 10-day
be hedged with an investment in rare earth stocks. Forecasts of rare forecast represents two weeks of trading days. A 20-day forecast corre­
earth stock prices are crucial for understanding rare earth industry sponds to roughly one month of trading days. A multistep forecast ho­
trends and the risk management of rare earth investments. rizon offers useful information on how prediction accuracy varies across
The importance of rare earth stock price forecasting raises several different forecast horizons.
questions. First, which methods produce the most accurate forecasts of Fifth, price direction forecasts are used to construct an equity port­
rare earth stock prices? Second, how does forecast accuracy vary with folio where an investment in rare earth stocks is made if the price di­
the forecast horizon? Third, which features are most important for rection forecast is positive and no investment made otherwise. This
predicting rare earth stock prices? Fourth, how does a trading rule switching portfolio is compared to a buy and hold portfolio.
constructed from a machine learner compare with a buy and hold Several important findings emerge from our analysis. First, random
portfolio? forests, extremely randomized trees, and support vector machine have
To address these research questions, we use the following approach. higher prediction accuracy than Lasso or Naïve Bayes. Random forests,
First, we predict rare earth stock price direction. The reason for focusing extremely randomized trees, and support vector machine each have
on stock price direction instead of stock prices is because of the signif­ prediction accuracies greater than 85% for 10-day to 20-day forecasts.
icance of market timing. Fund managers and investors are most drawn to Second, Shapley feature importance indicates WAD, MA200, VIX, MA50
the sign of asset prices as opposed to their actual value when making a and on balance volume are among the most important features for 5-day,
tactical asset allocation decision (Pesaran and Timmermann, 2002). For 10-day and 20-day forecast horizons. Of the non-technical indicators,
example, portfolio managers may overweight assets that have a pre­ VIX is, on average, the most important feature. Third, a switching
dicted positive trend and underweight or even short assets that have a portfolio that uses trading signals from an Extra Trees model out­
negative predicted trend. Interestingly, research has demonstrated that performs a buy and hold portfolio.
asset price direction can be predicted with significant precision (Ballings The remainder of this paper proceeds as follows. Section 2 presents
et al., 2015; Basak et al., 2019; Leung et al., 2000; Lohrmann and some background literature. Section 3 describes our methodology. The
Luukka, 2019; Nyberg, 2011; Nyberg and Pönkä, 2016; Pönkä, 2016). data are examined in Section 4 while Section 5 presents our empirical
Second, we forecast rare earth stock prices using machine learning results on forecast accuracy, feature importance and an equity portfolio
techniques. Machine learning methods/techniques are useful when the comparison. A discussion of the results and conclusion are found in
data are complex. As discussed in the first paragraph of the introduction, Sections 6 and 7 respectively.
the demand for rare earths is driven by the automotive (electric vehi­
cles), clean energy, defense, health, and technology sectors. The supply 2. Background literature
of rare earths is heavily dominated by just a few countries (Henriques
and Böhm, 2022; J. Wang et al., 2020a). Coupled with demand and The existing literature has examined the impact of rare earth prices
supply conditions; supply chain disruptions (Xia et al., 2023), national on clean energy stock prices, renewable energy prices or the relationship
security issues (Salim et al., 2022), and geopolitics (Shuai et al., 2023) between rare earth stock prices and other asset prices. Baldi et al. (2014)
create complex dynamics for rare earth prices and rare earth stock pri­ use multifactor models to help understand the relationship between the
ces. The metal price forecasting literature has found that metal price prices of two rare earths (neodymium and dysprosium) on six clean
dynamics are complex and high forecasting accuracy can be achieved energy stock prices indices. Rising prices of dysprosium and neodymium
using machine learning methods like decision trees and random forests were found to have a negative impact on clean energy stock prices.
(Díaz et al., 2020; Liu et al., 2017; Zhang et al., 2021), support vector Apergis and Apergis (2017) use cointegration techniques to investigate
machine (Zhang et al., 2021), and neural networks (Khoshalan et al., the relationship between rare earth prices and the consumption of
2021; Sánchez Lasheras et al., 2015; Shi et al., 2023; Zhang et al., 2021). renewable energy. They find a negative relationship between rare earth
Building on this literature, the machine learning methods employed in prices and renewable energy consumption both at the local and global
this paper include random forests, extremely randomized trees (Extra levels.
Trees), least absolute shrinkage and selection operator (Lasso), support Fernandez (2017a) analyses selected rare earth prices and rare earth
vector machine (SVM), recurrent neural networks (RNNs), and Naïve stock prices. Systematic risk (beta) varies across time and by market
Bayes. Random forests and extremely randomized trees are ensembles of index. Rare earth stock price betas show considerable variation ranging
decision trees. SVM partitions the data using separating hyperplanes. between − 1.0 and 4.0. Rare earth stock prices tend to track rare earth
RNNs are a neural network that uses sequential pattern recognition commodity prices closely but there is a low correlation between rare
earth stock prices and the prices of other commodities. Chen et al.
(2020) use multivariate GARCH (BEKK, DCC) to examine the volatility
1
https://www.reuters.com/article/pensana-rare-equity-angola-idAF spillovers and dynamic correlations between international crude oil,
L4N2GO0TR. new energy, and rare earth markets in China. They find high spillovers
2
Fernandez (2017a) finds that rare earth stock prices tend to track rare earth between new energy and rare earth markets. The rare earth market is
commodity prices closely. found to be the main source of these spillovers. Reboredo and Ugolini

2
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

(2020) use Markov switching models and vector autoregression (VAR) gradient boosting tree (GBT) and random forest to predict monthly
to study the price transmission between global stock prices, rare earth copper prices. They find that MLP has the highest accuracy. Shi et al.
stocks, base metal stocks, clean energy stocks, oil prices, and gold prices. (2023) use LSTM-GRU and LSTM-CNN models to forecast daily metal
In a low volatility regime, rare earth stocks prices are closely associated prices. They find that these methods outperform particle swarm opti­
with base metal stock prices and weakly linked to gold, oil, clean energy mization and the genetic algorithm. This discussion on forecasting
and global stock prices. In a high volatility regime, however, there is methods for metal prices indicates that, overall, nonlinear methods have
considerable co-movement between rare earth stock prices and the other higher prediction accuracy.
assets. Bouri et al. (2021) study the dynamic return and volatility rela­ Our literature review suggests that there is a knowledge gap with
tionship between rare earth stock prices and the stock prices of clean respect to forecasting rare earth stock prices. The next section of the
energy, health care equipment, telecommunications, consumer elec­ paper describes the methods used for predicting rare earth stock prices.
tronics, and aerospace and defense. They use quantile connectedness to
investigate the tail dependence between these assets over the full sample 3. Methods
period as well as the COVID-19 period. Tail dependence increases during
periods of stress, but rare earths are receivers of spillovers for all of the The following methods are employed to predict the direction of rare
periods studied. Song et al. (2021) employ time-varying parameter earth stock prices.
vector autoregression (TVP-VAR) to investigate the dynamic connect­
edness between rare earth stocks, clean energy stocks, world equities, 3.1. Naïve Bayes
base metals, crude oil, and gold. They find that the volatility connect­
edness and return vary over the sample period. The rare earths equities The Naïve Bayes classifier is used to model and forecast a binary
are mostly receivers of spillovers. Rare earth stocks have also exhibited a response variable (James et al., 2022). Here the direction of stock price
stronger connectedness with other assets during the COVID-19 period. changes can be classified as a binary variable. Positive stock price
Zheng et al. (2021) use Chinese company level data to study asymmetric changes from one period to another are classified as up. Non-positive
spillovers and connectedness between rare earth markets and new en­ stock price changes from one period to another are classified as down.
ergy. They find that there exists a risk transfer between renewable en­ The Naïve Bayes classifier uses Bayes theorem to make predictions.
ergy markets and rare earth markets and that the network structure of Naïve Bayes models are easy to estimate and are often used as a
risk transfers changes between firms across time. benchmark. In this paper, multi-step forecasts are produced to deter­
Several different methods have been used to forecast metal prices. mine how forecast accuracy changes with the forecast horizon. Features
Some representative examples include the following. Dooley and Leni­ include technical indicators and asset market volatility. Technical in­
han (2005) find that ARIMA has slightly higher accuracy than lagged dicators are important predictors of asset prices (Bustos and
forward models when predicting lead and zinc prices. Chen et al. (2010) Pomares-Quimbaya, 2020; Neely et al., 2014; Y. Wang et al., 2020b; Yin
find that the exchange rates of major commodity producing countries et al., 2017; Yin and Yang, 2016) and stock price direction (Basak et al.,
are good predictors of metal prices. Kriechbaumer et al. (2014) combine 2019; Sadorsky, 2021a, 2021b, 2022). Asset market volatility affects
wavelets with ARIMA to forecast the prices of aluminum, copper, lead, risk which in turn can affect asset prices (Banerjee et al., 2007; Brandt
and zinc. Wavelet multiresolution helps to increase the predictive ac­ and Kang, 2004; Cederburg et al., 2023).
curacy of the ARIMA models. He et al. (2015) find that a Curvelet
forecasting algorithm has higher metal price prediction accuracy than a 3.2. The lasso
benchmark model. Chen et al. (2016) find that for forecasting aluminum
and nickel prices, a gray wave forecasting model outperforms an ARIMA Another method is the Least absolute shrinkage and selection oper­
model. Fernandez (2017b) finds that the information content of futures ator (Lasso). Lasso is a shrinkage estimator which is similar to ridge
prices is somewhat limited for forecasting industrial metals prices. regression (James et al., 2022). Whereas ridge regression shrinks
Pincheira Brown and Hardy (2019) find that the Chilean exchange rate parameter estimates down to small but non-zero values, Lasso may set
has the ability to predict six base metals (aluminum, copper, lead, parameter values equal to zero resulting in a sparse model. Compared to
nickel, tin, and zinc) that trade on the London Metal Exchange. For regression or ridge regression, prediction accuracy can be enhanced by
forecasting aluminum, copper, lead, nickel and zinc prices, Fernandez using Lasso.
(2020) finds that the convenience yield and detrended oil price have
higher predictive power than interest rates and exchange rates for one to 3.3. Random forests
twelve month out-of-sample forecasts. Rubaszek et al. (2020) find that
while mean reverting models outperform a random walk for forecasting Random forests (RFs) are ensembles of decision trees. The problem
aluminum, copper, nickel, and zinc adding a threshold structure does with decision trees, however, is that they suffer from high variance as
not increase forecasting performance. Kwas et al. (2021) use dynamic modest changes in the data can produce large differences in the pre­
factor models, vector autoregression (VAR), and a random walk to dictions. Random forests reduce variance by creating decorrelation be­
forecast the real prices of aluminum, copper, nickel and zinc. They find tween the trees (Breiman, 2001). More specifically, a large number of
that for aluminum, nickel and zinc VAR models outperform the random decision trees are constructed using bootstrapped training samples. At
walk and dynamic factor models. each split in a tree, a random sample of predictors is selected from the
There is a developing literature that uses machine learning to fore­ full set of predictors. The number of predictors selected is determined by
cast metal prices. Sánchez Lasheras et al. (2015) find that neural net­ the floor of the square root of the total number of predictors (James
works outperform ARIMA models when forecasting COMEX copper spot et al., 2022). Although it may appear unintuitive to choose predictors at
prices. Liu et al. (2017) find that decision trees produce high accuracy random, taking the average of results from non-correlated trees is more
for predicting copper prices. Building on the work of Liu et al. (2017), efficient at reducing variance than averaging highly correlated trees.
Díaz et al. (2020) forecast copper prices using decision trees, random
forests, and gradient boosting. The random walk model, however, offers 3.4. Extremely randomized trees
comparable forecast accuracy. Khoshalan et al. (2021) found that
compared to gene expression programming, adaptive neo-fuzzy infer­ Extremely randomized trees which are often referred to as Extra
ence system, and ant colony optimization, ANN is better able to predict Trees, is a decision tree ensemble method akin to random forests (Geurts
copper prices. Zhang et al. (2021) use multi-layer perception (MLP) et al., 2006). Similar to RFs, at each split Extra Trees selects a randomly
neural network, k-nearest neighbors (KNN), support vector machine, chosen subset of predictors, but unlike RFs, which chooses the best fit for

3
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

each predictor, Extra Trees chooses the best fit from a randomly chosen SVM. The number of epochs was set at 800. A rectified linear unit
small number of split points for each of the selected predictors. In the activation function was used. The objective criterion is cross entropy
simplest case, one random split is chosen for each randomly selected loss and the optimizer was Adam. The learning rate was 0.01. The input
feature. Extra Trees subsequently selects the best split from the smaller layer has 17 nodes (17 features) and the output layer has 2 nodes
number of choices. This ensemble of trees comprises trees that have (classification). For the hidden layers, several different configurations
greater variability but have less correlation than the trees derived from were tried. The number of hidden layers was varied between two and
the random forests method (Geurts et al., 2006). Extra Trees requires three. The number of nodes per layer varied between 8 and 16. The final
less computational time than RFs because splits are randomly chosen. model was 17-8-16-2.
Different measures of forecast accuracy can be constructed from the
3.5. Support vector machine confusion matrix. One such measure is prediction accuracy. Prediction
accuracy is calculated as the number of true positives and true negatives
SVM is a machine learning method/technique that works by parti­ divided by the total number of predictions. Prediction accuracy ranges
tioning a dataset into clusters employing a boundary. This boundary is a from zero to unity and is sometimes expressed as a percentage. Another
hyperplane (James et al., 2022). Many hyperplanes can be used to measure is the kappa statistic which adjusts forecast accuracy by ac­
separate the data and what is of greatest interest is the optimal hyper­ counting for the possibility of a chance occurrence of a correct predic­
plane or maximum margin hyperplane (MMH). The MMH is the hy­ tion. Consequently, the kappa statistic is a better accuracy indicator than
perplane that has the biggest separation between the groups of data and the standard accuracy measure when the classification data are unbal­
located using a search algorithm. Support vectors are defined as the anced. The F1 score, sometimes called the F measure, is a performance
points from each cluster that are closest to the MMH. Support vectors measure combining precision and recall into a single number. The F1
can be easily discovered for linear separable data but, in the case of score is a convenient measure for comparing models and like kappa, is a
nonlinear separable data, more advanced vector geometry is required. better measure of prediction accuracy when classification data are
More specifically, data are mapped into higher dimensional space using unbalanced.
kernels. The benefit of kernel mapping is that a nonlinear relationship in Variable importance is explained using Shapley values. Conditional
a low dimensional space may be linear in higher dimensional space game theory provides the theoretical justification for computing Shapley
thereby reducing complexity. values. Basically, the Shapley value represents the contribution of each
variable towards the predicted value compared to the average predic­
3.6. Recurrent neural network tion for the data set (Lundberg and Lee, 2016, 2017). Consider the
calculation of a Shapley value for one feature, say X1. The accuracy of
Recurrent neural network (RNN) is a machine learning method that every combination of features not including X1 is recorded and then a
uses a sequential approach to using data (James et al., 2022). RNNs take comparison is conducted to see how adding X1 to each combination
information from prior inputs and use this information to update the improves the accuracy. Shapley values are more robust and theoretically
learning process to generate outputs. The learning algorithm uses back consistent than a permutation based variable importance approach. The
propagation and gradient descent. An RNN has an input layer, an output tradeoff is that Shapley values are computationally intensive which
layer, and usually several hidden layers to capture the complexity of the takes much more execution time. All possible coalitions of feature values
data. have to be estimated with and without the feature of interest in order to
calculate the exact Shapley value. As the number of features increases
3.7. Setup of the models this becomes computational intractable. In practice Monte-Carlo sam­
pling is used to approximate the required computations. In this paper,
To avoid look ahead bias the features were lagged by one day. The 100 Monte-Carlo simulations were used. Shapley Additive exPlanations
dataset was randomly split so that 70% of the data were employed for (SHAP) are based on Shapley values but include information for the how
model training and 30% were used for testing the accuracy of the the size and sign of a feature affects the prediction outcomes.
methods. Five hundred trees were used to estimate random forests and Estimation was conducted using the R software (R Core Team, 2022).
extremely randomized trees. Here predictors were randomly chosen More specifically, we employed the following R packages: e1071 pack­
using the floor of the square root of the number of features at each split. age (Meyer et al., ); random forests machine learning package (Breiman
Provided many trees are chosen, random forest models are not sensitive et al., 2018); ranger package (Wright et al., 2022); fastshap package
to the number of trees. More specifically, a very large number of trees (Greenwell, 2021); torch (Falbel et al., 2023), and caret package (Kuhn
does not result in overfitting, whereas a small number of trees can lead et al., 2020). For portfolio analysis the PerformanceAnalytics package
to high test error. Preliminary analysis indicated that 500 trees were was used (Peterson et al., 2020).
more than enough to accommodate any decrease in test error.
The SVM used in this paper uses a radial basis function with two 4. Data
tuning parameters - a cost parameter and a gamma parameter. A 10-fold
cross validation with 10 repeats were used to determine the optimal To undertake our analysis, we collected daily rare earth stock prices
values of these parameters. A grid search from a list of values (0.1, 1, 10, and some important market uncertainty data. Rare earth stock prices are
100, 1000) was used to choose the cost parameter. The gamma measured using the VanEck Rare Earth/Strategic Metals ETF (REMX).
parameter was chosen using grid search from the following list of values This is the largest and most liquid rare earth exchange traded fund
(0.2, 0.5, 1, 2, 3, 4). Distance-based algorithms are impacted by the (ETF). The REMX is invested in companies that are focused on activities
range of features. Since a distance-based algorithm is used in the related to the production, refining, and recycling of rare earth elements
computation of SVMs, we scaled the features to have a mean of zero and and strategic metals. Companies included in REMX must generate at
unity standard deviation. Data scaling was not applied to the random least 50% of their revenue from rare earths. As of March 31, 2023, the
forests or extremely randomized trees as splitting on nodes takes place top ten holdings account for 60% of total assets.3 The top ten holdings
on a single feature and not a range of features. are China Northern Rare Earth (Group) High-Tech Co Ltd, Zhejiang
Training the RNN is not as easy as for the other methods used in this Huayou Cobalt Co Ltd, Liontown Resources, Pilbara Minerals, Allkem
paper. There are no well accepted guidelines on how many hidden layers
to use or how many neurons to include. The training process involves a
trial and error process (Zhang et al., 2021). The training process can be 3
https://www.vaneck.com/us/en/investments/rare-earth-strategic-metals-
computationally demanding. The data were scaled as for the case of etf-remx/.

4
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

Ltd., Lynas Rare Earths Ltd, MP Materials Corp, Sigma Lithium, Livent characterized by a rare earth crisis and its impact on REMX is pro­
Corp, and Xiamen Tungsten Co Ltd. In terms of country representation, nounced.4 Trade tensions between China and Japan resulted in
the top five countries are Australia (40%), China (30%), United States increased prices for rare earths to Japan which affected Japanese high-
(14%), Canada (6%), and Brazil (5%). tech manufacturing. The price of rare earths spiked even further when
The features used to predict rare earth stock prices consist of tech­ China cut export quotas by 40%. This event was shocking because, for
nical indicators and stock market volatility. The technical indicators the first time, many companies had to deal with the fact that China was
used include the 50-day and 200-day moving averages, stochastic the world’s dominant producer of rare earths and could use this
oscillator (slow, fast), advance – decline line (ADX), the relative strength advantage to advance geopolitical issues. As a result of a 2014 World
indicator (RSI), moving average cross-over divergence (MACD), on Trade Organization (WHO) ruling against China’s rare-earths export
balance volume (OnBalanceVolume), money flow index (MFI), price restrictions, China loosened export quotas in 2016 causing rare-earth
rate of change (ROC), and the Williams accumulation and distribution prices to plummet pushing many rare earth producers outside of
(WAD) (Achelis, 2013). The default settings in the R package TTR are China such as Lynas to near bankruptcy (Henriques and Böhm, 2022;
used to calculate technical indicators (Ulrich, 2020). Early 2020 saw an increase in the rare earth stock prices owing to both
Asset price volatility is a measure of risk and can have an important the 2019 US-China trade embargo and the COVID-19 pandemic.
impact on stock prices. High levels of volatility coincide with stock We observe large increases in volatility (VIX, OVX, VXN, and ICLN)
market turmoil. Banerjee et al. (2007) find that the VIX is a good pre­ around March 2020 when the World Health Organization (WHO)
dictor of stock returns. The VIX is the CBOE volatility index representing declared COVID-19 a pandemic (Fig. 1b). Although these values have
the US stock market expectations of volatility for the next 30 days. decreased since March 2020, variability has yet to match pre-COVID-19
Christoffersen and Diebold (2006) find that sign probability forecast are levels. In fact, the most recent values from March 2023 indicate a slight
impacted by volatility (VIX). Bekiros and Georgoutsos (2008) use downward trend. The absolute value of ICLN has, in addition to the
recurrent neural networks (RNN) to predict the price direction of the onset of COVID-19, experienced several large spikes in volatility.
S&P500. They find that the VIX is an important predictor. Oil market Over the sample period, REMX has a slightly negative mean value
volatility may also be an important predictor of rare earth stock prices indicating that the overall trend in the data was down (Table 1). The
because the oil market incorporates business cycle information about volatility variables (VIX, OVX, VXN, and ICLN) each have positive mean
the demand and supply of oil as well as geopolitical information. OVX values and the coefficient of variation indicates that ICLN is the most
provides an estimate of the expected US crude oil price volatility over variable while VXN is the least variable. The null hypothesis of normally
the next thirty days. The VXN is an estimate of technology stock market distributed data is rejected for all variables.
volatility over the next 30 days and may affect REMX since rare earths
are used in the production of electronics. The absolute return of the 5. Results
iShares Global Clean Energy ETF (ICLN) are included to represent clean
energy stock price volatility. ICLN is the largest clean energy stock price Results from predicting rare earth stock price direction are reported
ETF by market capitalization. ICLN volatility is included to account for
the fact that rare earths are used in clean energy products and processes.
All data are retrieved from Yahoo Finance. The daily dataset covers
the period October 28, 2010 (the first day REMX price data are avail­
able) to February 28, 2023. The variables VIX, OVX, VXN, and the ab­
solute returns of ICLN are smoothed using a five-day moving average.
A plot of REMX adjusted closing prices shows a downward trend
from the start of the sample period to the middle of 2015 (Fig. 1a). From
late 2015, REMX prices increased until 2017 and then declined before
trending upwards in 2019. The REMX prices are heavily affected by
events that impact the supply and demand of REEs. The year 2010 was

Fig. 1b. Time series plots of five period moving averages of VIX, OVX, VXN,
and the absolute returns of ICLN.

4
https://www.sciencehistory.org/learn/science-matters/case-of-rare-earth-e
Fig. 1a. Time series plot of REMX stock prices. lements-history-future.

5
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

Table 1
Summary statistics.
median mean std.dev coef.var skewness kurtosis W W(p)

REMX 0.000 − 0.020 2.102 − 107.044 − 0.196 3.920 0.969 0.000


VIX 16.616 18.413 7.111 0.386 2.342 10.083 0.820 0.000
OVX 34.316 37.617 18.035 0.479 4.456 32.147 0.671 0.000
VXN 18.968 21.256 7.413 0.349 1.590 4.224 0.866 0.000
ICLN 0.011 0.012 0.008 0.632 2.976 18.737 0.797 0.000

Data for the period October 29, 2010 to February 28, 2023. REMX is measured in log returns. VIX, OVX and VXN are five-period moving averages of their respective
values. ICLN is a five-period moving average of the absolute value of log returns. W represents the Wilcox test for normality and W(p) is the associated p-value.

in this section. The most striking observation about the prediction ac­
curacy is that after 10 days, random forests, RNN, Extra Trees, and SVM
have much higher prediction accuracy than Lasso or Naïve Bayes
(Fig. 2). Between one day and eight days the prediction accuracy of
random forests, Extra Trees, and SVM increases quickly to 85%. At 20
days, random forests, Extra Trees, RNN, and SVM have accuracy
approaching 90%. The Extra Trees is particularly impressive as it rea­
ches accuracy of slightly over 90% in the 12-day to 20-day range. By
comparison, Lasso never reaches an accuracy higher than 67% while the
Naïve Bayes accuracy is generally less than 60%.
The kappa values presented in Fig. 3 show a pattern that are similar
to those depicted in Fig. 3. Random forests, Extra Trees, RNN, and SVM
have the highest kappa values. Lasso and Naïve Bayes have the lowest
kappa values.
The prediction accuracy can also be assessed using the F1 measure
(Fig. 4). Consistent with the results in Figs. 2 and 3, random forests,
Extra Trees, RNN, and SVM have the highest accuracy while Lasso and Fig. 3. Kappa values for REMX price direction prediction.
Naïve Bayes have lower F1 values.
The random forest model can be used to calculate Shapley feature
importance. Here, features with large absolute Shapley values are more
important than features with small absolute Shapley values. For
assessing global feature importance, the average of the absolute Shapley
values per feature is calculated. Looking first at the Shapley feature
importance for a 20-day forecast horizon, WAD is the most important
feature (Fig. 5). WAD changes the predicted probability of being in the
UP state by on average 6% (0.06 on the horizontal axis). After WAD, the
next most important features are MA200, MA50, OnBalanceVolume, and
VIX. Feature importance drops off abruptly after VIX. Notice that these
five features (WAD, MA200, VIX, MA50 and OnBalanceVolume) are
among the most important features for 5-day, 10-day and 20-day fore­
cast horizons although the ranking of these features changes slightly
depending on the forecast horizon. For one day forecasts OnBalance­
Volume is the most important feature. Overall, the technical indicator
features tend to be more important than the volatility features.
The Shapley value reported in Fig. 5 show the importance of each Fig. 4. F1 values for REMX price direction prediction.
feature but does not provide information on the sign of the features.
SHAP values provide information on both the importance and the sign of the feature value. Fig. 6 shows SHAP values calculated from a random
forest for 20-day forecasts. Notice that the ranking of the features is the
same as those in Fig. 5. WAD and MA200 are the most important features
and for each of these features lower values indicate higher importance.
The next most important group of features are MA50, OnBalanceVo­
lume, and VIX. It is also the case that for each of these features lower
values indicate higher importance. MACDSignal is an example of a
feature where higher feature values have greater importance.
The results thus far have been obtained by randomly splitting the
dataset into a training dataset and a test dataset (a cross-validation (CV)
approach). Random shuffling of the data, however, may lead to inac­
curate predictions if the time series features in the data are not pre­
served. Bergmeir et al. (2018) discuss this and demonstrate that with
autoregressive models k-fold cross validation is useful when errors are
uncorrelated. As the focus of our paper is predicting a classification
variable rather than a continuous variable (REMX price direction) and
some of our technical features such as WAD, MA50 and MA200 contain
significant past information on stock prices, the residual serial correla­
Fig. 2. REMX price direction prediction accuracy. tion is assuaged.

6
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

Fig. 5. Shapley values for 1-day forecast horizon (top left), 5-day forecast horizon (top right), 10-day forecast horizon (bottom left), and 20-day forecast horizon
(bottom right). Shapley values computed from a random forest.

window approach to predict REMX price direction. Extra Trees had the
highest accuracy for most of the forecasts. The initial training dataset to
construct predictions from one period through to twenty periods uses
the first 70% of the data. Subsequently, one additional observation is
added to the end of the training data set while the earliest observation is
removed, and predictions are recomputed. The fixed window of training
data is rolled through the data until the end of the data is attained. These
predictions are referred to as time-series cross-validation (tsCV).
For 10-day forecasts the CV value of 0.8952 is similar to the tsCV
value of 0.8708 (Table 2). It is also the case that for the 20-day forecasts,
the CV accuracy of 0.9214 is similar to the tsCV accuracy of 0.9109. The
same pattern is observed for kappa and F1 where for a particular forecast
horizon, the CV and tsCV values are similar. It is concluded that CV and
tsCV approaches produce similar accuracy values.
It is important to evaluate the practical significance of the forecasts.
To do so a portfolio comparison is made between a buy and hold (B&H)
portfolio and a switching (Switch) portfolio that uses the price direction
forecasts from Extra Trees to make an investment in REMX. The Extra

Table 2
Extra Trees prediction accuracy comparisons for REMX.
10-day 20-day

Acc CV 0.8952 0.9214


tsCV 0.8708 0.9109
Kappa CV 0.7892 0.8424
Fig. 6. SHAP values for random forests 20-day forecasts. tsCV 0.7373 0.8157
F1 CV 0.8870 0.9169
tsCV 0.8855 0.9246
To gain additional insight, we use an Extra Trees rolling fixed

7
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

Trees model is used since it has the highest accuracy in the 10-day to 20-
day range. If the price direction forecast is positive, then invest in REMX
for the duration of the forecast period. Otherwise do not invest in REMX.
Trading signals are created using the previously described tsCV
approach. Portfolio summary statistics are shown in Table 3. For 10-day
forecasts, the B&H portfolio has a mean value of 0.361 (36.1%)
compared to the switching portfolio mean value of 1.426 (143%). The
switching portfolio has a higher mean return, Sharpe ratio, and Omega
value.5 The switching portfolio has lower standard deviation, value at
risk (VaR), and expected shortfall (ES) values. Thus, the switching
portfolio has a higher return to risk ratio. It is also the case that for 20-
day forecasts, the swathing portfolio outperforms the B&H portfolio.
The superiority of the switching portfolio is demonstrated visually by
the equity curves shown in Figs. 7 and 8.

6. Discussion

Fig. 7. Equity curves (Extra Trees, 10 day forecasts).


Rare earth elements are a group of 15 elements that are classified as
part of the lanthanide series in the periodic table of elements. The REEs
have atomic numbers between 57 and 71. Scandium and yttrium are not
members of the lanthanide series but given their similar properties to the
lanthanide series are often classified as rare earth elements. REEs are
essential to the high-technology society that we live in. REEs are widely
used in electronics, clean energy, health care technology, automotive
(electric vehicles), and defense. The US was the world’s largest supplier
of REEs until the 1990s when China aggressively entered the industry.
By 2012, China was the world’s largest supplier of REEs and currently
holds monopoly power over the processing of REEs (Bo et al., 2019). The
demand for REEs is expected to grow rapidly in the coming years with
the expansion of technological innovation and clean energy adoption.
Consequently, an increased demand for REEs needs to be met with
increased investment for locating, producing and processing REEs.
The introduction of this paper presented several research questions.
The first two research questions were “which methods produce the most
accurate forecasts of rare earth stock prices” and “how does forecast
accuracy vary with the forecast horizon?” Building on research showing
that machine learning methods have higher accuracy when predicting Fig. 8. Equity curves (Extra Trees, 20-day forecasts).
complex metal price data, this paper used six machine learning methods
(random forests, RNN, SVM, extremely randomized trees, Lasso, and the highest prediction accuracy. For forecasts in the 10-day to 20-day
Naïve Bayes) to forecast rare earth stock price direction. Forecasts were range, random forests, extremely randomized trees, and support vec­
computed over a multi-step horizon to see how prediction accuracy tor machine have prediction accuracies greater than 85%. Lasso never
changes with the forecast period. Rare earth stock prices are measured achieves an accuracy greater than 67% while the accuracy of Naïve
using the REMX ETF and the features consist of technical indicators and Bayes is less than that of Lasso. The extremely randomized trees method
market volatility measures. Random forests, Extra Trees and SVM have is particularly attractive because it has high prediction accuracy (in
excess of 90% over the 12-day to 20-day range) and takes less compu­
tational time than either random forests or SVM. Forecast accuracy in­
Table 3 creases quickly over the one-day to eight-day period for random forests,
Portfolio summary statistics for a buy and hold (B&H) portfolio and a switching Extra Trees, SVM, and RNN. Between 15-days and 20-days, the accuracy
portfolio using trading signals generated from an Extra Trees model. of these models is in the range between 87% and 91%. These results are
10-day 20-day in line with the metal price forecasting literature that has found random
B&H Switch B&H Switch forests (Díaz et al., 2020; Liu et al., 2017; Zhang et al., 2021), support
vector machine (Zhang et al., 2021), and neural networks (Khoshalan
mean 0.361 1.426 0.360 1.263
sd 0.439 0.254 0.439 0.330
et al., 2021; Sánchez Lasheras et al., 2015; Shi et al., 2023; Zhang et al.,
Sharpe 0.793 5.524 0.792 3.767 2021) have high accuracy for forecasting metal prices.
Omega 1.166 2.427 1.166 1.766 The third research question was “which features are most important
VaR − 0.042 − 0.021 − 0.042 − 0.030 for predicting rare earth stock prices?” Feature importance was assessed
ES − 0.058 − 0.031 − 0.058 − 0.043
using Shapley values. For forecasts in the 5-day to 20-day range, WAD,
The mean, standard deviation (sd), and Sharpe ratio are calculated using MA200, VIX, MA50 and OnBalanceVolume are the most important
annualized values. ES and VaR calculated at the 5% level using historical values. features. These results lend support to research on the importance of
technical indicators for predicting stock price direction. Of the non-
technical indicators VIX is the most important. OVX and VXN are
important but ICLN is not very important.
5 The fourth research question was “how does a trading rule con­
For the Sharpe calculation, the risk-free rate was set at 1% which is the
average annual yield on a 3 month T bill for the time period under study.
structed from a machine learner compare with a buy and hold portfo­
Trading costs were assumed negligible which is consistent with institutional lio?” To answer this question portfolio comparisons were made between
trading or brokerage accounts that offer no fees for trading ETFs. a buy and hold portfolio and a portfolio that uses price direction

8
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

predictions from an Extra Trees model to produce trading signals. importance. In choosing features for predicting rare earth stock prices, it
Portfolios constructed from 10-day and 20-day forecasts show the Extra is recommended that these variables be included.
Trees based switching portfolios have higher risk adjusted performance The practical implications from forecasting rare earth stock prices
and lower VaR and expected shortfall. In particular, the switching are demonstrated by comparing a buy and hold portfolio with a
portfolios using signals from Extra Trees had higher Sharpe ratios and switching portfolio that uses trade signals generated from Extra Trees
Omega than their corresponding buy and hold portfolios. price direction forecasts. The switching portfolio has higher risk
The market for rare earths is affected by trade disputes, geopolitical adjusted returns.
tensions, and supply chain disruptions. As countries realize the critical There are several future research directions. It may be of interest to
and strategic importance of REEs, they are turning their attention to­ expand the number of features to include business cycle and monetary
wards developing their own domestic rare earth elements industry. variables. Although some of the impact of these variables are already
While countries can enact resource policies to promote the exploration, accounted for in the existing feature set, there may be other aspects or
mining, and processing of rare earth elements, the development of rare nuances worth exploring. Another future direction could be to take a
earth industries requires massive amounts of financial capital. A suc­ longer-term forecast horizon. The analysis in this paper was conducted
cessful rare earths resource policy needs to provide specifics about using daily data and forecasts from one day to twenty days. Forecasts
financing as companies move through various stages of development. using lower frequency data like monthly or quarterly may also be useful
Governments, via subsidies, tax breaks, and special or strategic funding for investors and policy makers.
initiatives, can provide early stage financing to companies specializing
in REEs. In order to make successful policy decisions about rare earth CRediT author statement
mining governments need rare earth industry trend indicators. Rare
earth stock prices can act as an overall gauge of the strength of the rare Perry Sadorsky: Conceptualization, Data curation, Methodology,
earths industry. Thus, rare earth stock price forecasts are important in­ Software, Writing- Original draft preparation, Writing- Reviewing and
puts into government mining policy that determines mining royalty Editing. Irene Henriques: Conceptualization, Methodology, Writing-
payments and licensing fees. Original draft preparation, Writing- Reviewing and Editing.
Governments can also invest in rare earths through SWFs. SWFs are
set up by government policy and have specific guidelines regarding their
investing objectives. Unlike many other commodities, rare earths do not Declaration of competing interest
have traded futures markets. As a substitute, risk management of rare
earths can be conducted using rare earths equities. Accurate forecasts of The authors declare that they have no known competing financial
rare earth stock prices are central to the risk management of rare earths. interests or personal relationships that could have appeared to influence
For example, if rare earth stock prices are predicted to fall, then a short the work reported in this paper.
position can be taken to offset downside risk. The results presented in
this paper on forecasting rare earth stock prices provides useful infor­ Acknowledgements
mation on which methods to use, which features are most important,
and the practical implications of using a trading strategy based on Extra We thank the Guest Editors (Qiang Ji, Duc Khuong Nguyen, Dayong
Trees. Zhang) and the anonymous reviewers for their helpful comments. Irene
Henriques’ research is supported by funding from the Social Sciences
7. Conclusions and Humanities Research Council of Canada (SSHRCC) grant number
435-2018-1104.
Rare earth elements are very much in demand for their usage in the
production of electronic and clean energy equipment. To put it simply, References
REEs are essential to the technology-oriented society we live in. Future
Achelis, S.B., 2013. Technical Analysis from A to Z, second ed. McGraw-Hill Education,
demand for rare earths elements is projected to grow rapidly and this New York.
will require an increase in investment in the rare earths sector. Rare Apergis, E., Apergis, N., 2017. The role of rare earth prices in renewable energy
earth stock prices can act as an indicator of the strength of the rare earth consumption: the actual driver for a renewable energy world. Energy Econ. 62,
33–42. https://doi.org/10.1016/j.eneco.2016.12.015.
industry and in the absence of rare earths futures markets can be used for
Baldi, L., Peri, M., Vandone, D., 2014. Clean energy industries and rare earth materials:
risk management. To date, however, little is known about predicting Economic and financial issues. Energy Pol. 66, 53–61. https://doi.org/10.1016/j.
rare earth stock prices. The purpose of this research is to predict rare enpol.2013.10.067.
earth stock prices and learn something about which features are most Ballings, M., Van den Poel, D., Hespeels, N., Gryp, R., 2015. Evaluating multiple
classifiers for stock price direction prediction. Expert Syst. Appl. 42, 7046–7056.
important. https://doi.org/10.1016/j.eswa.2015.05.013.
The analysis reported in this paper shows that rare earth stock price Banerjee, P.S., Doran, J.S., Peterson, D.R., 2007. Implied volatility and future portfolio
direction prediction accuracy is high for random forests, Extra Trees, returns. J. Bank. Finance 31, 3183–3199.
Basak, S., Kar, S., Saha, S., Khaidem, L., Dey, S.R., 2019. Predicting the direction of stock
RNN and SVM and lower for Lasso and Naïve Bayes. For 10-day to 20- market prices using tree-based classifiers. N. Am. J. Econ. Finance 47, 552–567.
day forecasts random forests, extremely randomized trees, and support https://doi.org/10.1016/j.najef.2018.06.013.
vector machine achieve prediction accuracies greater than 85%. These Bekiros, S.D., Georgoutsos, D.A., 2008. Non-linear dynamics in financial asset returns:
the predictive power of the CBOE volatility index. Eur. J. Finance 14, 397–408.
results are important in demonstrating the high accuracy of using https://doi.org/10.1080/13518470802042203.
random forests, Extra Trees and SVM to predict rare earth stock price Bergmeir, C., Hyndman, R.J., Koo, B., 2018. A note on the validity of cross-validation for
direction. Extra Trees has prediction accuracy greater than 90% for evaluating autoregressive time series prediction. Comput. Stat. Data Anal. 120,
70–83. https://doi.org/10.1016/j.csda.2017.11.003.
forecasts in the range of 12-days to 20-days and have the added benefit Bo, L., Böhm, S., Reynolds, N.-S., 2019. Organizing the Environmental governance of the
of having the fastest computation time. It is recommended that rare-earth industry: China’s passive revolution. Organ. Stud. 40, 1045–1071.
extremely randomized trees be used to calculate fast and accurate pre­ https://doi.org/10.1177/0170840618782278.
Bouri, E., Kanjilal, K., Ghosh, S., Roubaud, D., Saeed, T., 2021. Rare earth and allied
dictions of rare earth stock price direction.
sectors in stock markets: extreme dependence of return and volatility. Appl. Econ.
Shapley values indicate that WAD, MA200, VIX, MA50 and on bal­ 1–21. https://doi.org/10.1080/00036846.2021.1927971.
ance volume are among the most important features for 5-day, 10-day Brandt, M.W., Kang, Q., 2004. On the relationship between the conditional mean and
and 20-day forecast horizons. Four of these features are technical in­ volatility of stock returns: a latent VAR approach. J. Financ. Econ. 72, 217–257.
https://doi.org/10.1016/j.jfineco.2002.06.001.
dicators. Of the non-technical indicators VIX is the most important. Breiman, L., 2001. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:
SHAP values show that lower values of these features result in greater 1010933404324.

9
I. Henriques and P. Sadorsky Resources Policy 86 (2023) 104248

Breiman, L., Cutler, A., Liaw, A., Wiener, M., 2018. randomForest: breiman and cutler’s Lundberg, S.M., Lee, S.-I., 2017. A unified approach to interpreting model predictions. In:
random forests for classification and regression. R package version 4, 6–14. Advances in Neural Information Processing Systems. Curran Associates, Inc.
Bustos, O., Pomares-Quimbaya, A., 2020. Stock market movement forecast: a Systematic Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., Leisch, F., C++-code), C.-C.C.
review. Expert Syst. Appl. 156, 113464 https://doi.org/10.1016/j. (libsvm, C++-code), C.-C.L. (libsvm2021. e1071: misc Functions of the Department
eswa.2020.113464. of Statistics, Probability Theory Group (Formerly: E1071), TU Wien..
Cederburg, S., Johnson, T.L., O’Doherty, M.S., 2023. On the Economic significance of Neely, C.J., Rapach, D.E., Tu, J., Zhou, G., 2014. Forecasting the equity risk premium:
stock return predictability. Rev. Finance 27, 619–657. https://doi.org/10.1093/rof/ the role of technical indicators. Manag. Sci. 60, 1772–1791. https://doi.org/
rfac035. 10.1287/mnsc.2013.1838.
Chen, Y., He, K., Zhang, C., 2016. A novel grey wave forecasting method for predicting Nyberg, H., 2011. Forecasting the direction of the US stock market with dynamic binary
metal prices. Resour. Pol. 49, 323–331. https://doi.org/10.1016/j. probit models. Int. J. Forecast. 27, 561–578. https://doi.org/10.1016/j.
resourpol.2016.06.012. ijforecast.2010.02.008.
Chen, Y., Zheng, B., Qu, F., 2020. Modeling the nexus of crude oil, new energy and rare Nyberg, H., Pönkä, H., 2016. International sign predictability of stock returns: the role of
earth in China: an asymmetric VAR-BEKK (DCC)-GARCH approach. Resour. Pol. 65, the United States. Econ. Modell. 58, 323–338. https://doi.org/10.1016/j.
101545 https://doi.org/10.1016/j.resourpol.2019.101545. econmod.2016.06.013.
Chen, Y.-C., Rogoff, K.S., Rossi, B., 2010. Can exchange rates forecast commodity prices? Pesaran, M.H., Timmermann, A., 2002. Market timing and return prediction under model
Q. J. Econ. 125, 1145–1194. https://doi.org/10.1162/qjec.2010.125.3.1145. instability. J. Empir. Finance 9, 495–510. https://doi.org/10.1016/S0927-5398(02)
Christoffersen, P.F., Diebold, F.X., 2006. Financial asset returns, direction-of-change 00007-5.
forecasting, and volatility dynamics. Manag. Sci. 52, 1273–1287. https://doi.org/ Peterson, B.G., Carl, P., Boudt, K., Bennett, R., Ulrich, J., Zivot, E., Cornilly, D., Hung, E.,
10.1287/mnsc.1060.0520. Lestel, M., Balkissoon, K., Wuertz, D., Christidis, A.A., Martin, R.D., Zhou, Z.
Díaz, J.D., Hansen, E., Cabrera, G., 2020. A random walk through the trees: forecasting “Zenith”, Shea, J.M., 2020. PerformanceAnalytics: Econometric Tools for
copper prices using decision learning methods. Resour. Pol. 69, 101859 https://doi. Performance and Risk Analysis.
org/10.1016/j.resourpol.2020.101859. Pincheira Brown, P., Hardy, N., 2019. Forecasting base metal prices with the Chilean
Dooley, G., Lenihan, H., 2005. An assessment of time series methods in metal price exchange rate. Resour. Pol. 62, 256–281. https://doi.org/10.1016/j.
forecasting. Resour. Pol. 30, 208–217. https://doi.org/10.1016/j. resourpol.2019.02.019.
resourpol.2005.08.007. Pönkä, H., 2016. Real oil prices and the international sign predictability of stock returns.
Falbel, D., Luraschi, J., Selivanov, D., Damiani, A., Regouby, C., Joachimiak, K., Badr, H. Finance Res. Lett. 17, 79–87. https://doi.org/10.1016/j.frl.2016.01.011.
S., Rstudio, 2023. Torch: Tensors and Neural Networks with “GPU” Acceleration. R Core Team, 2022. R: A Language and Environment for Statistical Computing.
Fernandez, V., 2020. The predictive power of convenience yields. Resour. Pol. 65, Reboredo, J.C., Ugolini, A., 2020. Price spillovers between rare earth stocks and financial
101532 https://doi.org/10.1016/j.resourpol.2019.101532. markets. Resour. Pol. 66, 101647 https://doi.org/10.1016/j.
Fernandez, V., 2017a. Rare-earth elements market: a historical and financial perspective. resourpol.2020.101647.
Resour. Pol. 53, 26–45. https://doi.org/10.1016/j.resourpol.2017.05.010. Rubaszek, M., Karolak, Z., Kwas, M., 2020. Mean-reversion, non-linearities and the
Fernandez, V., 2017b. A historical perspective of the informational content of commodity dynamics of industrial metal prices. A forecasting perspective. Resour. Pol. 65,
futures. Resour. Pol. 51, 135–150. https://doi.org/10.1016/j. 101538 https://doi.org/10.1016/j.resourpol.2019.101538.
resourpol.2016.12.002. Sadorsky, P., 2022. Forecasting solar stock prices using tree-based machine learning
Geurts, P., Ernst, D., Wehenkel, L., 2006. Extremely randomized trees. Mach. Learn. 63, classification: how important are silver prices? N. Am. J. Econ. Finance 61, 101705.
3–42. https://doi.org/10.1007/s10994-006-6226-1. https://doi.org/10.1016/j.najef.2022.101705.
Greenwell, B., 2021. Fastshap: Fast Approximate Shapley Values. Sadorsky, P., 2021a. A random forests approach to predicting clean energy stock prices.
He, K., Lu, X., Zou, Y., Keung Lai, K., 2015. Forecasting metal prices with a curvelet J. Risk Financ. Manag. 14, 48. https://doi.org/10.3390/jrfm14020048.
based multiscale methodology. Resour. Pol. 45, 144–150. https://doi.org/10.1016/j. Sadorsky, P., 2021b. Predicting gold and silver price direction using tree-based
resourpol.2015.03.011. classifiers. J. Risk Financ. Manag. 14, 198. https://doi.org/10.3390/jrfm14050198.
Henriques, I., Böhm, S., 2022. The perils of ecologically unequal exchange: contesting Salim, H., Sahin, O., Elsawah, S., Turan, H., Stewart, R.A., 2022. A critical review on
rare-earth mining in Greenland. J. Clean. Prod. 349, 131378 https://doi.org/ tackling complex rare earth supply security problem. Resour. Pol. 77, 102697
10.1016/j.jclepro.2022.131378. https://doi.org/10.1016/j.resourpol.2022.102697.
Hsu, J., 2019. Don’t panic about rare earth elements. Scientific American May 31. Sánchez Lasheras, F., de Cos Juez, F.J., Suárez Sánchez, A., Krzemień, A., Riesgo
International Energy Agency, 2021. The Role of Critical Minerals in Clean Energy Fernández, P., 2015. Forecasting the COMEX copper spot price by means of neural
Transitions. International Energy Agency, Paris. France. https://www.iea.org/report networks and ARIMA models. Resour. Pol. 45, 37–43. https://doi.org/10.1016/j.
s/the-role-of-critical-minerals-in-clean-energy-transitions. resourpol.2015.03.004.
James, G., Witten, D., Hastie, T., Tibshirani, R., 2022. An introduction to statistical Shi, T., Li, C., Zhang, W., Zhang, Y., 2023. Forecasting on metal resource spot settlement
learning: with applications in R. In: Springer Texts in Statistics, second ed. Springer- price: new evidence from the machine learning model. Resour. Pol. 81 https://doi.
Verlag, New York. https://doi.org/10.1007/978-1-4614-7138-7. org/10.1016/j.resourpol.2023.103360.
Jyothi, R.K., Thenepalli, T., Ahn, J.W., Parhi, P.K., Chung, K.W., Lee, J.-Y., 2020. Review Shuai, J., Zhao, Y., Shuai, C., Wang, J., Yi, T., Cheng, J., 2023. Assessing the international
of rare earth elements recovery from secondary resources for clean energy co-opetition dynamics of rare earth resources between China, USA, Japan and the
technologies: grand opportunities to create wealth from waste. J. Clean. Prod. 267, EU: an ecological niche approach. Resour. Pol. 82, 103446 https://doi.org/10.1016/
122048 https://doi.org/10.1016/j.jclepro.2020.122048. j.resourpol.2023.103446.
Kanazawa, Y., Kamitani, M., 2006. Rare earth minerals and resources in the world. Song, Y., Bouri, E., Ghosh, S., Kanjilal, K., 2021. Rare earth and financial markets:
J. Alloys Compd. 408–412, 1339–1343. https://doi.org/10.1016/j. dynamics of return and volatility connectedness around the COVID-19 outbreak.
jallcom.2005.04.033. Resour. Pol. 74, 102379 https://doi.org/10.1016/j.resourpol.2021.102379.
Khoshalan, H.A., Shakeri, J., Najmoddini, I., Asadizadeh, M., 2021. Forecasting copper Ulrich, J., 2020. TTR: Technical Trading Rules.
price by application of robust artificial intelligence techniques. Resour. Pol. 73, Wang, J., Guo, M., Liu, M., Wei, X., 2020a. Long-term outlook for global rare earth
102239 https://doi.org/10.1016/j.resourpol.2021.102239. production. Resour. Pol. 65, 101569 https://doi.org/10.1016/j.
Kriechbaumer, T., Angus, A., Parsons, D., Rivas Casado, M., 2014. An improved resourpol.2019.101569.
wavelet–ARIMA approach for forecasting metal prices. Resour. Pol. 39, 32–41. Wang, Y., Liu, L., Wu, C., 2020b. Forecasting commodity prices out-of-sample: can
https://doi.org/10.1016/j.resourpol.2013.10.005. technical indicators help? Int. J. Forecast. 36, 666–683. https://doi.org/10.1016/j.
Kuhn, M., Wing, J., Weston, S., Williams, A., Keefer, C., Engelhardt, A., Cooper, T., ijforecast.2019.08.004.
Mayer, Z., Kenkel, B., , R Core Team, Benesty, M., Lescarbeau, R., Ziem, A., Wright, M.N., Wager, S., Probst, P., 2022. Ranger: A Fast Implementation of Random
Scrucca, L., Tang, Y., Candan, C., Hunt, T., 2020. Caret: Classification and Regression Forests.
Training. Xia, Q., Du, D., Cao, W., Li, X., 2023. Who is the core? Reveal the heterogeneity of global
Kwas, M., Paccagnini, A., Rubaszek, M., 2021. Common factors and the dynamics of rare earth trade structure from the perspective of industrial chain. Resour. Pol. 82,
industrial metal prices. A forecasting perspective. Resour. Pol. 74, 102319 https:// 103532 https://doi.org/10.1016/j.resourpol.2023.103532.
doi.org/10.1016/j.resourpol.2021.102319. Yin, L., Yang, Q., 2016. Predicting the oil prices: do technical indicators help? Energy
Leung, M.T., Daouk, H., Chen, A.-S., 2000. Forecasting stock indices: a comparison of Econ. 56, 338–350. https://doi.org/10.1016/j.eneco.2016.03.017.
classification and level estimation models. Int. J. Forecast. 16, 173–190. https://doi. Yin, L., Yang, Q., Su, Z., 2017. Predictability of structural co-movement in commodity
org/10.1016/S0169-2070(99)00048-5. prices: the role of technical indicators. Quant. Finance 17, 795–812. https://doi.org/
Liu, C., Hu, Z., Li, Y., Liu, S., 2017. Forecasting copper prices by decision tree learning. 10.1080/14697688.2016.1225977.
Resour. Pol. 52, 427–434. https://doi.org/10.1016/j.resourpol.2017.05.007. Zhang, H., Nguyen, H., Vu, D.-A., Bui, X.-N., Pradhan, B., 2021. Forecasting monthly
Lohrmann, C., Luukka, P., 2019. Classification of intraday S&P500 returns with a copper price: a comparative study of various machine learning-based methods.
random forest. International Journal of Forecasting, Special Section: Supply Chain Resour. Pol. 73, 102189 https://doi.org/10.1016/j.resourpol.2021.102189.
Forecasting 35, 390–407. https://doi.org/10.1016/j.ijforecast.2018.08.004. Zheng, B., Zhang, Y., Chen, Y., 2021. Asymmetric connectedness and dynamic spillovers
Lundberg, S., Lee, S.-I., 2016. An unexpected unity among methods for interpreting between renewable energy and rare earth markets in China: evidence from firms’
model predictions. In: Presented at the Advances in Neural Information Processing high-frequency data. Resour. Pol. 71, 101996 https://doi.org/10.1016/j.
Systems. https://doi.org/10.48550/arXiv.1611.07478 arXiv. resourpol.2021.101996.

10

You might also like