Professional Documents
Culture Documents
1 s2.0 S0377221721004483 Main
1 s2.0 S0377221721004483 Main
a r t i c l e i n f o a b s t r a c t
Article history: In this research, we develop a trading strategy for the optimal liquidation problem of large-order trading,
Received 18 November 2020 with different market microstructures, in an illiquid market. We formulate the liquidation problem as a
Accepted 17 May 2021
discrete-time Markov decision process. In this market, the flow of liquidity events can be viewed as a
Available online 25 May 2021
point process with stochastic intensity. Based on this fact, we model the price impact as a linear func-
Keywords: tion of a self-exciting dynamic process. Our trading algorithm is designed in such a way that when no
Finance favourite orders arrive in the Limit Order Book (LOB), the optimal solution takes offers from the lower
Optimal liquidation problem levels of the LOB. This solution might contradict conventional optimal execution methods, which only
Illiquid market trade with the best available limit orders; however, our findings show that the proposed strategy may
Markov-modulated poisson process reduce final inventory costs by preventing orders not being filled at earlier trading times. Furthermore,
Hawkes processes the results indicate that an optimal trading strategy is dependent on characteristics of the market mi-
crostructure.
© 2021 Elsevier B.V. All rights reserved.
https://doi.org/10.1016/j.ejor.2021.05.020
0377-2217/© 2021 Elsevier B.V. All rights reserved.
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
the 2008-09 global financial problem was rooted in the illiquid fi- well as the price and size of stocks, which are available during the
nancial market (Tirole, 2011). When insolvent financial institutions execution time. To find an optimal solution to the liquidation prob-
must liquidate massive amounts of illiquid assets, they no longer lem, one needs to consider a trade-off between liquidity risk and
follow their optimal trading strategies. In this situation, financial changing the stock price. The former results in slow order execu-
institutions are forced to liquidate the assets based on available tion, while the latter is caused by exogenous events or a rapid liq-
orders as fear of the crisis spreads. Consequently, the illiquid asset uidation. An early liquidation causes an unfavourable influence on
price further drops, which can lead to fire sales (Diamond, 1982). the stock price, and a late execution has liquidity risk since the
Our liquidation model is primarily an algorithmic method. These stock price can move away from that at the beginning of the pe-
methods apply automatic procedures to make decisions about sell- riod.
ing or buying of shares and the submitting of orders. In modern There are different ways to measure illiquidity (Vayanos &
financial markets, algorithmic trading methods are employed to Wang, 2013), such as bid-ask spread Glosten and Milgrom (1985),
trade illiquid assets. Algorithmic trading represents a large share market depth (Kyle, 1985), turnover, and other asset characteristics.
of market activity and can enhance liquidity supply and fragmen- In our market model, illiquidity is caused by a disparity between
tation of order flows (Foucault & Menkveld, 2008; Hendershott, supply and demand, which means that the execution of larger po-
Jones, & Menkveld, 2011) even when financial markets experience sitions requires longer periods of time and cannot be performed in
systemic events like the flash crash on May 6, 2010 (Kirilenko, Kyle, continuous time. Therefore, we formulate the liquidation problem
Samadi, & Tuzun, 2017). as a continuous-time stochastic control problem. By applying the
To understand the behaviour of market participants, we need to Markov decision processes (MDP) approach, we decompose the liq-
analyse the stochastic fluctuations in stock prices and the reaction uidation problem into multiple deterministic optimal control prob-
of players in the market. These fluctuations can be explained by a lems. The discrete-time framework is closely related to classical
sequence of market equilibria, which are determined by demand trading algorithms. In most continuous liquidation problem setups,
and supply. Generally, informed traders prefer to trade mainly dur- it is assumed that supply and demand are in balance; however,
ing times of liquidity. As opposed to when the market is liquid this is only a reasonable assumption in a liquid market.
(Collin-Dufresne & Fos, 2015; Kyle, 1985), traders are willing to Through our research, we formulate the liquidation problem as-
submit small orders when the market is illiquid Philip (2020). sociated with the depth function of the LOB. Our algorithm con-
However, traders are not entirely rational, and their trading be- structs trading boundaries built-in on market depth to determine
haviours are governed to some degree by their beliefs or senti- an optimum strategy. When the stock prices hit one of these trad-
ments (Shleifer & Summers, 1990). This disagreement in the mar- ing boundaries, i.e. there is a liquidity event, the algorithm liquids
ket creates an imbalance between supply and demand for limit or- some part of shares by submitting limit orders. Past studies show
ders, which causes illiquidity in the market. In this market, assets the link between price impact and market depth and show that
can be traded only infrequently, contingent on the arrival of ran- there is a linear relationship between an orders flow imbalance
domly occurring liquidity events. Furthermore, the dynamics of or- and price changes in high-frequency markets. This fact is investi-
der arrival are influenced by the liquidity of assets. Therefore, trad- gated by Kempf and Korn (1999) who measure the market depth
ing activities have indirect effects on the dynamics of price. We use as a surplus demand amount that is needed for jumping one unit
this fact and model the dynamics of incoming orders and liquidity price. They show that there is a non-linear relationship between
events with the Hawkes process. This process can capture irregu- market depth and the price impact of the orders flow.
lar patterns of asset prices. Following this approach, we measure In our numerical simulation, we compare the performance of
the price impact of order executions based on a stochastic inten- our algorithm concerning various market characteristics, as well
sity process using the mutually- and self- exciting properties of the as price impact functions. In the case where favourite offers do
Hawkes process. not come, our algorithm will reduce the speed of trading and go
In order-driven markets, buy and sell orders arrive at different deeper into LOB to avoid the not filling of orders and facing an
time points and wait in the LOB to be traded. We study the dy- ultimate inventory penalty. This solution might contradict conven-
namics of the arrival rate of limit orders and the model distribu- tional optimal execution methods, which only trade with the best
tion of limit orders in a LOB with the help of order statistics. We available limit orders. In fact, in illiquid markets, lower price lev-
analytically express the probability of orders ranking, sorted based els of market depth are more attractive than upper levels; these
on price, in the LOB. Modelling the LOB distribution helps us to depth levels include orders with significantly larger volumes. Most
forecast order in the hidden levels of the LOB and exploit the or- optimal liquidation methods focus on the best sell and buy price
der information stored more in-depth in the LOB (Biais, Hillion, & levels since their imbalance can move prices. By causing the lack
Spatt, 1995). More precisely, knowing the LOB distribution and ex- of available liquidity, we need to take the lower level of market
pected order quantities in the hidden levels guides the algorithm depth into account.
to select the limit orders of top levels of the LOB (both best and The main contributions of this paper can be summarized as fol-
deeper levels of the LOB) to avoid risk execution and lack of offers lows. First, we develop a new algorithm to liquidate a large asset in
in the future. It is consistent with the finding of (Kavajecz, 1999) an illiquid market. This algorithm uses trading boundaries built-in
which limit order traders maintain a reasonable level of depth on the market depth and adjusts the speed of trading. It helps to
around illiquidity events to reduce their exposure to adverse se- avoid not filling of the order by picking up not only the best limit
lection losses. Our algorithm mimics informed traders’ behaviour orders from LOB. Second, we model the arriving pattern of limit or-
that uses depth as a strategic decision factor to adjust trading rate ders based on the Poisson cluster process. We then use the same
(Goldstein & Kavajecz, 20 0 0; Kavajecz, 1999). approach to model price impact (Proposition 1) and show how this
We set up a stochastic control framework to maximize the ex- model can extend to two main classes of price impact functions,
pected revenue of trading with consideration to liquidity restric- i.e. permanent and temporary (Lemma 2). Third, we determine the
tions in trading a large order. We formulate the liquidation prob- distribution of a random number of sorted limit orders in LOB
lem as a multi-stopping problem with Poisson arrival patterns of (Theorem 1). Forth, we solve the liquidation problem numerically
orders in a discrete-time model. The optimal liquidation problem with apply MDP approach (Theorem 2) and prove the uniqueness
can be expressed as an optimal control problem, to determine opti- of the solution (Theorem 3). Fifth, with employing this algorithm,
mal strategies of trading stock portfolios by minimizing some cost we investigate different trading scenarios in various market charac-
functions. These strategies depend on the state of the market, as teristics. The results show that a market with a higher level of the
1051
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Table 1
Summary of the result of simulation the level of inventory and its corresponding optimal trading rate under different market conditions.
Panel I Quantile
Self-damping Revenue 10% 25% 50% 75% 100%
self-exciting property of limit order arriving can be more profitable tween the current system state and the policys action. A similar
(Table 1). problem, which has been considered in the market microstructure
The rest of the paper is organized as follows. We review the re- literature by Garman (1976), who study a trading problem of a
lated literature in Section 2. Section 3 explains the market model market maker who maximizes her profit by assuming order arrival
setup and the problem statement and describes the statistical rates depending on the price dynamics governed by the Poisson
model of the LOB. Section 4 presents the stochastic dynamics of process. Some other studies explain the unconditional and steady-
the intensity of the order arrival process, and we turn to model state distributions of the order book.
price impact. Section 5 describes the procedure of solution with The homogeneous Poisson process of sell and buy orders ar-
using discrete-time Markov Process. In Section 6, we explain the rival rate was the primary assumption of past studies. Garman
numerical method for optimal stopping time and simulate with (1976) explain conditions necessary and sufficient for the order ar-
different market microstructures. Section 7 summarizes the results rival patterns to be modelled by a homogeneous Poisson proces-
and concludes the paper with further remarks. sor. In this framework, none of the agents’ trading can dominate
other agents, or they place a large number of limit orders in a
2. Related literature finite time. Nevertheless, most of these assumptions are violated
in high-frequency trading. Empirical studies of high-frequency data
Our paper relates to the recent and growing literature on op- show that there are significant cross-correlation patterns of the ar-
timal execution problems. The primary research on this subject rival rate of similar limit buy or sell orders and significant auto-
date back to 1990s, and mainly focus on the discrete-time models, correlation in durations of events, (see: Cont, 2011). Dufour and
in which optimal strategies are determined as optimal liquidation Engle (20 0 0) use a framework provided by Hasbrouck (1991) and
rates per unit time. Bertsimas and Lo (1998), as a discrete-time investigate the time duration between trades. They determine that
model, study the optimal liquidation of a large block of shares with the trade duration process is correlated with the information about
linear permanent price impact in a fixed time horizon. Almgren current and past transactions. Some empirical studies like Zheng,
and Chriss (2001), as one of the most cited optimal execution Roueff, and Abergel (2014) show that the dynamics of the Bid and
models, constructs an efficient frontier of execution strategies via Ask price can be modelled using Hawkes processes, introduced by
mean-variance analysis of expected costs of liquidation and divide Hawkes (1971). The irregularity properties of high-frequency fi-
the price impact into temporary and/or linear permanent price im- nancial data can be explained by the self-exciting and mutually-
pact. They use diffusion price processes in a continuous-time trad- exciting properties of the Hawkes processes. We use this fact and
ing space. In our study, we model a similar liquidation problem model dynamics of incoming limit orders using Hawkes processes.
with a restriction on liquidity in continuous-time trading time. We In a similar study, Engle and Lunde (2003) model trades and quote
then solve the model in a discrete-time space. arrivals with bivariate point processes. Some empirical studies like
Our study of an optimal stopping problem is a link between Bacry, Delattre, Hoffmann, and Muzy (2013) show that high fre-
two different areas: control theory and market microstructure. In quency data can be modelled using Hawkes processes. Our base-
control theory, this problem has been studied as a single or multi- line model only considers the submission of limit orders in a LOB.
stopping problem in the classical best choice problems by us- Therefore, we apply univariate bivariate point processes to model
ing homogeneous Poisson processes. A single stopping time prob- the dynamics of incoming limit orders. Our baseline model can
lem governed by the Poisson process was formulated as the best be extended to apply bivariate point processes for submission and
choice problem in the late 1950s by Lindley (1961). The op- cancellation processes. Several studies extend Hawkes processes.
timal k-stopping problem with finite and infinite time horizon Chehrazi, Glynn, and Weber (2019) develop a traceable model
was presented with the complete solution by Peskir and Shiryaev based on a controlled self-exciting point processes framework to
(2006) in a Bayesian formulation. A recent study by Ciocan and predict the repayment behaviour of unsecured loans placed in
Mišić (2020) develops an interpretable optimal stopping model credit portfolios. They also consider the Hawkes processes along-
which helps the decision-maker to determine the connection be- side an extra predictable process that models account-treatment
1052
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
actions. Similarly, Dassios and Zhao (2017) extend Hawkes pro- one the component of price model (Almgren & Chriss, 2001; Bert-
cesses with a diffusion component to estimate default probability simas & Lo, 1998). Empirical researches show that in a liquid mar-
and model a portfolio loss process. ket, trading activities cannot alter the dynamics of the price for a
In our research, we study the statistical model of the LOB. We long time.
determine an analytic expression of the distribution of limit or- Conversely, this component can be one of the basic building
ders in Theorem 1. Garman (1976), Bayraktar, Horst, and Sircar blocks of price impact models in the illiquid market. The second
(2007) study the fluctuation of the orders arriving in the elec- tier of the price impact model is related to modelling the tem-
tronic trading market. Cont, Stoikov, and Talreja (2010) develops porary impacts that just have an effect on the current orders for
a tractable stochastic model of limit order markets to capture the a short time and not for the entire trading time. This component
main statistical features of LOBs. They show the short-term price cannot alter the dynamics of the price for an extended period and
movements could be explained by the information on the current just has an impact on the immediate execution of the trades. Some
state of the LOB. Their model is based on the dynamics of the best studies like Chen, Feng, Peng, and Ye (2014), Chen, Feng, and Peng
bid and offer queues since the best orders can move the price. (2015) consider both permanent and temporary price impact in op-
This study uses Laplace transforms to analyse the behaviour of the timal deleveraging problem and other studies like Brunovskỳ et al.
order book with Poisson arriving patterns of buy- and sell- limit (2018) leaves out the permanent effect on optimal trade execution
orders. Unlike these studies, we express the distribution of LOB, problem. The transient impact is the third class of price impact;
sorted by price, thoroughly based on order statistics methodology. this component can be significant for a finite period, and it even-
This distribution, which is defined analytically in Theorem 1, can tually vanishes (see: Alfonsi, Fruth, & Schied, 2008; Gatheral, 2010),
help us to estimate the distribution of a random number of arrival Predoiu, Shaikhet, and Shreve (2011) models price impact by con-
limit orders in a LOB. sidering this component. The theoretical model by Huberman and
Our paper considers a liquidation problem when trader faces Stanzl (2004) show that permanent price impact can be linear with
liquidity restrictions, and she should adjust her trading strategies. trade size. This model also shows that with having temporary price
The impact of illiquidity on trading activities has been investigated impact in a general form, the permanent price impact must be lin-
in some theoretical studies like (Easley & O’hara, 1987; Kyle, 1985) ear.
as well as some empirical studies like (Chordia, Roll, & Subrah- The last class of the price impact modelling is to control the
manyam, 2001; Chordia, Subrahmanyam, & Tong, 2014). The the- rate of arrival of limit orders via the trading rate process. Cai, Han,
oretical models mostly focus on mechanisms underlying the trad- Li, and Li (2019) study the price impact of herding in the fixed-
ing activities faced with the cost of liquidity constraints, e.g. a the- income market and show that buy and sell herdings are asso-
oretical model by Grossman and Miller (1988) tries to determine ciated with permanent price impact and transitory price impact.
the equilibrium of liquidity level of the market. Empirical studies Alfonsi and Blanc (2016) introduce a mixed market impact Pois-
mostly concern about changing depth and trading activities due to son model to analyze a temporary shift of dynamics of the rate of
some significant macroeconomic factors and revolutions in trading order arrival. This model used the advantage of the self-exciting
technologies. property of the Hawkes process to change the direction of trad-
In studies by Bayraktar and Ludkovski (2014), Horst and Nau- ing in the same or opposite direction of order arrivals. Studies by
jokat (2014), the liquidity model of Almgren (2003) has been ap- Bayraktar and Ludkovski (2014), Guéant, Lehalle, and Fernandez-
plied to determine temporary and permanent impacts of trading Tapia (2012) control the intensity of limit orders for the liquida-
on fundamental price of asset. These studies design optimal liq- tion problem in a risk-neutral and risk-averse model, respectively.
uidation strategies in illiquid markets. Bayraktar and Ludkovski Gueant and Lehalle (2015) extend this model to a general form of
(2014) formulate the liquidation problem as a Hamilton–Jacobi– dynamics of intensity. From a traditional viewpoint on this prob-
Bellman equation association with the depth function of the LOB. lem (see: e.g. Alfonsi & Schied, 2010), the optimal liquidation de-
In our paper, we study a liquidation problem which is formulated pends on the existence of a sufficiently large limit order in the LOB,
similarly to Bayraktar and Ludkovski (2014); however, we solve and price impact is a function of the shape and depth of the LOB.
the problem numerically in a discrete space similar to (Bäuerle & Our modelling of price impact mainly belongs to the third class
Rieder, 2009; Huang & Guo, 2020). of price impact models. Essentially, in illiquid markets, the trader
faces a liquidity problem, and she should wait for a longer time
2.1. Related literature on price impact to execute her orders. Empirical studies show that this market is a
plastic (inelastic) market, and limit orders do not arrive in contin-
In the current study, we use a stochastic intensity process to uous time. The characteristics of this market allow us to model the
measure the price impact of order executions. Price impact is a tra- pattern of the arrival of orders in discrete space and express an im-
ditional topic in conventional market microstructure research (see: pact price function as a consequence of trading on this pattern. In
e.g. Dufour & Engle, 20 0 0; Hasbrouck, 1991; Kyle, 1985). Modelling Proposition 1, we introduce a new price impact, defined on an im-
of price impact on illiquid markets is not widely studied; we re- pact of trading on the arriving order book. Similar to impact price
view some different generations of price impact modelling in the models in the third class, such as Bayraktar and Ludkovski (2014),
literature. An illiquid asset can only be liquidated contingent on Guéant et al. (2012), our price impact model can control the in-
the arrival of liquidity events. Accordingly, we cannot consider as a tensity of limit orders, and resultantly can control/mitigate the im-
circumstance in which agents trade continuously in time, and there pact of price trading. In the Lemma 2, we show how this model
is no price impact (Kogan, Ross, Wang, & Westerfield, 2006). can be related to the first and second classes of price impact mod-
We can loosely classify price impact models based on their ef- els. Moreover, similar to the Kyle (1985) model, our impact price
fects and how long they influence the dynamics of the prices in model is conditional on the state of liquidity in the market.
the following categories. We then show the link that our price im-
pact model has to these classes. 3. Market model setup
The first class is a permanent price impact. As a consequence
of a significant discrepancy between supply and demand in the It is assumed that our financial market consists of an illiquid
market and a spread of bulk trading information, the dynamics of asset and a risk-free asset. The illiquid asset and the risk-free as-
price have stable shifts. Kyle (1985) proposed a basic microstruc- set can be considered as a risky asset, and as a numeraire with
ture model to analyze the price impact. Permanent impact was the interest rate r, respectively. The market for the risk-free asset
1053
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
is liquid, that means traders can liquidate a large amount of this to predict different markets quantities conditional on the current
security without facing costs of price impact. state of the order book. On the other hand, this information might
In a complete financial market, the price process St is a stochas- be used by counter-parties to take advantage of price movement
tic process on a complete filtered probability space (; F; F; P ) and submit or cancel strategic orders. The idea of the dark pool is
where F is the filtration generated by {Ft }t∈T . This space is to reduce the information linkage and adverse price risk.
bounded by a maturity time T . The assumption of the finite Underlying our approach is that the dynamics of limit order
discrete-time space is contradictory to conventional liquidation arrivals follow a Poisson pattern of prices depending on rates of
problems. However, it is not far away from reality; order arrival trading. We study the stochastic dynamics of the arrival rate of the
patterns of buy and sell orders are not the same. Due to fluctua- limit order by using order statistics methodology.
tions in the stock market, especially in a high-frequency environ-
ment, orders might not appear regularly in the LOB. Some empir- Distribution of a limit order book
ical studies show that, in a short period, the percentage changes j
of a stock price are not uniformly distributed with the same cen- We assume in a financial market, the price processes (St )( j=1···d )
trality, but price processes can be internally steady-state processes. of limit orders satisfy the following stochastic dynamics for t ∈
[0, T ] is dSt = St (μt dt + σ dWt ).
j j
Therefore, the model should be solved and interpreted in discrete
j
time and space. We use these facts and propose a model with Given a vector of price of buy (sell) orders (St )1≤ j≤d on the
the rate of incoming buy orders as a Poisson process in a finite probability space (; F; P ), the orders are sorted into a vector
discrete-time and space. (St(1) , St(2) , · · · , St( j ) ) satisfy the following chain conditions:
1054
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
(0,t ) +N (t ,t +dt )
= E[F ( y )N ] 3.3. Problem statement
N (0,t ) N (t ,dt )
= E[F ( y ) ]E[F ( y ) ] The dynamics of the price is described by a right-continuous
= Gt (F (y ))(1 − λdt + λdtF (y )) process (St )(t≥0 ) changing while the times when the book order
Gt+dt (F (y )) − Gt (F (y )) process meets boundary conditions. Later on, we discuss the im-
= −λ(1 − F (y ))Gt (F (y )) pact of the current trade on the underlying price by using a par-
dt
d ticular approach to model price impact based on exogenous factors
(Gt (F (y )) = −λ(1 − F (y ))Gt (F (y )) as well as characteristics of the stock price processes. We show
dt
how to model the dynamics of the intensity of orders arrival cor-
d
ln(Gt (F (y )) = −λ(1 − F (y )) responding feedback effects of trading and the state of the market
dt
during orders execution period.
Solving for F (y ) gives We consider a problem for an investor holding a large volume
Gt (F (y )) = e−λt (1−F (y )) (3.1) (Q 0 ) shares of illiquid assets and a risk-free asset. The objective of
the shareholder is to maximize her profit with subject to liquidity
With assumption of a L number of unexecuted orders at the time constraints and limited execution time.
point s, (s < t ), the generating function for the time interval (0, s ) The illiquid asset can be considered a risky asset with the fol-
is: lowing dynamic for t ∈ [0, T ]
λ0 λ1 dSt = St (μt dt + σ dWt ),
Gs (F (y )) = e −λ ( F ( y ) 0 ) + e −λ ( F ( y ) 1 )
0! 1!
and a risk-free asset used as a numeraire with the interest rate r
λL
+ ··· + e −λ ( F ( y ) L ) with the following dynamic:
L!
dBt = B0 ert dt.
(λF (y ))0 (λF (y ))1 (λF (y ))L
= e −λ + + ··· +
0! 1! L! Let Sˆ = e−rt St be a martingale with respect to the measure P on a
complete filtered probability space(; F; P ).
Use Taylor series with remainder:
The trading strategy of the trader is characterized by a trading
f (c )λL+1 rate (γt )t∈T . The vector γ contains the information on the amount
Gs (F (y )) = e−λ eλ(F (y )) − (3.2)
( L + 1 )! of trading at each time point t.
The dynamics of the inventory of the investor holding Q 0 shares
For c ∈ [0, 1], f (c ) = F (y )L+1 ≈ ecλ . For the sake of simplicity, it is of an illiquid asset with the trading rate γ as a control process is
assumed that c = 0. given with the following counting process:
1055
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
sequence Ti (i ≤ m ). The trader wants to maximize the expected
V (T , Q ) = sup Et,q
0
e −rT
γt St λt dt , (3.4)
γ ∈A 0
utility function of her final wealth. The expected utility allows the
trader to compare the gain and loss of her trading strategies. Ex- where T
= T ∧ inf{t ≥ 0 : Q 0 − qt = 0} is trading time and trad-
pected utility is comparable to expected revenue which means that ing process γt is a control process, which has influence over the cash
the trader prefers a trading strategy that generates the highest ex- process, the inventory process, and the dynamics of prices. The vec-
pected utility. tor γ contains the information on the amount of trading at each time
The optimum expected revenue of the trade can be modelled as point (γt ).
a dynamic optimization problem which is an interaction between
Proof. It is assumed that the limit orders arrival rate is a point
price impact and price dynamics. We apply the Bellman’s optimal-
process with intensity rate λt . At each stopping time, we liquid qt
ity equation in a recursive format to solve the m-stopping prob-
shares of an illiquid asset with dynamics dqt γ = −γt at price Sˆ =
lem.
e−rt St . The trading rate γt is a control process. From Eq. (3.3) we
max(Expected Revenue ) have:
= max(Immediate Exercise + Continuation ). V (T , Q 0 ) = sup Et,q [H (T , Q 0 )]
γ ∈A
The trader holds Q0
number of shares and places a selling order an
k = γ (we have assumed = 1) illiquid asset in the market with
n
the trading rate γ in the time horizon T , the performance criteria = sup Et,q e −rTi
γt St 1(Ti ≤T )
γ ∈A
with the strategy γ is given by: i=1
T
H γ (t, Q 0 ) = hγ (t, k ) + Et,q [H γ (t + 1 ), Q 0 − k], (when n → ∞ )(t ∈ T ) = sup Et,q e −rt
γt St dNt
γ ∈A
where h(t, k ) is the depth function (concave and increasing in k;
0
decreasing in t): T
= (S(t,k ) − ES(t,k ) )+ ,
where:
4. Modeling stochastic intensity and price impact
n
S(t,k ) = St(i ) 1(i≤k ) .
i=n−k+1 The rate of arrival of limit orders depends on the price and size
of orders; the cheaper orders will remain on the LOB for a shorter
Let ES(t,k ) be the expected boundaries, constructed from the set
time. It is empirically shown that the distribution of the price is
of best limit orders in the LOB by using the distribution of the or-
not constant and depends on the current state of the LOB. The ex-
der book at the stopping time t.
istence of significant autocorrelation of price movement and cor-
n
relations across time periods rejects the natural assumption of a
ES(t,k ) = P (i ≤ k )St(i ) . constant intensity of order arrivals rate (see: e.g. Cont, 2011). In an
i=n−k+1
illiquid market, bid and ask orders do not arrive consistently, and
At the stopping time, when one of these expected boundaries counterparties do not meet their demands regularly. These irreg-
(see Fig. 1)is hit by the stock price, we execute some proportion of ular proprieties of a high-frequency environment lead to applying
the inventory via limit order at the LOB with the depth function point process to model time series of the bid or ask price move-
h(t, k ) with a boundary condition: h(T , 0 ) = h(0, Q 0 ) = 0. ments. Garman (1976) propose models, in which order arrivals are
In this regard, the goal of the investor is to maximize her governed by a point process with constant intensity. There are sev-
wealth at the end of the period T associated with this dynamic eral conditions in previous trading models that do not hold our
1056
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
setting. First, a small number of traders cannot dominate the mar- is influenced by the arrival time of orders, the order value, and in-
ket with large-scale orders. The second condition is that the sub- directly, the price will be affected by trading.
mitting of orders is independent, and we mainly have an assump- Generally, the supply/demand of financial securities is not elas-
tion of market efficiency. These conditions are essential for having tic (Obizhaeva & Wang, 2013). According to fundamental concepts
a constant intensity of order arrivals. However, the structure of the of economics, in equilibrium, the relationship between supply and
market is dynamic, and high-frequency traders dominate over sev- demand determines the price that traders are willing to take for
enty percent of the market; therefore, none of the above conditions positions during a specified period. If the market is illiquid, trad-
can be satisfied. ing a large number of shares produces a large price fluctuation
Hawkes process as a point process was initially applied to (Mrázová & Neary, 2017). High-frequency traders Often execute
model earthquake occurrences. Some empirical studies show that trades in the course of the order book imbalance. As a result of this
the Hawkes process can fit with high-frequency data (Bacry et al., shock, the equilibrium of supply and demand could be changed,
2013; Bacry, Jaisson, & Muzy, 2016; Chavez-Demoulin & McGill, and the price must rise/decline to reach a new equilibrium (see:
2012). This process can explain the irregularity properties of data Fig. 10). The price movement occurs when a change in demand is
based on the positive and negative jumping behaviour of the asset caused by a change in a number of market orders. This change in
prices. Cartea, Jaimungal, and Ricci (2014) use the Hawkes process the price equilibrium is sensitive to shifts in both prices, and quan-
to express the dynamics of market orders and the LOB. This pro- tity demanded, which is called the ”elasticity of demand” (Mrázová
cess is a general form of a standard point process. The intensity of & Neary, 2017).
this process is conditional on the recent history, increase the rate We model this impact with close form solutions for the dynam-
of arrival of the same type of event (self-exciting property), and it ics of price impact. Let function f (γt ) be a general form of the
captures the impact of arrivals of orders on other types of orders impact market, conditional on the state of liquidity of the market.
(mutually-exciting property). In A.1, we explain the basic concept Similar to Kyle (1985) model, we use parameter α , as inverse liq-
of a point process and the dynamic of the Hawkes process. uidity of the market, to measure price impact. Function (t − s )
is the decay of price impact function, which is independent of the
state of the market. This function can be shown by the exponential
4.1. Price impact model or power-law decay of price impact. Gabaix (2016) shows power-
law functions can explain several patterns in the stock market, in-
Estimating and modelling price impacts is a crucial research cluding price impact.
area in market microstructure literature. It can be expressed as a Different types of price impact and decay functions have been
relationship between trading activities and price movements. Mon- used in the algorithm trading and market microstructure litera-
itoring and controlling the impact of trading are the main parts ture. We define two well-known impact functions: exponential and
of algorithm trading. To minimize the market impact, traders split power-law functions, linked to the exponential decay function. The
their orders into smaller chunks based on the current liquidity in parameter α is a proxy for the inverse of the market liquidity:
the LOB. Price impact might be dependent on exogenous factors • Power law impact function:
like trade rate and some endogenous factors, such as liquidity and
volatility. It is empirically observed that there is a changing of the f (γ ) = γ α , α > 0.
volatility of prices on trading activities. • Exponential impact function:
Kyle (1985) proposes a simple model for the evolution of mar- f (γ ) = exp(αγ ), α > 0.
ket prices and price movement. In his model, noise traders and
inform traders submit orders, and a market maker executes the Gatheral (2010) examines the dynamic-arbitrage for different
orders. The price is adjusted to a linear relationship between the combinations of the market impact function and decay function
trade size and a proxy for market liquidity. As a consequence of to not admit price manipulation strategies. (Alfonsi and Schied
trading, the price moves permanently, information affects the price (2010), proposition 2) proves the convex and non-constant expo-
for a long time, and price changes are strongly autocorrelated. The nential price impact, leads to the strictly positive definite property,
most recent literature on market microstructure shows that for liq- to avoid arbitrage implication, which is already stated in Gatheral
uid markets, price impact cannot be permanent. In highly liquid (2010). Nonlinear impact function that is not a related state of the
markets, outstanding shares are small and need less time to be liq- market is questionable; however, the above functions defined in an
uidated. illiquid market depend on the state of the market.
In illiquid markets, the temporary price impact governs the en- We model the dynamics of the order arriving intensity with the
during impact, and after a considerable number of trades, the price Hawkes process as a point process. This model can capture irreg-
movement shows a high resilience. Empirical studies show that ular properties of the high-frequency data. The strength of incen-
an elastic market can be converted to a plastic (inelastic) mar- tive to generate the same event is presented with parameter σ .
ket as a consequence of lacking counterparties, such that high- With choosing the price impact function f as a function of a trad-
volume trading has a long-lasting impact. In this market, traders ing rate γt and the parameter κ as the exponent of the decay of
face difficulty in finding counterparties at a particular time, and market impact, we can model a liquidation effect on the trade ar-
they should wait for a longer time to execute the orders or else rival dynamics. This model can be shown by a particular form of
cooperate with counterparties. The effect of price impact corre- the Hawkes process with the following SDE:
sponding to the liquidation strategy can be significantly large for dλt = ( f (γt ) − κλt )dt + σ dNt . (4.1)
substantial risk-averse traders who liquidate shares at a fast rate. In the following proposition, we model the impact of trading on
For this study, we use a stochastic intensity process to measure the rate of order arriving based on the Hawkes process:
the price impact. This model is based on a counting process us-
ing the mutual-exciting property of the Hawkes process. In illiquid Proposition 1 (Price Impact Model). Let f be a price impact func-
markets, the imbalance between supply and demand causes illiq- tion and a function of a trading rate γt . σ and κ represent the magni-
uidity, and the execution of a larger position needs a longer time tude of self-exciting and the exponent of the decay of market impact,
and effect on the arrival of the new orders, or to stay longer in the respectively. The solution of the SDE is expressed by:
LOB. Also, we explained the coming pattern of orders governed by t t
a stochastic point process. Therefore, the intensity of order arrival λt = ( f (γs )(t − s ))ds + σ e−κ (t−s ) dNs , (4.2)
0 0
1057
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
t+ε
where the function is the decay of impact.
= lim eαγs e−κ (t−s ) ds ≈ e(−κ t+αγε ) = λε .
ε →0 t
Proof. Following SDE represents the impact of trading on the dy-
namics of the rate of orders’ arrival: We can simply define the instantaneously affected stochastic in-
tensity by:
dλt = ( f (γt ) − κλt )dt + σ dNt .
t
To prove, we can move the first term of SDE to the left side, then λtInst = λε + σ e−κ (t−s ) dNs . (4.4)
multiply it by eκ t (see: Norberg, 2004), or we can define an initial 0
guess for the solution of above SDE as follows:
t
λt = λ0 + σ e−κ (t−s ) dNs , 5. Solving model by discrete-time MDP
0
where The liquidation problem (3.4) is a continuous-time stochastic
t
control problem. Bayraktar and Ludkovski (2014) formulate this
λ0 = e−κ t D(0 ) + f (γs )(t − s )ds.
problem as a Hamilton–Jacobi–Bellman equation, and provided vis-
0
cosity solutions as closed-form solutions associated with the LOB
λ0 Verify by Itô’s lemma on eκ t λt
model and its depth function. The classical stochastic control ap-
t
proach solves nonlinear partial differential equations, and it is nec-
eκ t λt = D(0 ) + eκ t f (τs )e−κ (t−s ) ds
0 essary the differentiability of the value function to be satisfied.
t In contrast, a discrete-time Markov Decision Processes (MDP) ap-
+ eκ t σ e−κ (t−s ) dNs proach provides a set of optimal policies, condition on the differ-
0
t t entiability of the value function. Bäuerle and Rieder (2009) use this
= f (γs )eκ s ds + σ eκ s dNs approach and by applying dynamic programming principles, prove
0 0 the existence and uniqueness of the solution. This liquidation prob-
κ eκ t λt dt + eκ t dλt = f (γt )eκ t dt + σ eκ t dNt lem is a mathematical abstraction of real problems in which an in-
κλt dt + dλt = f (γt )dt + σ dNt vestor should decide on several stopping times to gain certain rev-
enue at each stage. The investor has a finite period to liquidate a
dλt = ( f (γt ) − κλt )dt + σ dNt .
position and maximize the total revenue at the end of the period.
Therefore, in this problem with a finite number of sub-periods,
a mapping function should be applied to compute optimal poli-
Lemma 2. The impact stochastic intensity equation: (4.3) is a general
cies through the limited number of steps of the dynamic program-
function of price impact. It can then measure an instantaneous price
ming algorithm. She must find an equilibrium to minimize the cost
impact in the short term, and a permanent price impact on the long
of the present extensive trading against the future abandon risk
run.
where the overall cost is not predictable. This problem can be for-
Proof. From Equation (4.3) we have: mulated as a deterministic or stochastic optimal control problem
t with Markov or semi-Markov decision property under different se-
lim λ0 = lim (e−κ t D(0 ) + f (γs )(t − s )ds ) tups.
t→∞ t→∞ 0
Bertsekas and Shreve (1996) distinguish between the stochas-
t
tic optimal control problems from its deterministic form regard-
= lim f (γs )e−κ (t−s ) ds
t→∞ 0 ing available information. In a deterministic optimal control prob-
t lem, we can specify a set of states and policy as a control pro-
= lim e−κ t f (γs )eκ s ds cess in advance. Thereby, a succeeding state is the function of the
t→∞ 0
t present state and its control process. On the other hand, in the
0 ( f (γs )eκ s ds stochastic control problem, controlling the succeeding state of the
= lim
eκ t t→∞ system leads to evaluate unforeseen states; therefore, control vari-
f (γt )eκ t ables that are no longer appropriate or have ceased to exist.
(apply l’Hopital’s rule) = lim
t→∞ κ eκ t In this paper, we use a Piecewise-Deterministic Markov Deci-
eαγt sion Model (PDMD) to decompose the liquidation (problem: 3.4)
= lim
t→∞ κ as a continuous-time stochastic control problem into discrete-time
1 + αγt + o(αγt ) 1 + αγT problems. Alternatively, we can apply a method by Mamer (1986),
= lim ≈ = λ∞ . which is successive approximations for semi-Markov decision pro-
t→∞ κ κ
cesses (Huang & Guo, 2011) or the pathwise optimization method
We defined the liquidation problem as a finite time investing prob-
(Desai, Farias, & Moallemi, 2012).
lem on a limited time horizon T . λ∞ represents the long-run trad-
ing impact on the intensity of order arrivals rate, we can think of
5.1. Solution by PDMP
it as a permanent price impact as ”base” intensity part of stochas-
tic intensity. It is a linear function of the trading rate to avoid dy-
Piecewise-Deterministic Markov Process (PDMP), introduced by
namic arbitrage (Gatheral, 2010). We can express the permanently
Davis (1984), is now applied mainly in various areas such as natu-
effected stochastic intensity by:
ral science, engineering, optimal control, and finance. The PDMP
t
is a member of the cádlág Markov Process family; it is a non-
λtPerm = λ∞ + σ e−κ (t−s ) dNs . (4.3)
0 diffusion stochastic dynamic model, with a deterministic motion
that is punctuated by a random jump process.
Instantaneous market impacts can be measured from the small
interval of trading, and the difference between the pre-trade and Definition 1 (Piecewise Deterministic Markov Process). A
post-trade price movements: piecewise-deterministic Markov process (PDMP) is a cádlág
t+ε Markov process with deterministic emotion controlled by the
lim λ0 = lim (e−κ t D(0 ) + f (γs )(t − s )ds ) random jump at jump time.
ε →0 ε →0 t
1058
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Markov process (PDMP). We note that since Ji > −1 the price pro-
cess stays positive.
location of the post-jump Z2 = K (.|φ (Z1 ), . ) by using the stochas- To be more precise on how we can solve the liquidation prob-
tic Kernel K (.|φ (Z1 ), . ). This iterative procedure will be continued lem with PDMD, we explain the formal expression of deterministic
until it hits a boundary of the state space. optimal control problems, which is well documented in Bertsekas
As mentioned earlier, the process of the illiquid asset for satis- and Shreve (1996). In a formal deterministic optimal control prob-
fies the stochastic difierential equation: lem, xi presents the state of the system at stage i and function ci
is a corresponding control at that stage. System equation xi+1 =
dSt = St (μt dt + σ dWt ), t ∈ [0, T ]
f (xi , ci ) is the generating function of next state xi+1 from current
In this market, the price processes can have jumps that could state xi and its control ci . Function g(xi , ci (xi )) is the rewarding
be occurred at random time points and followed by a Poisson pro- function of state i a associate with function ci .
cess. We denote by T = {0 := T0 < T1 · · · < Tm < n} the set of jump The total expected revenue after N decision steps is defined
time points of the Poisson process. The price processes for the time by:
between two jumps i.e. t ∈ [Ti , Ti+1 ) for i ≤ m is given by
N
σ 2 J (x0 ) = Et,x
C
g (xi , ci (xi )) .
C
(5.5)
St = STn exp(μt − )(t − Ti ) + σ Wt ,
2 i=0
and at time of a jump, the price process is: The set C contains all control functions (ci )i={0,··· ,N} , i.e. the ex-
pected total revenue is set of states and corresponding a sequence
STi − STi+1 = STi+1 Ji , of Markovian decision controls.
where Ji is an independent and identically distributed random vari- Let = {π1 , π2 , · · · , πN } be the sequence of all Markovian de-
able. The price process St is a so-called piecewise-deterministic cision controls πi . These decision controls are corresponding to
1059
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Markovian policy for predictable admissible process γ defined in been given in the Equation (3.3) and is more consistent with the
the set . Each Markovian policy is a function of the jump time dynamic programming principle.
component (Tk ) and the post jump location (ZTk ). Bertsekas and Shreve (1996) define the universal measurable
In each stage, the reward obtained as a result of sequential mapping T to map Equation 3.3 from (5.3) as follows:
Markovian decisions after the jump by applying the strategy πi
T π ( )(x ) = H[x, π , ]. (5.8)
is:
t Therefore the operator T πn can be decomposed to T π0 · T π1 ·
R(Zi , πi (Zi )) = U π (φs (Xi ))dNs . (5.6) T π2 · · · T πn−1 .
0
From above definitions, we have:
The total expected revenue after N steps is defined by:
πn = T πn (r )
N
= T π0 T π1 · · · T πn−k πn−k .
π (t, x ) = Et,x R ( Z i , πi ( Z i ) . (5.7)
i=0 The mapping T π is an universal measurable mapping, let T π0 = 0
In the following theory, we explain how the above equation is and k ≤ n, (see: Bertsekas & Shreve, 1996 (chapter 1)) we have:
mathematically equivalent to our main problem: (3.4). πk = T π0 T π1 · · · T πk−1 π0 .
Theorem 2. Suppose policy set = {π1 , π2 , · · · , πN } is a Markovian In the following theorem, we prove the uniqueness of the solution
policy set and Z = {Z1 , Z2 , · · · , ZN } is a set of post jump location of by applying a piecewise-deterministic Markov process model.
PMDP and R is the reward function equation: (5.3). Then the value
Theorem 3 (Uniqueness of the solution). Let V (t, q ) be a revenue
function is the expected reward of the PDMP under the Markovian
function, and the optimal solution of the liquidation problem is a con-
policy at time point t and in the state x:
cave and decreasing function in q and increasing in t. Then the solu-
V (t, q ) = sup π (t, x ), t ∈ T, q ∈ [0, Q 0 ] tion of the liquidation problem by applying a PMDP model is converg-
π ∈
ing to a unique solution.
where
Proof. In the Theorem 2, we have shown that
N
π (t, x ) = Et,x R(Zi , πi (Zi )) . V (t, q ) = sup H γ (t, q )
γ
i=0
= sup π (t, x ),
Proof. By using the information on the jump location from Z = π ∈
(Z1 , Z2 , · · · , ZN ) as a set of post jump of PMDP, we have: with defining the sequence of = {π0 , π2 , · · · , πn } as set of
T
Markovian policies (Bertsekas & Shreve, 1996), we have:
V π (t, q ) = Et,q e −rt
γt st λˆt dt (t ∈ T )
0 πn = lim T π0 · T π1 · T π2 · · · T πn−1 π0 (x0 )
T
n→+∞
= Et,x [ U π (φ (XTi ))dNt Therefore under the same condition, the optimal solution is de-
i=0
Ti fined as:
1060
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
1061
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Fig. 5. Inventory level of a trader with given the initial inventory Q 0 and time to maturity T.
• The parameter κ as the exponent of the decay of market impact strategies. In the case of coming of not favourite offers, the algo-
• The size of the bid-ask spread is used as a proxy to measure rithm reduces the speed of trading (panel I: self-damping prop-
illiquidity erty) and waited for a longer time to find better matching coun-
terparties. In unstable market conditions, are indicated with the
These parameters can be estimated from real market data. higher level of self-damping, the trader should pay for final inven-
Plotting the trading boundaries (Fig. 7) shows that the optimal tory to liquidate the whole position of initial shares. At this point,
trading level depends on the price process, the remainder of the the best strategy is to accept offers in the LOB to avoid never to
inventory (Fig. 5), and time to maturity. face severity penalties at the end of period. If estimated parame-
Due to market conditions and asset characteristics, an agent ters of markets might show that the higher chance of same types
with a higher degree of risk aversion cares more about the exe- of orders’ occurrences (panel II: self-exciting property), the algo-
cution risk and price fluctuations. She starts trading with available rithm reduces the trading rate in the hope of getting better offers.
orders at a deeper level of the LOB to avoid risk execution and lack The second column shows the related entries of the value function
of offers in future. She splits the original order into smaller slices as a result of the implicit and explicit cost of trading. The results
to mitigate price impact. demonstrate that the second type of market characteristics with a
Table 1 summarizes results of simulations by our model, includ- higher level of self-exciting property can be more profitable.
ing the level of inventory (Fig. 5) and its corresponding optimal It is worthwhile to compare our algorithm with some bench-
trading rate (Fig. 6) for different scenarios of implementations of marks, such as the conventional algorithm that trades only the best
1062
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Fig. 7. Trading boundary condition for multi-Stopping time and with Poisson arrival.
LOB orders. Table 1 (panel III: Conventional method) presents the shares at the given time. The central assumption of the majority of
results of the conventional algorithm simulations. The conventional limit order models is to trade at the best bid and ask prices (see:
optimal execution methods only trade with the best available limit Cao, Hansch, & Wang, 2008). We allow the trading procedure to go
orders in the LOB. We thus replicated the simulation experiments to more in-depth into the LOB to avoid not filling the order and
with both market microstructure conditions: self-exciting and self- face last-minute inventory penalties.
damping properties. Our results showed that the trader faces more
costs at the end of the period and fewer profits than our algorithm 7. Discussion and further works
in both scenarios. The conventional algorithm also considers orders
at the top level of the LOB, and is less flexible when the market is In this study, we proposed an analytical solution to the optimal
illiquid. liquidation problem, taking a dynamic approach and building nu-
Having a look at the graphs (Fig. 8) showing the optimal trading merical boundaries of multi-stopping problems in an illiquid mar-
rate of different scenarios, one can conclude that there are more ket. We simulated optimal splitting order models according to the
fluctuations in trading if the probability of arrival new offers is re- existing liquidity in the order book with different parameters and
duced. price impact models. We used the PDMD to decompose the liqui-
Our numerical results also show that the proportion of the gain dation problem into discrete period problems, and applied Markov
of the liquidation model considerably depends on the specification decision rules to obtain the solution. We examined the uniqueness
of the price impact function. It indicates that market conditions and practical existence of the optimal solution. We showed that
have an effect on the final inventory level and causes a substantial the percentage gain of the liquidation model depends on the mar-
dropping in the final wealth of the trader. An important aspect of ket conditions and specifications of the price impact function. In
the optimal strategies, which we have developed, is to take into ac- contrast to most limit order models for liquidating a market, which
count the execution risk in an illiquid market, i.e. inability to liquid only trade at the best bid and ask prices, our model allows the
1063
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Acknowledgement
Appendix A
1064
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
β . In the self-exciting point process framework, this damping fac- Chavez-Demoulin, V., & McGill, J. A. (2012). High-frequency financial data modeling
tors magnitude indicates the Hawkes processes self-exciting and using Hawkes processes. Journal of Banking & Finance, 36(12), 3415–3426.
Chehrazi, N., Glynn, P. W., & Weber, T. A. (2019). Dynamic credit-collections opti-
self-damping properties. The former property corresponds to the mization. Management Science, 65(6), 2737–2769.
positive value of β , while the latter corresponds to the negative Chen, J., Feng, L., & Peng, J. (2015). Optimal deleveraging with nonlinear temporary
value of β . An alternative choice for the kernel function f (t ) is a price impact. European Journal of Operational Research, 244(1), 240–247.
Chen, J., Feng, L., Peng, J., & Ye, Y. (2014). Analytical results and efficient algo-
power-law function. rithm for optimal portfolio deleveraging with market impact. Operations Re-
As a result of converting the integrated intensity into indepen- search, 62(1), 195–206.
dent components via exponentially distributed variables, we can Chordia, T., Roll, R., & Subrahmanyam, A. (2001). Market liquidity and trading activ-
ity. The Journal of Finance, 56(2), 501–530.
apply most of the analytical methods to analyze these statistical
Chordia, T., Subrahmanyam, A., & Tong, Q. (2014). Have capital market anomalies
random variables. One can calibrate the parameters of the Hawkes attenuated in the recent era of high liquidity and trading activity? Journal of
process with parametric estimation methods like maximum likeli- Accounting and Economics, 58(1), 41–58.
Ciocan, D. F., & Mišić, V. V. (2020). Interpretable optimal stopping. Management Sci-
hood estimation (Ozaki, 1979) or with non-parametric estimators
ence.
like the Expectation-Maximization (EM) algorithm (Bacry, Dayri, & Collin-Dufresne, P., & Fos, V. (2015). Do prices reveal the presence of informed trad-
Muzy, 2012). The goodness of the fit of these models also can be ing? The Journal of Finance, 70(4), 1555–1582.
examined with conventional statistical tests. Comerton-Forde, C., & Putniņš, T. J. (2015). Dark trading and price discovery. Journal
of Financial Economics, 118(1), 70–92.
Cont, R. (2011). Statistical modeling of high-frequency financial data. Signal Process-
ing Magazine, IEEE, 28(5), 16–25. https://doi.org/10.1109/MSP.2011.941548.
References Cont, R., & Fournié, D.-A. (2010). Change of variable formulas for non-anticipative
functionals on path space. Journal of Functional Analysis, 259(4), 1043–1072.
Alfonsi, A., & Blanc, P. (2016). Dynamic optimal execution in a mixed-market-impact Cont, R., Stoikov, S., & Talreja, R. (2010). A stochastic model for order book dynamics.
Hawkes price model. Finance and Stochastics, 20(1), 183–218. Operations Research, 58(3), 549–563. https://doi.org/10.1287/opre.1090.0780.
Alfonsi, A., Fruth, A., & Schied, A. (2008). Constrained portfolio liquidation in a limit Dassios, A., & Zhao, H. (2017). Efficient simulation of clustering jumps with CIR in-
order book model. Banach Center Publications, 83, 9–25. tensity. Operations Research, 65(6), 1494–1515.
Alfonsi, A., & Schied, A. (2010). Optimal trade execution and absence of price ma- Davis, M. H. A. (1984). Piecewise-deterministic Markov processes: A general class of
nipulations in limit order book models. SIAM Journal on Financial Mathematics, non-diffusion stochastic models. Journal of the Royal Statistical Society. Series B
1(1), 490–522. https://doi.org/10.1137/090762786. (Methodological), 353–388.
Almgren, R., & Chriss, N. (2001). Optimal execution of portfolio transactions. Journal De Saporta, B., Dufour, F., Gonzalez, K., et al. (2010). Numerical method for opti-
of Risk, 3, 5–40. mal stopping of piecewise deterministic Markov processes. The Annals of Applied
Almgren, R. F. (2003). Optimal execution with nonlinear impact functions and trad- Probability, 20(5), 1607–1637.
ing-enhanced risk. Applied Mathematical Finance, 10(1), 1–18. Desai, V. V., Farias, V. F., & Moallemi, C. C. (2012). Pathwise optimization for optimal
Ang, A., Papanikolaou, D., & Westerfield, M. M. (2014). Portfolio choice with illiquid stopping problems. Management Science, 58(12), 2292–2308.
assets. Management Science, 60(11), 2737–2761. Diamond, P. A. (1982). Aggregate demand management in search equilibrium. Jour-
Bacry, E., Dayri, K., & Muzy, J. F. (2012). Non-parametric kernel estimation for sym- nal of Political Economy, 90(5), 881–894.
metric Hawkes processes. Application to high frequency financial data. The Eu- Dufour, A., & Engle, R. F. (20 0 0). Time and the price impact of a trade. The Journal
ropean Physical Journal B, 85(5). https://doi.org/10.1140/epjb/e2012-21005-8. of Finance, 55(6), 2467–2498.
Bacry, E., Delattre, S., Hoffmann, M., & Muzy, J. F. (2013). Modelling microstructure Easley, D., & O’hara, M. (1987). Price, trade size, and information in securities mar-
noise with mutually exciting point processes. Quantitative Finance, 13(1), 65–77. kets. Journal of Financial Economics, 19(1), 69–90.
https://doi.org/10.1080/14697688.2011.647054. Engle, R. F., & Lunde, A. (2003). Trades and quotes: A bivariate point process. Journal
Bacry, E., Jaisson, T., & Muzy, J.-F. (2016). Estimation of slowly decreasing Hawkes of Financial Econometrics, 1(2), 159–188.
kernels: Application to high-frequency order book dynamics. Quantitative Fi- Foucault, T., & Menkveld, A. J. (2008). Competition for order flow and smart order
nance, 16(8), 1179–1201. routing systems. The Journal of Finance, 63(1), 119–158.
Bally, V., Pages, G., et al. (2003). A quantization algorithm for solving multidimen- Gabaix, X. (2016). Power laws in economics: An introduction. Journal of Economic
sional discrete-time optimal stopping problems. Bernoulli, 9(6), 1003–1049. Perspectives, 30(1), 185–206.
Bally, V., Printems, J., et al. (2005). A quantization tree method for pricing and hedg- Garman, M. B. (1976). Market microstructure. Journal of Financial Economics, 3(3),
ing multidimensional american options. Mathematical Finance, 15(1), 119–168. 257–275. https://doi.org/10.1016/0304-405X(76)90 0 06-4.
Barclay, M. J., Hendershott, T., & McCormick, D. T. (2003). Competition among trad- Gatheral, J. (2010). No-dynamic-arbitrage and market impact. Quantitative Finance,
ing venues: Information and trading on electronic communications networks. 10(7), 749–759.
The Journal of Finance, 58(6), 2637–2665. Giesecke, K., Kakavand, H., & Mousavi, M. (2011). Exact simulation of point pro-
Bäuerle, N. (2001). Discounted stochastic fluid programs. Mathematics of Operations cesses with stochastic intensities. Operations Research, 59(5), 1233–1245.
Research, 26(2), 401–420. Glosten, L. R., & Milgrom, P. R. (1985). Bid, ask and transaction prices in a specialist
Bäuerle, N., & Rieder, U. (2009). MDP algorithms for portfolio optimization problems market with heterogeneously informed traders. Journal of Financial Economics,
in pure jump markets. Finance and Stochastics, 13(4), 591–611. 14(1), 71–100.
Bäuerle, N., & Rieder, U. (2010). Optimal control of piecewise deterministic Markov Goldstein, M. A., & Kavajecz, K. A. (20 0 0). Eighths, sixteenths, and market depth:
processes with finite time horizon. Modern Trends in Controlled Stochastic Pro- Changes in tick size and liquidity provision on the NYSE. Journal of Financial
cesses: Theory and Applications, 123, 143. Economics, 56(1), 125–149.
Bayraktar, E., Horst, U., & Sircar, R. (2007). Queuing theoretic approaches to financial Grossman, S. J., & Miller, M. H. (1988). Liquidity and market structure. The Journal
price fluctuations. Handbooks in Operations Research and Management Science, 15, of Finance, 43(3), 617–633.
637–677. Gueant, O., & Lehalle, C.-A. (2015). General intensity shapes in optimal liquidation.
Bayraktar, E., & Ludkovski, M. (2014). Liquidation in limit order books with con- Mathematical Finance, 25(3), 457–495.
trolled intensity. Mathematical Finance, 24(4), 627–650. Guéant, O., Lehalle, C.-A., & Fernandez-Tapia, J. (2012). Optimal portfolio liquidation
Bertsekas, D. P., & Shreve, S. E. (1996). Stochastic optimal control: The discrete time with limit orders. SIAM Journal on Financial Mathematics, 3(1), 740–764.
case. Athena Scientific. Ha, Y., & Zhang, H. (2020). Algorithmic trading for online portfolio selection under
Bertsimas, D., & Lo, A. W. (1998). Optimal control of execution costs. Journal of Fi- limited market liquidity. European Journal of Operational Research.
nancial Markets, 1(1), 1–50. Hasbrouck, J. (1991). Measuring the information content of stock trades. The Journal
Biais, B., Hillion, P., & Spatt, C. (1995). An empirical analysis of the limit order book of Finance, 46(1), 179–207.
and the order flow in the paris bourse. The Journal of Finance, 50(5), 1655–1689. Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point pro-
Brogaard, J., Hendershott, T., & Riordan, R. (2019). Price discovery without trading: cesses. Biometrika, 58(1), 83–90. https://doi.org/10.1093/biomet/58.1.83.
Evidence from limit orders. The Journal of Finance, 74(4), 1621–1658. Hendershott, T., Jones, C. M., & Menkveld, A. J. (2011). Does algorithmic trading im-
Brunovskỳ, P., Černỳ, A., & Komadel, J. (2018). Optimal trade execution under en- prove liquidity? The Journal of Finance, 66(1), 1–33.
dogenous pressure to liquidate: Theory and numerical solutions. European Jour- Henderson, V., & Hobson, D. (2013). Risk aversion, indivisible timing options, and
nal of Operational Research, 264(3), 1159–1171. gambling. Operations Research, 61(1), 126–137.
Cai, F., Han, S., Li, D., & Li, Y. (2019). Institutional herding and its price impact: Ev- Horst, U., & Naujokat, F. (2014). When to cross the spread? trading in two-sided
idence from the corporate bond market. Journal of Financial Economics, 131(1), limit order books. SIAM Journal on Financial Mathematics, 5(1), 278–315.
139–167. Huang, Y., & Guo, X. (2011). Finite horizon semi-Markov decision processes with
Cao, C., Hansch, O., & Wang, X. (2008). Order placement strategies in a pure limit application to maintenance systems. European Journal of Operational Research,
order book market. Journal of Financial Research, 31(2), 113–140. 212(1), 131–140.
Carriere, J. F. (1996). Valuation of the early-exercise price for derivative securi- Huang, Y., & Guo, X. (2020). Multiconstrained finite-horizon piecewise determinis-
ties using simulations and splines. Insurance: Mathematics and Economics, 19(1), tic Markov decision processes with unbounded transition rates. Mathematics of
19–30. Operations Research, 45(2), 641–659.
Cartea, A., Jaimungal, S., & Ricci, J. (2014). Buy low, sell high: A high frequency trad- Huberman, G., & Stanzl, W. (2004). Price manipulation and quasi-arbitrage. Econo-
ing perspective. SIAM Journal on Financial Mathematics, 5(1), 415–444. metrica, 72(4), 1247–1275.
1065
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
Kavajecz, K. A. (1999). A specialist’s quoted depth and the limit order book. The Ozaki, T. (1979). Maximum likelihood estimation of Hawkes’ self-exciting point pro-
Journal of Finance, 54(2), 747–771. cesses. Annals of the Institute of Statistical Mathematics, 31(1), 145–155. https:
Kempf, A., & Korn, O. (1999). Market depth and order size. Journal of Financial Mar- //doi.org/10.1007/BF02480272.
kets, 2(1), 29–48. Peskir, G., & Shiryaev, A. (2006). Optimal stopping and free-boundary problems.
Kirilenko, A., Kyle, A. S., Samadi, M., & Tuzun, T. (2017). The flash crash: High-fre- Springer.
quency trading in an electronic market. The Journal of Finance, 72(3), 967–998. Philip, R. (2020). Estimating permanent price impact via machine learning. Journal
Kogan, L., Ross, S. A., Wang, J., & Westerfield, M. M. (2006). The price impact and of Econometrics, 215(2), 414–449.
survival of irrational traders. The Journal of Finance, 61(1), 195–229. Predoiu, S., Shaikhet, G., & Shreve, S. (2011). Optimal execution of a general
Kyle, A. S. (1985). Continuous auctions and insider trading. Econometrica, one-sided limit-order book. SIAM Journal on Financial Mathematics, 2, 183–212.
1315–1335. Shleifer, A., & Summers, L. H. (1990). The noise trader approach to finance. Journal
Lindley, D. V. (1961). Dynamic programming and decision theory. Journal of the Royal of Economic perspectives, 4(2), 19–33.
Statistical Society. Series C (Applied Statistics), 10, 39–51. Ting, C., Warachka, M., & Zhao, Y. (2007). Optimal liquidation strategies and their
Longstaff, F. A., & Schwartz, E. S. (2001). Valuing american options by simulation: A implications. Journal of Economic Dynamics and Control, 31(4), 1431–1450.
simple least-squares approach. Review of Financial Studies, 14(1), 113–147. Tirole, J. (2011). Illiquidity and all its friends. Journal of Economic Literature, 49(2),
Mamer, J. W. (1986). Successive approximations for finite horizon, semi-Markov de- 287–325.
cision processes with application to asset liquidation. Operations Research, 34(4), Tsitsiklis, J. N., & Van Roy, B. (2001). Regression methods for pricing complex amer-
638–644. ican-style options. IEEE Transactions on Neural Networks, 12(4), 694–703.
Mrázová, M., & Neary, J. P. (2017). Not so demanding: Demand structure and firm Tsoukalas, G., Wang, J., & Giesecke, K. (2019). Dynamic portfolio execution. Manage-
behavior. American Economic Review, 107(12), 3835–3874. ment Science, 65(5), 2015–2040.
Norberg, R. (2004). Vasiček beyond the normal. Mathematical Finance, 14(4), Vayanos, D., & Wang, J. (2013). Market liquidity-theory and empirical evidence. In
585–604. Handbook of the economics of finance: vol. 2 (pp. 1289–1361). Elsevier.
Obizhaeva, A. A., & Wang, J. (2013). Optimal trading strategy and supply/demand Zheng, B., Roueff, F., & Abergel, F. (2014). Modelling bid and ask prices using con-
dynamics. Journal of Financial Markets, 16(1), 1–32. strained Hawkes processes: Ergodicity and scaling limit. SIAM Journal on Finan-
cial Mathematics, 5(1), 99–136.
1066