1 s2.0 S0377221721004483 Main

European Journal of Operational Research 296 (2022) 1050–1066
Contents lists available at ScienceDirect
European Journal of Operational Research

journal homepage: www.elsevier.com/locate/ejor
Interfaces with Other Disciplines
Optimal liquidation problem in illiquid marketsR

Amirhossein Sadoghi a,∗, Jan Vecer b,c
a
Department of Finance and Accounting, Rennes School of Business, 2 Rue Robert d’Arbrissel, Rennes 35065, France
b
Department of Probability and Mathematical Statistics, Charles University, Sokolovska 83, 18675, Praha 8, Czech Republic
c
Department of Finance, Frankfurt School of Finance and Management, Adickesalle 32-34 60322, Frankfurt am Main, Germany
a r t i c l e i n f o a b s t r a c t
Article history: In this research, we develop a trading strategy for the optimal liquidation problem of large-order trading,
Received 18 November 2020 with different market microstructures, in an illiquid market. We formulate the liquidation problem as a
Accepted 17 May 2021
discrete-time Markov decision process. In this market, the flow of liquidity events can be viewed as a
Available online 25 May 2021
point process with stochastic intensity. Based on this fact, we model the price impact as a linear func-
Keywords: tion of a self-exciting dynamic process. Our trading algorithm is designed in such a way that when no
Finance favourite orders arrive in the Limit Order Book (LOB), the optimal solution takes offers from the lower
Optimal liquidation problem levels of the LOB. This solution might contradict conventional optimal execution methods, which only
Illiquid market trade with the best available limit orders; however, our findings show that the proposed strategy may
Markov-modulated poisson process reduce final inventory costs by preventing orders not being filled at earlier trading times. Furthermore,
Hawkes processes the results indicate that an optimal trading strategy is dependent on characteristics of the market mi-
crostructure.
© 2021 Elsevier B.V. All rights reserved.
1. Introduction vated by a real problem faced by traders wanting to liquidate large

portfolios in illiquid markets. The optimal solution to this problem
In an illiquid market, due to the lack of counterparties and is to trade by going deeper into the Limit Order Book (LOB) and
uncertainty about the value of assets, there is no guarantee that reduce the cost of the inventory punishment at the end of the pe-
assets will trade at fair value (Ang, Papanikolaou, & Westerfield, riod. The expected revenue of the trader is linked to his/her pref-
2014). Depending on the elasticity of the market, the effect of erences over a set of limit orders. We explain how the dynamics of
an increased offer will be compensated by a drop in the price price might be affected by the arrival rate of orders. Following this,
(Mrázová & Neary, 2017). Effectively, the initial price impact is par- we show that the information on the arrival rates of limit orders in
tially temporary and vanishes after the execution of the order (de- the order book can be used to compute the price impact. The re-
pending on the elasticity of the market). In a market like this, trad- sults of the algorithmic simulations show that the optimal value of
ing only at best bid and ask prices is not an optimal strategy. The the trading rate is dependent on the characteristics of the market
trader faces liquidity restrictions and should behave differently in microstructure and the dynamics of the incoming orders. In fact,
comparison to an unconstrained market. Conventional optimal ex- trading in a market with a high probability of the same types of
ecution methods mostly only trade with the best available limit or- orders arriving (self-exciting property of the arrival rate of orders)
ders. This strategy can increase final inventory costs in an illiquid is more profitable than other types of markets.
market. Our paper, in line with studies by Ha and Zhang (2020),
In this study, we introduce a new algorithm for trading in an Ting, Warachka, and Zhao (2007), Brunovskỳ, Černỳ, and Komadel
illiquid market, picking not only the best available orders but also (2018), considers the portfolio selection and optimal liquidation
orders in the deeper level of the order book. Our study is moti- problems of illiquid assets in markets that have limited liquid-
ity. Illiquid assets are characterised by long times between trades,
low turnover, challenges in finding counterparties, indivisibility
R
We thank two anonymous referees, and the Editor, Emanuele Borgonovo, for (Henderson & Hobson, 2013), and assets only being able to be
helpful comments that significantly improved the paper. We are grateful for valu- traded infrequently (Ang et al., 2014). Assets are generally illiq-
able feedback from Stefan Kassberger and Falko Fecht. The former title of the paper
uid, and some liquid assets occasionally become illiquid; conse-
is “Optimum Liquidation Problem Associated with the Poisson Cluster Process”.
∗
Corresponding author. quently, it is necessary to apply a search process to find a suitable
E-mail addresses: amirhossein.sadoghi@rennes-sb.com (A. Sadoghi), counterparty (Diamond, 1982). In a systemic way, illiquidity crises
vecer@karlin.mff.cuni.cz (J. Vecer). can occur when some assets are illiquid. For example, the cause of
https://doi.org/10.1016/j.ejor.2021.05.020
0377-2217/© 2021 Elsevier B.V. All rights reserved.
A. Sadoghi and J. Vecer European Journal of Operational Research 296 (2022) 1050–1066
the 2008-09 global financial problem was rooted in the illiquid fi- well as the price and size of stocks, which are available during the
nancial market (Tirole, 2011). When insolvent financial institutions execution time. To find an optimal solution to the liquidation prob-
must liquidate massive amounts of illiquid assets, they no longer lem, one needs to consider a trade-off between liquidity risk and
follow their optimal trading strategies. In this situation, financial changing the stock price. The former results in slow order execu-
institutions are forced to liquidate the assets based on available tion, while the latter is caused by exogenous events or a rapid liq-
orders as fear of the crisis spreads. Consequently, the illiquid asset uidation. An early liquidation causes an unfavourable influence on
price further drops, which can lead to fire sales (Diamond, 1982). the stock price, and a late execution has liquidity risk since the
Our liquidation model is primarily an algorithmic method. These stock price can move away from that at the beginning of the pe-
methods apply automatic procedures to make decisions about sell- riod.
ing or buying of shares and the submitting of orders. In modern There are different ways to measure illiquidity (Vayanos &
financial markets, algorithmic trading methods are employed to Wang, 2013), such as bid-ask spread Glosten and Milgrom (1985),
trade illiquid assets. Algorithmic trading represents a large share market depth (Kyle, 1985), turnover, and other asset characteristics.
of market activity and can enhance liquidity supply and fragmen- In our market model, illiquidity is caused by a disparity between
tation of order flows (Foucault & Menkveld, 2008; Hendershott, supply and demand, which means that the execution of larger po-
Jones, & Menkveld, 2011) even when financial markets experience sitions requires longer periods of time and cannot be performed in
systemic events like the flash crash on May 6, 2010 (Kirilenko, Kyle, continuous time. Therefore, we formulate the liquidation problem
Samadi, & Tuzun, 2017). as a continuous-time stochastic control problem. By applying the
To understand the behaviour of market participants, we need to Markov decision processes (MDP) approach, we decompose the liq-
analyse the stochastic fluctuations in stock prices and the reaction uidation problem into multiple deterministic optimal control prob-
of players in the market. These fluctuations can be explained by a lems. The discrete-time framework is closely related to classical
sequence of market equilibria, which are determined by demand trading algorithms. In most continuous liquidation problem setups,
and supply. Generally, informed traders prefer to trade mainly dur- it is assumed that supply and demand are in balance; however,
ing times of liquidity. As opposed to when the market is liquid this is only a reasonable assumption in a liquid market.
(Collin-Dufresne & Fos, 2015; Kyle, 1985), traders are willing to Through our research, we formulate the liquidation problem as-
submit small orders when the market is illiquid Philip (2020). sociated with the depth function of the LOB. Our algorithm con-
However, traders are not entirely rational, and their trading be- structs trading boundaries built-in on market depth to determine
haviours are governed to some degree by their beliefs or senti- an optimum strategy. When the stock prices hit one of these trad-
ments (Shleifer & Summers, 1990). This disagreement in the mar- ing boundaries, i.e. there is a liquidity event, the algorithm liquids
ket creates an imbalance between supply and demand for limit or- some part of shares by submitting limit orders. Past studies show
ders, which causes illiquidity in the market. In this market, assets the link between price impact and market depth and show that
can be traded only infrequently, contingent on the arrival of ran- there is a linear relationship between an orders flow imbalance
domly occurring liquidity events. Furthermore, the dynamics of or- and price changes in high-frequency markets. This fact is investi-
der arrival are influenced by the liquidity of assets. Therefore, trad- gated by Kempf and Korn (1999) who measure the market depth
ing activities have indirect effects on the dynamics of price. We use as a surplus demand amount that is needed for jumping one unit
this fact and model the dynamics of incoming orders and liquidity price. They show that there is a non-linear relationship between
events with the Hawkes process. This process can capture irregu- market depth and the price impact of the orders flow.
lar patterns of asset prices. Following this approach, we measure In our numerical simulation, we compare the performance of
the price impact of order executions based on a stochastic inten- our algorithm concerning various market characteristics, as well
sity process using the mutually- and self- exciting properties of the as price impact functions. In the case where favourite offers do
Hawkes process. not come, our algorithm will reduce the speed of trading and go
In order-driven markets, buy and sell orders arrive at different deeper into LOB to avoid the not filling of orders and facing an
time points and wait in the LOB to be traded. We study the dy- ultimate inventory penalty. This solution might contradict conven-
namics of the arrival rate of limit orders and the model distribu- tional optimal execution methods, which only trade with the best
tion of limit orders in a LOB with the help of order statistics. We available limit orders. In fact, in illiquid markets, lower price lev-
analytically express the probability of orders ranking, sorted based els of market depth are more attractive than upper levels; these
on price, in the LOB. Modelling the LOB distribution helps us to depth levels include orders with significantly larger volumes. Most
forecast order in the hidden levels of the LOB and exploit the or- optimal liquidation methods focus on the best sell and buy price
der information stored more in-depth in the LOB (Biais, Hillion, & levels since their imbalance can move prices. By causing the lack
Spatt, 1995). More precisely, knowing the LOB distribution and ex- of available liquidity, we need to take the lower level of market
pected order quantities in the hidden levels guides the algorithm depth into account.
to select the limit orders of top levels of the LOB (both best and The main contributions of this paper can be summarized as fol-
deeper levels of the LOB) to avoid risk execution and lack of offers lows. First, we develop a new algorithm to liquidate a large asset in
in the future. It is consistent with the finding of (Kavajecz, 1999) an illiquid market. This algorithm uses trading boundaries built-in
which limit order traders maintain a reasonable level of depth on the market depth and adjusts the speed of trading. It helps to
around illiquidity events to reduce their exposure to adverse se- avoid not filling of the order by picking up not only the best limit
lection losses. Our algorithm mimics informed traders’ behaviour orders from LOB. Second, we model the arriving pattern of limit or-
that uses depth as a strategic decision factor to adjust trading rate ders based on the Poisson cluster process. We then use the same
(Goldstein & Kavajecz, 20 0 0; Kavajecz, 1999). approach to model price impact (Proposition 1) and show how this
We set up a stochastic control framework to maximize the ex- model can extend to two main classes of price impact functions,
pected revenue of trading with consideration to liquidity restric- i.e. permanent and temporary (Lemma 2). Third, we determine the
tions in trading a large order. We formulate the liquidation prob- distribution of a random number of sorted limit orders in LOB
lem as a multi-stopping problem with Poisson arrival patterns of (Theorem 1). Forth, we solve the liquidation problem numerically
orders in a discrete-time model. The optimal liquidation problem with apply MDP approach (Theorem 2) and prove the uniqueness
can be expressed as an optimal control problem, to determine opti- of the solution (Theorem 3). Fifth, with employing this algorithm,
mal strategies of trading stock portfolios by minimizing some cost we investigate different trading scenarios in various market charac-
functions. These strategies depend on the state of the market, as teristics. The results show that a market with a higher level of the
1051
Table 1
Summary of the result of simulation the level of inventory and its corresponding optimal trading rate under different market conditions.
Panel I Quantile
Self-damping Revenue 10% 25% 50% 75% 100%
κ = 0.6 $50,067 Trade rate 0.36 6.17 12.75 19.28 44.16

σ = −0.6 Inventory level 6205.45 14980.97 22618.06 31680.03 70,000
κ = 0.2 $50,352 Trade rate 0.27 5.69 11.92 19.09 44.59
κ = 0.6 $50,000 Trade rate 0.13 5.53 11.36 21.00 40.38
Panel II Quantile
Self-exciting Revenue 10% 25% 50% 75% 100%
κ = 0.6 $50,779 Trade rate 0.59 6.17 12.54 19.27 43.20
σ = 0.1 Inventory level 3229.40 14751.81 22830.39 30507.01 70,000
κ = 0.2 $62,539 Trade rate 0.03 5.74 13.16 22.15 48.74
σ = 0.1 Inventory level 1593.42 11177.87 18155.91 31810.18 70,000
κ = 0.6 $94,691 Trade rate 0.34 9.68 16.12 26.71 72.67
σ = 0.6 Inventory level 2153.13 7580.03 19473.28 36405.45 70000
Panel III Quantile
Conventional method Revenue 10% 25% 50% 75% 100%
κ = 0.6 $42,017 Trade rate 0.125 4.47 9.15 16.28 39.16
κ = 0.6 $63,131 Trade rate 0.12 5.18 11.02 12.41 52.17
σ = 0.6 Inventory level 1053.03 3581.23 15433.78 31400.01 70000
self-exciting property of limit order arriving can be more profitable tween the current system state and the policys action. A similar
(Table 1). problem, which has been considered in the market microstructure
The rest of the paper is organized as follows. We review the re- literature by Garman (1976), who study a trading problem of a
lated literature in Section 2. Section 3 explains the market model market maker who maximizes her profit by assuming order arrival
setup and the problem statement and describes the statistical rates depending on the price dynamics governed by the Poisson
model of the LOB. Section 4 presents the stochastic dynamics of process. Some other studies explain the unconditional and steady-
the intensity of the order arrival process, and we turn to model state distributions of the order book.
price impact. Section 5 describes the procedure of solution with The homogeneous Poisson process of sell and buy orders ar-
using discrete-time Markov Process. In Section 6, we explain the rival rate was the primary assumption of past studies. Garman
numerical method for optimal stopping time and simulate with (1976) explain conditions necessary and sufficient for the order ar-
different market microstructures. Section 7 summarizes the results rival patterns to be modelled by a homogeneous Poisson proces-
and concludes the paper with further remarks. sor. In this framework, none of the agents’ trading can dominate
other agents, or they place a large number of limit orders in a
2. Related literature finite time. Nevertheless, most of these assumptions are violated
in high-frequency trading. Empirical studies of high-frequency data
Our paper relates to the recent and growing literature on op- show that there are significant cross-correlation patterns of the ar-
timal execution problems. The primary research on this subject rival rate of similar limit buy or sell orders and significant auto-
date back to 1990s, and mainly focus on the discrete-time models, correlation in durations of events, (see: Cont, 2011). Dufour and
in which optimal strategies are determined as optimal liquidation Engle (20 0 0) use a framework provided by Hasbrouck (1991) and
rates per unit time. Bertsimas and Lo (1998), as a discrete-time investigate the time duration between trades. They determine that
model, study the optimal liquidation of a large block of shares with the trade duration process is correlated with the information about
linear permanent price impact in a fixed time horizon. Almgren current and past transactions. Some empirical studies like Zheng,
and Chriss (2001), as one of the most cited optimal execution Roueff, and Abergel (2014) show that the dynamics of the Bid and
models, constructs an efficient frontier of execution strategies via Ask price can be modelled using Hawkes processes, introduced by
mean-variance analysis of expected costs of liquidation and divide Hawkes (1971). The irregularity properties of high-frequency fi-
the price impact into temporary and/or linear permanent price im- nancial data can be explained by the self-exciting and mutually-
pact. They use diffusion price processes in a continuous-time trad- exciting properties of the Hawkes processes. We use this fact and
ing space. In our study, we model a similar liquidation problem model dynamics of incoming limit orders using Hawkes processes.
with a restriction on liquidity in continuous-time trading time. We In a similar study, Engle and Lunde (2003) model trades and quote
then solve the model in a discrete-time space. arrivals with bivariate point processes. Some empirical studies like
Our study of an optimal stopping problem is a link between Bacry, Delattre, Hoffmann, and Muzy (2013) show that high fre-
two different areas: control theory and market microstructure. In quency data can be modelled using Hawkes processes. Our base-
control theory, this problem has been studied as a single or multi- line model only considers the submission of limit orders in a LOB.
stopping problem in the classical best choice problems by us- Therefore, we apply univariate bivariate point processes to model
ing homogeneous Poisson processes. A single stopping time prob- the dynamics of incoming limit orders. Our baseline model can
lem governed by the Poisson process was formulated as the best be extended to apply bivariate point processes for submission and
choice problem in the late 1950s by Lindley (1961). The op- cancellation processes. Several studies extend Hawkes processes.
timal k-stopping problem with finite and infinite time horizon Chehrazi, Glynn, and Weber (2019) develop a traceable model
was presented with the complete solution by Peskir and Shiryaev based on a controlled self-exciting point processes framework to
(2006) in a Bayesian formulation. A recent study by Ciocan and predict the repayment behaviour of unsecured loans placed in
Mišić (2020) develops an interpretable optimal stopping model credit portfolios. They also consider the Hawkes processes along-
which helps the decision-maker to determine the connection be- side an extra predictable process that models account-treatment
1052
actions. Similarly, Dassios and Zhao (2017) extend Hawkes pro- one the component of price model (Almgren & Chriss, 2001; Bert-
cesses with a diffusion component to estimate default probability simas & Lo, 1998). Empirical researches show that in a liquid mar-
and model a portfolio loss process. ket, trading activities cannot alter the dynamics of the price for a
In our research, we study the statistical model of the LOB. We long time.
determine an analytic expression of the distribution of limit or- Conversely, this component can be one of the basic building
ders in Theorem 1. Garman (1976), Bayraktar, Horst, and Sircar blocks of price impact models in the illiquid market. The second
(2007) study the fluctuation of the orders arriving in the elec- tier of the price impact model is related to modelling the tem-
tronic trading market. Cont, Stoikov, and Talreja (2010) develops porary impacts that just have an effect on the current orders for
a tractable stochastic model of limit order markets to capture the a short time and not for the entire trading time. This component
main statistical features of LOBs. They show the short-term price cannot alter the dynamics of the price for an extended period and
movements could be explained by the information on the current just has an impact on the immediate execution of the trades. Some
state of the LOB. Their model is based on the dynamics of the best studies like Chen, Feng, Peng, and Ye (2014), Chen, Feng, and Peng
bid and offer queues since the best orders can move the price. (2015) consider both permanent and temporary price impact in op-
This study uses Laplace transforms to analyse the behaviour of the timal deleveraging problem and other studies like Brunovskỳ et al.
order book with Poisson arriving patterns of buy- and sell- limit (2018) leaves out the permanent effect on optimal trade execution
orders. Unlike these studies, we express the distribution of LOB, problem. The transient impact is the third class of price impact;
sorted by price, thoroughly based on order statistics methodology. this component can be significant for a finite period, and it even-
This distribution, which is defined analytically in Theorem 1, can tually vanishes (see: Alfonsi, Fruth, & Schied, 2008; Gatheral, 2010),
help us to estimate the distribution of a random number of arrival Predoiu, Shaikhet, and Shreve (2011) models price impact by con-
limit orders in a LOB. sidering this component. The theoretical model by Huberman and
Our paper considers a liquidation problem when trader faces Stanzl (2004) show that permanent price impact can be linear with
liquidity restrictions, and she should adjust her trading strategies. trade size. This model also shows that with having temporary price
The impact of illiquidity on trading activities has been investigated impact in a general form, the permanent price impact must be lin-
in some theoretical studies like (Easley & O’hara, 1987; Kyle, 1985) ear.
as well as some empirical studies like (Chordia, Roll, & Subrah- The last class of the price impact modelling is to control the
manyam, 2001; Chordia, Subrahmanyam, & Tong, 2014). The the- rate of arrival of limit orders via the trading rate process. Cai, Han,
oretical models mostly focus on mechanisms underlying the trad- Li, and Li (2019) study the price impact of herding in the fixed-
ing activities faced with the cost of liquidity constraints, e.g. a the- income market and show that buy and sell herdings are asso-
oretical model by Grossman and Miller (1988) tries to determine ciated with permanent price impact and transitory price impact.
the equilibrium of liquidity level of the market. Empirical studies Alfonsi and Blanc (2016) introduce a mixed market impact Pois-
mostly concern about changing depth and trading activities due to son model to analyze a temporary shift of dynamics of the rate of
some significant macroeconomic factors and revolutions in trading order arrival. This model used the advantage of the self-exciting
technologies. property of the Hawkes process to change the direction of trad-
In studies by Bayraktar and Ludkovski (2014), Horst and Nau- ing in the same or opposite direction of order arrivals. Studies by
jokat (2014), the liquidity model of Almgren (2003) has been ap- Bayraktar and Ludkovski (2014), Guéant, Lehalle, and Fernandez-
plied to determine temporary and permanent impacts of trading Tapia (2012) control the intensity of limit orders for the liquida-
on fundamental price of asset. These studies design optimal liq- tion problem in a risk-neutral and risk-averse model, respectively.
uidation strategies in illiquid markets. Bayraktar and Ludkovski Gueant and Lehalle (2015) extend this model to a general form of
(2014) formulate the liquidation problem as a Hamilton–Jacobi– dynamics of intensity. From a traditional viewpoint on this prob-
Bellman equation association with the depth function of the LOB. lem (see: e.g. Alfonsi & Schied, 2010), the optimal liquidation de-
In our paper, we study a liquidation problem which is formulated pends on the existence of a sufficiently large limit order in the LOB,
similarly to Bayraktar and Ludkovski (2014); however, we solve and price impact is a function of the shape and depth of the LOB.
the problem numerically in a discrete space similar to (Bäuerle & Our modelling of price impact mainly belongs to the third class
Rieder, 2009; Huang & Guo, 2020). of price impact models. Essentially, in illiquid markets, the trader
faces a liquidity problem, and she should wait for a longer time
2.1. Related literature on price impact to execute her orders. Empirical studies show that this market is a
plastic (inelastic) market, and limit orders do not arrive in contin-
In the current study, we use a stochastic intensity process to uous time. The characteristics of this market allow us to model the
measure the price impact of order executions. Price impact is a tra- pattern of the arrival of orders in discrete space and express an im-
ditional topic in conventional market microstructure research (see: pact price function as a consequence of trading on this pattern. In
e.g. Dufour & Engle, 20 0 0; Hasbrouck, 1991; Kyle, 1985). Modelling Proposition 1, we introduce a new price impact, defined on an im-
of price impact on illiquid markets is not widely studied; we re- pact of trading on the arriving order book. Similar to impact price
view some different generations of price impact modelling in the models in the third class, such as Bayraktar and Ludkovski (2014),
literature. An illiquid asset can only be liquidated contingent on Guéant et al. (2012), our price impact model can control the in-
the arrival of liquidity events. Accordingly, we cannot consider as a tensity of limit orders, and resultantly can control/mitigate the im-
circumstance in which agents trade continuously in time, and there pact of price trading. In the Lemma 2, we show how this model
is no price impact (Kogan, Ross, Wang, & Westerfield, 2006). can be related to the first and second classes of price impact mod-
We can loosely classify price impact models based on their ef- els. Moreover, similar to the Kyle (1985) model, our impact price
fects and how long they influence the dynamics of the prices in model is conditional on the state of liquidity in the market.
the following categories. We then show the link that our price im-
pact model has to these classes. 3. Market model setup
The first class is a permanent price impact. As a consequence
of a significant discrepancy between supply and demand in the It is assumed that our financial market consists of an illiquid
market and a spread of bulk trading information, the dynamics of asset and a risk-free asset. The illiquid asset and the risk-free as-
price have stable shifts. Kyle (1985) proposed a basic microstruc- set can be considered as a risky asset, and as a numeraire with
ture model to analyze the price impact. Permanent impact was the interest rate r, respectively. The market for the risk-free asset
1053
is liquid, that means traders can liquidate a large amount of this to predict different markets quantities conditional on the current
security without facing costs of price impact. state of the order book. On the other hand, this information might
In a complete financial market, the price process St is a stochas- be used by counter-parties to take advantage of price movement
tic process on a complete filtered probability space (; F; F; P ) and submit or cancel strategic orders. The idea of the dark pool is
where F is the filtration generated by {Ft }t∈T . This space is to reduce the information linkage and adverse price risk.
bounded by a maturity time T . The assumption of the finite Underlying our approach is that the dynamics of limit order
discrete-time space is contradictory to conventional liquidation arrivals follow a Poisson pattern of prices depending on rates of
problems. However, it is not far away from reality; order arrival trading. We study the stochastic dynamics of the arrival rate of the
patterns of buy and sell orders are not the same. Due to fluctua- limit order by using order statistics methodology.
tions in the stock market, especially in a high-frequency environ-
ment, orders might not appear regularly in the LOB. Some empir- Distribution of a limit order book
ical studies show that, in a short period, the percentage changes j
of a stock price are not uniformly distributed with the same cen- We assume in a financial market, the price processes (St )( j=1···d )
trality, but price processes can be internally steady-state processes. of limit orders satisfy the following stochastic dynamics for t ∈
[0, T ] is dSt = St (μt dt + σ dWt ).
j j
Therefore, the model should be solved and interpreted in discrete
j
time and space. We use these facts and propose a model with Given a vector of price of buy (sell) orders (St )1≤ j≤d on the
the rate of incoming buy orders as a Poisson process in a finite probability space (; F; P ), the orders are sorted into a vector
discrete-time and space. (St(1) , St(2) , · · · , St( j ) ) satisfy the following chain conditions:
3.1. Trading boundaries

(St(1) ≤ St(2) , · · · , ≤ St( j ) ).
( j)
The vector (St(1 ) , St(2 ) , · · · , St ) is a so-called vector of order statis-
Traders often follow trends of price and construct trading j
tics of the price processes (St ) which is sorted by price.
boundaries built-in on market depth. They then submit limit or-
It is assumed that noisy traders cannot have a strong influence
ders in touch with an optimum volume when the stock prices hit
on the market, and no cancellation or strategic cancellation of or-
one of these trading boundaries (liquidity events). Market depth
ders can occur. These assumptions are crucial since Brogaard, Hen-
reflects the information related to the prices of buying and sell
dershott, and Riordan (2019) show the strong effects of limit order
orders for the price and depends on trade volume and minimum
submissions and cancellations on permanent price impacts. In an
price increment known as tick size (Goldstein & Kavajecz, 20 0 0).
illiquid market, liquidity events occur infrequently, and the rate of
It is continuously changing and reflects the valuable information
arrival of limit orders is governed by a Poisson process. The liquid-
about the current orders sitting in the LOB. Knowing this informa-
ity events in the non-overlapping intervals are independent.
tion helps traders to understand hidden patterns of price move-
In a small time interval dt, the probability that one limit order
ments and price impact. Traders can use this potential information
arrives and sits in the LOB is: P (Ndt = 1 ) = λ1!
dt −λdt
e = λdt + o(dt ).
and regulate their orders in response to the net order flow.
Similarly, the probability that no limit order appears in the LOB in
Market depth also gives the overall picture of market conditions
this interval is: P (Ndt = 0 ) = e−λdt = 1 − λdt + o(dt ), see Fig. 2. In
and a short term prediction to determine an optimum strategy, e.g.
the following Theorem, we determine a distribution of a random
price movement towards selling pressure or buying strength in the
number of sorted limit orders in LOB.
short time, especially in illiquid markets. Some platforms apply a
number of restrictions for market participants to observe or trade Theorem 1 (Distribution of the Limit Order Book). Denote by L the
at the best level; we assume all orders are observable and accessi- number of unexecuted orders at time s, we assume that in the inter-
ble for traders. Market depth can be affected by the transparency val (0, t ), the number of orders (Nt ) is a random variable with the
Poisson distribution with mean value λt. Vector {St1 , · · · , St t } repre-
N
of platforms in such a way that some levels of market depth are
hidden, and just the latest submitted orders are available. Limiting sents the prices of buy (sell) orders with distribution function F on the
trading and adjusting the minimum tick size are essential mecha- probability space (; F; P ). If U is the number of unexecuted orders
with prices higher than y = max{St1 , · · · , St t }, then the probability of
N
nisms to improve the efficiency of the market. As mentioned be-
fore, in illiquid markets, lower price levels of market depth include having k unexecuted orders with prices higher than y is:
orders with significantly larger volumes. Therefore, to liquidate a
e−λy · λky
large number of shares, we need to take the lower level of market P (U = k|Ns = L ) = ,
depth into account. k!
where
3.2. Statistics model of limit order book
λL+1
λy = λ 1 − F (y )t − ln eλF (y) − .
The Limit Order Book (LOB) contains information about orders,
( L + 1 )!
such as quantity, price, and type of order. Traders use knowledge Proof. If the number of orders is a random variable with the Pois-
of the LOB in their trading strategies, see (Hasbrouck, 1991). Philip son distribution and the mean value λ in a finite interval of length
(2020) describes the information included in the configuration of t. We consider the order with the maximum price of N (0,t ) number
the LOB is a significant element of permanent price impact. The of limit orders. Let FN (0,t ) (y ) be the distribution of
price impact is dependent on the traders private information as (0,t )
y = max{St1 , · · · , StN }.
well as their information from the LOB. Philip (2020) provides em-
Indeed
pirical evidence that the shape of the order book has substantial
effects on the dynamics of trade. Trading against the thin (slight) FN (0,t ) (y ) = P[(S1 ≤ y ) ∩ (S2 ≤ y )∩, · · · , SN ≤ y )]
side of LOB has a higher permanent price impact than trading to- (0,t )
= F ( y )N
ward the thick (deep) side of the LOB. Cont et al. (2010) shows that
this information contains short-term price movements and might The generating function with distribution function F (St ) is
change quickly during the trading period. On the one hand, this (0,t )
Gt (F (y )) = E[F (y )N ]
information can be useful for reducing the complexity of the re-
N (0,t +dt )
lation between price fluctuations and LOB dynamics. It also helps Gt+dt (F (y )) = E[F (y ) ]
1054
(0,t ) +N (t ,t +dt )
= E[F ( y )N ] 3.3. Problem statement
N (0,t ) N (t ,dt )
= E[F ( y ) ]E[F ( y ) ] The dynamics of the price is described by a right-continuous
= Gt (F (y ))(1 − λdt + λdtF (y )) process (St )(t≥0 ) changing while the times when the book order
Gt+dt (F (y )) − Gt (F (y )) process meets boundary conditions. Later on, we discuss the im-
= −λ(1 − F (y ))Gt (F (y )) pact of the current trade on the underlying price by using a par-
dt
d ticular approach to model price impact based on exogenous factors
(Gt (F (y )) = −λ(1 − F (y ))Gt (F (y )) as well as characteristics of the stock price processes. We show
dt
how to model the dynamics of the intensity of orders arrival cor-
d
ln(Gt (F (y )) = −λ(1 − F (y )) responding feedback effects of trading and the state of the market
dt
during orders execution period.
Solving for F (y ) gives We consider a problem for an investor holding a large volume
Gt (F (y )) = e−λt (1−F (y )) (3.1) (Q 0 ) shares of illiquid assets and a risk-free asset. The objective of
the shareholder is to maximize her profit with subject to liquidity
With assumption of a L number of unexecuted orders at the time constraints and limited execution time.
point s, (s < t ), the generating function for the time interval (0, s ) The illiquid asset can be considered a risky asset with the fol-
is: lowing dynamic for t ∈ [0, T ]
λ0 λ1 dSt = St (μt dt + σ dWt ),
Gs (F (y )) = e −λ ( F ( y ) 0 ) + e −λ ( F ( y ) 1 )
0! 1!
and a risk-free asset used as a numeraire with the interest rate r
λL
+ ··· + e −λ ( F ( y ) L ) with the following dynamic:
L!
dBt = B0 ert dt.
(λF (y ))0 (λF (y ))1 (λF (y ))L
= e −λ + + ··· +
0! 1! L! Let Sˆ = e−rt St be a martingale with respect to the measure P on a
complete filtered probability space(; F; P ).
Use Taylor series with remainder:
The trading strategy of the trader is characterized by a trading
f (c )λL+1 rate (γt )t∈T . The vector γ contains the information on the amount
Gs (F (y )) = e−λ eλ(F (y )) − (3.2)
( L + 1 )! of trading at each time point t.
The dynamics of the inventory of the investor holding Q 0 shares
For c ∈ [0, 1], f (c ) = F (y )L+1 ≈ ecλ . For the sake of simplicity, it is of an illiquid asset with the trading rate γ as a control process is
assumed that c = 0. given with the following counting process:
Gt (F (y )) = Gs+(t−s ) dqt γ = −γt dNt γ ,

= Gs (F (y ))Gt−s (F (y )) where the is a fraction of Q 0 shares, assumed to be constant at
From Eqs. (3.1) and (3.2), the generating function of time interval each stopping time i.e. either we fill whole orders or reject offers.
((t − s ), t ): We should note that is a block of shares, each block being of the
same volume. Typically, in practice, equals the average (or me-
e−λ(1−Fx (y )) dian) trade size, which can be fulfilled partially. The assumption of
Gt−s (F (y )) = L+1
λF (y )− λ might be questionable, but it reduces the problems complexity
e−λ ln(e (L+1 )! )
L+1
from a theoretical point of view. For the sake of simplicity, we have
−λ[(1−Fx (y ))−ln(eλF (y ) − (λL+1)! )]
=e also assumed = 1. Let Nt be a counting process, and Ft be a cash
flow process with dynamics:
Consider the number of orders in the coming stopping time is
a random variable with Poisson distribution and mean value λt. dFt γ = Sˆt γt dNt γ .
From the above generating function, we can define the probability
The investor has a finite time to liquidate her risky assets and
that no order arrival in time interval (0, t ) is said to have a Poisson
maximize an expected utility function of her terminal cash account
distribution greater than y:
(wealth):
e−λy · λky
P (U = k|N (0,s ) = L ) = sup (E[U (FT )] ),
k! γ ∈A(Q 0 )
where FT is the amount of the cash at the end of time horizon

λF (y ) λL+1 T and U is a utility function to model the effects of illiquidity on
λy = λ (1 − F (y )t − ln e −
( L + 1 )! preferences of a risk-averse agent. The utility function helps us to
measure investor’s preferences and the amount of risk they are
where F (y ) is the distribution of the price process and L is the willing to undertake in the hope of attaining better opportunities.
number of unexecuted orders up to time point s. Where k = 0, this We define A(Q 0 ) as a set of admissible strategies with given ini-
distribution function gives the probability of the first best order, tial inventory with Q 0 shares to have nonnegative inventory at all
k = 1 is the second best order, etc. times:
With given k = 0, Equation (3.1) gives the probability of best or- A(Q 0 ) ≡ {γ : γ is a predictable process, and admissible
ders (i.e. none of orders with price (Sti )i∈{1,··· ,Nt } has greater price
strategy from Q 0 }
than y), with k = 1 gives the probability of the second best order,
and so on. The number of limit of order book is not constant and We shall use A ≡ A(Q 0 ) for the set of admissible strategies γ .
it can be changed at each stopping time. In a dynamic framework, The liquidation problem can be formulated as a multi-stopping
the proposed distribution in Theorem 1 can help us to estimate problem with discrete-time sequences. The trader can liquidate her
the statistical order distribution of a random number of orders ac- shares in the m stops T = {1 ≤ T1 < · · · < Tm ≤ n} with a discrete
cumulated in the LOB. LOB. The goal is to maximize the expected gain at each stopping
1055
problem; the value function can be defined by:
V (t, q ) = sup Et,q [H γ (t, q )], t ∈ [0, T ], q ∈ [0, Q 0 ]. (3.3)

γ ∈A
Assumption 1 (Continuity and concavity of the value func-

tion). We assume that the value function V is a continuous and
bounded function. It is also strictly concave in q, increasing in t
and non-negative. The differentiability condition of the value func-
tion is not necessary to be satisfied.
The strict concavity of value function V is an essential assump-

tion that implies the existence of optimal controls and steady-state
stability (Bäuerle, 2001; Bäuerle & Rieder, 2010). Then under As-
sumption 1, the terminal wealth value function of the investor who
maximizes her wealth at the end of time period T , with given dis-
counted price process Sˆ, and inventory Q 0 , associated to optimal
trading rate γ , can be expressed as the follows:
Lemma 1. Let V (T , Q 0 ) be the continuation value that is obtained

from optimal trading of Q 0 shares until the end of period T and the
intensity process λt be defined as a rate of arrival of limit orders. The
Fig. 1. Schematic model of trading boundaries built on market depth with Poisson expected revenue from the execution of limit orders with arrival pat-
arrival patterns. terns with Poisson distribution is equal to:
T

sequence Ti (i ≤ m ). The trader wants to maximize the expected
V (T , Q ) = sup Et,q
0
e −rT
γt St λt dt , (3.4)
γ ∈A 0
utility function of her final wealth. The expected utility allows the
trader to compare the gain and loss of her trading strategies. Ex- where T
= T ∧ inf{t ≥ 0 : Q 0 − qt = 0} is trading time and trad-
pected utility is comparable to expected revenue which means that ing process γt is a control process, which has influence over the cash
the trader prefers a trading strategy that generates the highest ex- process, the inventory process, and the dynamics of prices. The vec-
pected utility. tor γ contains the information on the amount of trading at each time
The optimum expected revenue of the trade can be modelled as point (γt ).
a dynamic optimization problem which is an interaction between
Proof. It is assumed that the limit orders arrival rate is a point
price impact and price dynamics. We apply the Bellman’s optimal-
process with intensity rate λt . At each stopping time, we liquid qt
ity equation in a recursive format to solve the m-stopping prob-
shares of an illiquid asset with dynamics dqt γ = −γt at price Sˆ =
lem.
e−rt St . The trading rate γt is a control process. From Eq. (3.3) we
max(Expected Revenue ) have:
= max(Immediate Exercise + Continuation ). V (T , Q 0 ) = sup Et,q [H (T , Q 0 )]
γ ∈A
The trader holds Q0
number of shares and places a selling order an
k = γ (we have assumed = 1) illiquid asset in the market with
n
the trading rate γ in the time horizon T , the performance criteria = sup Et,q e −rTi
γt St 1(Ti ≤T )
γ ∈A
with the strategy γ is given by: i=1
T

H γ (t, Q 0 ) = hγ (t, k ) + Et,q [H γ (t + 1 ), Q 0 − k], (when n → ∞ )(t ∈ T ) = sup Et,q e −rt
γt St dNt
γ ∈A
where h(t, k ) is the depth function (concave and increasing in k;
0

decreasing in t): T
= sup Et,q e −rt

γt St λt dt
h(t, k ) = max((S(t,k ) − ES(t,k ) , 0 )) γ ∈A 0
= (S(t,k ) − ES(t,k ) )+ ,
where:
4. Modeling stochastic intensity and price impact

n
S(t,k ) = St(i ) 1(i≤k ) .
i=n−k+1 The rate of arrival of limit orders depends on the price and size
of orders; the cheaper orders will remain on the LOB for a shorter
Let ES(t,k ) be the expected boundaries, constructed from the set
time. It is empirically shown that the distribution of the price is
of best limit orders in the LOB by using the distribution of the or-
not constant and depends on the current state of the LOB. The ex-
der book at the stopping time t.
istence of significant autocorrelation of price movement and cor-

n
relations across time periods rejects the natural assumption of a
ES(t,k ) = P (i ≤ k )St(i ) . constant intensity of order arrivals rate (see: e.g. Cont, 2011). In an
i=n−k+1
illiquid market, bid and ask orders do not arrive consistently, and
At the stopping time, when one of these expected boundaries counterparties do not meet their demands regularly. These irreg-
(see Fig. 1)is hit by the stock price, we execute some proportion of ular proprieties of a high-frequency environment lead to applying
the inventory via limit order at the LOB with the depth function point process to model time series of the bid or ask price move-
h(t, k ) with a boundary condition: h(T , 0 ) = h(0, Q 0 ) = 0. ments. Garman (1976) propose models, in which order arrivals are
In this regard, the goal of the investor is to maximize her governed by a point process with constant intensity. There are sev-
wealth at the end of the period T associated with this dynamic eral conditions in previous trading models that do not hold our
1056
setting. First, a small number of traders cannot dominate the mar- is influenced by the arrival time of orders, the order value, and in-
ket with large-scale orders. The second condition is that the sub- directly, the price will be affected by trading.
mitting of orders is independent, and we mainly have an assump- Generally, the supply/demand of financial securities is not elas-
tion of market efficiency. These conditions are essential for having tic (Obizhaeva & Wang, 2013). According to fundamental concepts
a constant intensity of order arrivals. However, the structure of the of economics, in equilibrium, the relationship between supply and
market is dynamic, and high-frequency traders dominate over sev- demand determines the price that traders are willing to take for
enty percent of the market; therefore, none of the above conditions positions during a specified period. If the market is illiquid, trad-
can be satisfied. ing a large number of shares produces a large price fluctuation
Hawkes process as a point process was initially applied to (Mrázová & Neary, 2017). High-frequency traders Often execute
model earthquake occurrences. Some empirical studies show that trades in the course of the order book imbalance. As a result of this
the Hawkes process can fit with high-frequency data (Bacry et al., shock, the equilibrium of supply and demand could be changed,
2013; Bacry, Jaisson, & Muzy, 2016; Chavez-Demoulin & McGill, and the price must rise/decline to reach a new equilibrium (see:
2012). This process can explain the irregularity properties of data Fig. 10). The price movement occurs when a change in demand is
based on the positive and negative jumping behaviour of the asset caused by a change in a number of market orders. This change in
prices. Cartea, Jaimungal, and Ricci (2014) use the Hawkes process the price equilibrium is sensitive to shifts in both prices, and quan-
to express the dynamics of market orders and the LOB. This pro- tity demanded, which is called the ”elasticity of demand” (Mrázová
cess is a general form of a standard point process. The intensity of & Neary, 2017).
this process is conditional on the recent history, increase the rate We model this impact with close form solutions for the dynam-
of arrival of the same type of event (self-exciting property), and it ics of price impact. Let function f (γt ) be a general form of the
captures the impact of arrivals of orders on other types of orders impact market, conditional on the state of liquidity of the market.
(mutually-exciting property). In A.1, we explain the basic concept Similar to Kyle (1985) model, we use parameter α , as inverse liq-
of a point process and the dynamic of the Hawkes process. uidity of the market, to measure price impact. Function (t − s )
is the decay of price impact function, which is independent of the
state of the market. This function can be shown by the exponential
4.1. Price impact model or power-law decay of price impact. Gabaix (2016) shows power-
law functions can explain several patterns in the stock market, in-
Estimating and modelling price impacts is a crucial research cluding price impact.
area in market microstructure literature. It can be expressed as a Different types of price impact and decay functions have been
relationship between trading activities and price movements. Mon- used in the algorithm trading and market microstructure litera-
itoring and controlling the impact of trading are the main parts ture. We define two well-known impact functions: exponential and
of algorithm trading. To minimize the market impact, traders split power-law functions, linked to the exponential decay function. The
their orders into smaller chunks based on the current liquidity in parameter α is a proxy for the inverse of the market liquidity:
the LOB. Price impact might be dependent on exogenous factors • Power law impact function:
like trade rate and some endogenous factors, such as liquidity and
volatility. It is empirically observed that there is a changing of the f (γ ) = γ α , α > 0.
volatility of prices on trading activities. • Exponential impact function:
Kyle (1985) proposes a simple model for the evolution of mar- f (γ ) = exp(αγ ), α > 0.
ket prices and price movement. In his model, noise traders and
inform traders submit orders, and a market maker executes the Gatheral (2010) examines the dynamic-arbitrage for different
orders. The price is adjusted to a linear relationship between the combinations of the market impact function and decay function
trade size and a proxy for market liquidity. As a consequence of to not admit price manipulation strategies. (Alfonsi and Schied
trading, the price moves permanently, information affects the price (2010), proposition 2) proves the convex and non-constant expo-
for a long time, and price changes are strongly autocorrelated. The nential price impact, leads to the strictly positive definite property,
most recent literature on market microstructure shows that for liq- to avoid arbitrage implication, which is already stated in Gatheral
uid markets, price impact cannot be permanent. In highly liquid (2010). Nonlinear impact function that is not a related state of the
markets, outstanding shares are small and need less time to be liq- market is questionable; however, the above functions defined in an
uidated. illiquid market depend on the state of the market.
In illiquid markets, the temporary price impact governs the en- We model the dynamics of the order arriving intensity with the
during impact, and after a considerable number of trades, the price Hawkes process as a point process. This model can capture irreg-
movement shows a high resilience. Empirical studies show that ular properties of the high-frequency data. The strength of incen-
an elastic market can be converted to a plastic (inelastic) mar- tive to generate the same event is presented with parameter σ .
ket as a consequence of lacking counterparties, such that high- With choosing the price impact function f as a function of a trad-
volume trading has a long-lasting impact. In this market, traders ing rate γt and the parameter κ as the exponent of the decay of
face difficulty in finding counterparties at a particular time, and market impact, we can model a liquidation effect on the trade ar-
they should wait for a longer time to execute the orders or else rival dynamics. This model can be shown by a particular form of
cooperate with counterparties. The effect of price impact corre- the Hawkes process with the following SDE:
sponding to the liquidation strategy can be significantly large for dλt = ( f (γt ) − κλt )dt + σ dNt . (4.1)
substantial risk-averse traders who liquidate shares at a fast rate. In the following proposition, we model the impact of trading on
For this study, we use a stochastic intensity process to measure the rate of order arriving based on the Hawkes process:
the price impact. This model is based on a counting process us-
ing the mutual-exciting property of the Hawkes process. In illiquid Proposition 1 (Price Impact Model). Let f be a price impact func-
markets, the imbalance between supply and demand causes illiq- tion and a function of a trading rate γt . σ and κ represent the magni-
uidity, and the execution of a larger position needs a longer time tude of self-exciting and the exponent of the decay of market impact,
and effect on the arrival of the new orders, or to stay longer in the respectively. The solution of the SDE is expressed by:
LOB. Also, we explained the coming pattern of orders governed by t t
a stochastic point process. Therefore, the intensity of order arrival λt = ( f (γs )(t − s ))ds + σ e−κ (t−s ) dNs , (4.2)
0 0
1057
t+ε
where the function is the decay of impact.
= lim eαγs e−κ (t−s ) ds ≈ e(−κ t+αγε ) = λε .
ε →0 t
Proof. Following SDE represents the impact of trading on the dy-
namics of the rate of orders’ arrival: We can simply define the instantaneously affected stochastic in-
tensity by:
dλt = ( f (γt ) − κλt )dt + σ dNt .
t
To prove, we can move the first term of SDE to the left side, then λtInst = λε + σ e−κ (t−s ) dNs . (4.4)
multiply it by eκ t (see: Norberg, 2004), or we can define an initial 0
guess for the solution of above SDE as follows:
t
λt = λ0 + σ e−κ (t−s ) dNs , 5. Solving model by discrete-time MDP
0
where The liquidation problem (3.4) is a continuous-time stochastic
t
control problem. Bayraktar and Ludkovski (2014) formulate this
λ0 = e−κ t D(0 ) + f (γs )(t − s )ds.
problem as a Hamilton–Jacobi–Bellman equation, and provided vis-
0
cosity solutions as closed-form solutions associated with the LOB
λ0 Verify by Itô’s lemma on eκ t λt
model and its depth function. The classical stochastic control ap-
t
proach solves nonlinear partial differential equations, and it is nec-
eκ t λt = D(0 ) + eκ t f (τs )e−κ (t−s ) ds
0 essary the differentiability of the value function to be satisfied.
t In contrast, a discrete-time Markov Decision Processes (MDP) ap-
+ eκ t σ e−κ (t−s ) dNs proach provides a set of optimal policies, condition on the differ-
0
t t entiability of the value function. Bäuerle and Rieder (2009) use this
= f (γs )eκ s ds + σ eκ s dNs approach and by applying dynamic programming principles, prove
0 0 the existence and uniqueness of the solution. This liquidation prob-
κ eκ t λt dt + eκ t dλt = f (γt )eκ t dt + σ eκ t dNt lem is a mathematical abstraction of real problems in which an in-
κλt dt + dλt = f (γt )dt + σ dNt vestor should decide on several stopping times to gain certain rev-
enue at each stage. The investor has a finite period to liquidate a
dλt = ( f (γt ) − κλt )dt + σ dNt .
position and maximize the total revenue at the end of the period.
Therefore, in this problem with a finite number of sub-periods,
a mapping function should be applied to compute optimal poli-
Lemma 2. The impact stochastic intensity equation: (4.3) is a general
cies through the limited number of steps of the dynamic program-
function of price impact. It can then measure an instantaneous price
ming algorithm. She must find an equilibrium to minimize the cost
impact in the short term, and a permanent price impact on the long
of the present extensive trading against the future abandon risk
run.
where the overall cost is not predictable. This problem can be for-
Proof. From Equation (4.3) we have: mulated as a deterministic or stochastic optimal control problem
t with Markov or semi-Markov decision property under different se-
lim λ0 = lim (e−κ t D(0 ) + f (γs )(t − s )ds ) tups.
t→∞ t→∞ 0
Bertsekas and Shreve (1996) distinguish between the stochas-
t
tic optimal control problems from its deterministic form regard-
= lim f (γs )e−κ (t−s ) ds
t→∞ 0 ing available information. In a deterministic optimal control prob-
t lem, we can specify a set of states and policy as a control pro-
= lim e−κ t f (γs )eκ s ds cess in advance. Thereby, a succeeding state is the function of the
t→∞ 0

t present state and its control process. On the other hand, in the
0 ( f (γs )eκ s ds stochastic control problem, controlling the succeeding state of the
= lim
eκ t t→∞ system leads to evaluate unforeseen states; therefore, control vari-
f (γt )eκ t ables that are no longer appropriate or have ceased to exist.
(apply l’Hopital’s rule) = lim
t→∞ κ eκ t In this paper, we use a Piecewise-Deterministic Markov Deci-
eαγt sion Model (PDMD) to decompose the liquidation (problem: 3.4)
= lim
t→∞ κ as a continuous-time stochastic control problem into discrete-time
1 + αγt + o(αγt ) 1 + αγT problems. Alternatively, we can apply a method by Mamer (1986),
= lim ≈ = λ∞ . which is successive approximations for semi-Markov decision pro-
t→∞ κ κ
cesses (Huang & Guo, 2011) or the pathwise optimization method
We defined the liquidation problem as a finite time investing prob-
(Desai, Farias, & Moallemi, 2012).
lem on a limited time horizon T . λ∞ represents the long-run trad-
ing impact on the intensity of order arrivals rate, we can think of
5.1. Solution by PDMP
it as a permanent price impact as ”base” intensity part of stochas-
tic intensity. It is a linear function of the trading rate to avoid dy-
Piecewise-Deterministic Markov Process (PDMP), introduced by
namic arbitrage (Gatheral, 2010). We can express the permanently
Davis (1984), is now applied mainly in various areas such as natu-
effected stochastic intensity by:
ral science, engineering, optimal control, and finance. The PDMP
t
is a member of the cádlág Markov Process family; it is a non-
λtPerm = λ∞ + σ e−κ (t−s ) dNs . (4.3)
0 diffusion stochastic dynamic model, with a deterministic motion
that is punctuated by a random jump process.
Instantaneous market impacts can be measured from the small
interval of trading, and the difference between the pre-trade and Definition 1 (Piecewise Deterministic Markov Process). A
post-trade price movements: piecewise-deterministic Markov process (PDMP) is a cádlág
t+ε Markov process with deterministic emotion controlled by the
lim λ0 = lim (e−κ t D(0 ) + f (γs )(t − s )ds ) random jump at jump time.
ε →0 ε →0 t
1058
Markov process (PDMP). We note that since Ji > −1 the price pro-
cess stays positive.
5.1.1. Markov process components

Fig. 2. Orders’ arrival governed by Poisson process. General speaking, Markov process models, which are not sta-
tionary in our setup, include a set of the following terms:
• Let A be an action space includes the action at which denotes
the quantity of shares to be liquidated at time t.
at = γt ,
t ∈ [0, · · · , T ]
where γt is an admissible strategy which satisfies the condition
γt ≤ q to have a non-negative inventory.
• With a given action set A, the set E is defined as a state-
Fig. 3. Iterative Procedure of PDMP. space, that contains the state x of inventory after applying ac-
tion a. The process state Xt denotes the amount of not liqui-
dated shares at jump time t.
Three measurement quantities characterize this process. The t
first feature is the transition measure k, which selects the post-
Xt = Q 0 − qs dNs
jump location. The other quantities are deterministic flow motion 0
φ between jumps and the intensity of the random jump t , de- t
fined as Borel measures on the Borel sets of the state space E, and = Q0 − γs dNs.
0
the control action space A.
In the state space E, set (t, x ) donates the desired process value
• Let K be a stochastic transition kernel from E × A, as a set of all
at the jump time point t. In an embedded Markov chain as a state-action, to a set of states E. We measure the probability of
discrete-time Markov chain, the state of the PDMP process can the next state and action with transition kernel K (.|xt , at ) based
be defined as a set of components of the continuous trajectory on the transition law at Markov decision model.
Zk = (Tk , XTk )k=1,··· ,n where Tk is an increasing sequence of the jump
• The reward function R represents the expected gains as a result
time component and Xt and Zt are a jump location component and of applying the strategy γ , of each state at the jump point time
a post jump location component, respectively: (Zt = Xt if t = tk ). t:
t t
This process starts at the state xt , and jumps with the Pois-
R(t, x, γ ) = γs Ss dNs = γs Ss λs ds (5.2)
son rate process t (fixed or time-dependent) to the next state or 0 0
hits the boundary of the state space. Stochastic Kernel K (.|xt , at ) by t
measuring the transmission probability selects the next location of = U γ (Xs )dNs , (5.3)
the jump given the current information on state and action. Each 0
Markovian policy is a function of the jump time component (Tk ) where the function U : (0, ∞ ) → R+ is an increasing concave
and the post jump location (ZTk ) with the following condition: function, which represents the preference over a set of limit or-
ders or satisfaction of the trader from offers in the market and
φ (Xt ), for t < T ,
Zt = (5.1) Xs is the process state Xt represents the amount of not liqui-
Xt , for t = T .
dated shares at jump time t.
Figure 3 shows the iterative procedure of the PDMP: starting • The deterministic flow motion φ measures the movement of
point of the PDMP is X0 = Z0 , then X0 follows the flow φ un- the inventory of investor between two jumps with a given the
til T1 to determine first jump location X1 = φ (X0 ). The stochas- strategy η:
tic Kernel K (.|φ (X1 ), . ) selects the next location of the post-jump t
Z1 = K (.|φ (X1 ), . ). Latter, similar to the first jump, Xt follows the φ η (Xt ) = ηs dNs. (5.4)
flow φ until T2 so X2 = φ (Z1 ). In next step is the selection of a 0
location of the post-jump Z2 = K (.|φ (Z1 ), . ) by using the stochas- To be more precise on how we can solve the liquidation prob-
tic Kernel K (.|φ (Z1 ), . ). This iterative procedure will be continued lem with PDMD, we explain the formal expression of deterministic
until it hits a boundary of the state space. optimal control problems, which is well documented in Bertsekas
As mentioned earlier, the process of the illiquid asset for satis- and Shreve (1996). In a formal deterministic optimal control prob-
fies the stochastic difierential equation: lem, xi presents the state of the system at stage i and function ci
is a corresponding control at that stage. System equation xi+1 =
dSt = St (μt dt + σ dWt ), t ∈ [0, T ]
f (xi , ci ) is the generating function of next state xi+1 from current
In this market, the price processes can have jumps that could state xi and its control ci . Function g(xi , ci (xi )) is the rewarding
be occurred at random time points and followed by a Poisson pro- function of state i a associate with function ci .
cess. We denote by T = {0 := T0 < T1 · · · < Tm < n} the set of jump The total expected revenue after N decision steps is defined
time points of the Poisson process. The price processes for the time by:
between two jumps i.e. t ∈ [Ti , Ti+1 ) for i ≤ m is given by

N
σ 2 J (x0 ) = Et,x
C
g (xi , ci (xi )) .
C
(5.5)
St = STn exp(μt − )(t − Ti ) + σ Wt ,
2 i=0
and at time of a jump, the price process is: The set C contains all control functions (ci )i={0,··· ,N} , i.e. the ex-
pected total revenue is set of states and corresponding a sequence
STi − STi+1 = STi+1 Ji , of Markovian decision controls.
where Ji is an independent and identically distributed random vari- Let = {π1 , π2 , · · · , πN } be the sequence of all Markovian de-
able. The price process St is a so-called piecewise-deterministic cision controls πi . These decision controls are corresponding to
1059
Markovian policy for predictable admissible process γ defined in been given in the Equation (3.3) and is more consistent with the
the set . Each Markovian policy is a function of the jump time dynamic programming principle.
component (Tk ) and the post jump location (ZTk ). Bertsekas and Shreve (1996) define the universal measurable
In each stage, the reward obtained as a result of sequential mapping T to map Equation 3.3 from (5.3) as follows:
Markovian decisions after the jump by applying the strategy πi
T π ( )(x ) = H[x, π , ]. (5.8)
is:
t Therefore the operator T πn can be decomposed to T π0 · T π1 ·
R(Zi , πi (Zi )) = U π (φs (Xi ))dNs . (5.6) T π2 · · · T πn−1 .
0
From above definitions, we have:
The total expected revenue after N steps is defined by:
πn = T πn (r )

N
= T π0 T π1 · · · T πn−k πn−k .
π (t, x ) = Et,x R ( Z i , πi ( Z i ) . (5.7)
i=0 The mapping T π is an universal measurable mapping, let T π0 = 0
In the following theory, we explain how the above equation is and k ≤ n, (see: Bertsekas & Shreve, 1996 (chapter 1)) we have:
mathematically equivalent to our main problem: (3.4). πk = T π0 T π1 · · · T πk−1 π0 .
Theorem 2. Suppose policy set = {π1 , π2 , · · · , πN } is a Markovian In the following theorem, we prove the uniqueness of the solution
policy set and Z = {Z1 , Z2 , · · · , ZN } is a set of post jump location of by applying a piecewise-deterministic Markov process model.
PMDP and R is the reward function equation: (5.3). Then the value
Theorem 3 (Uniqueness of the solution). Let V (t, q ) be a revenue
function is the expected reward of the PDMP under the Markovian
function, and the optimal solution of the liquidation problem is a con-
policy at time point t and in the state x:
cave and decreasing function in q and increasing in t. Then the solu-
V (t, q ) = sup π (t, x ), t ∈ T, q ∈ [0, Q 0 ] tion of the liquidation problem by applying a PMDP model is converg-
π ∈
ing to a unique solution.
where
Proof. In the Theorem 2, we have shown that

N
π (t, x ) = Et,x R(Zi , πi (Zi )) . V (t, q ) = sup H γ (t, q )
γ
i=0
= sup π (t, x ),
Proof. By using the information on the jump location from Z = π ∈
(Z1 , Z2 , · · · , ZN ) as a set of post jump of PMDP, we have: with defining the sequence of = {π0 , π2 , · · · , πn } as set of
T

Markovian policies (Bertsekas & Shreve, 1996), we have:
V π (t, q ) = Et,q e −rt
γt st λˆt dt (t ∈ T )
0 πn = lim T π0 · T π1 · T π2 · · · T πn−1 π0 (x0 )
T
n→+∞
refer to Equation:(5.3 ) = Et,x U π (X )dNt = lim (T πn )n ( π0 (x0 ))

T
n→+∞
0
= T n ( πn (x0 )).
N
Ti+1 ∧T
= Et,x [ U π (φ (XTi ))dNt Therefore under the same condition, the optimal solution is de-
i=0
Ti fined as:
(Zi define as Zi = [Ti , XTi ] ) (x ) = sup ( π (x )).

π ∈

N Ti+1 ∧T
= Et,x Et [ U π (φ (XTi ))dNt |Zi ] Equally

i=0
Ti
(x ) = T ((x )).

N
which is a subset of the Banach space, so we can apply the Banach
= Et,x R(Zi , πi (Zi )) fixed point theorem and show that V is an unique fixed point of
i=0 the operator T on the set .
= π (t, x ).
6. Numerical method and simulation
We define a set = {π1 , π2 , · · · , πN } as a sequence of all Marko-
vian decision controls πi corresponding to the Markovian policy for
In this section, we apply a simulation method to assist the
the predictable admissible process γ included in the set . We can
performance of our model under various market microstructures’
then decompose this optimal control problem into the piecewise-
characteristics. The trader will decide to take a number of offers
deterministic Markov process:
in the LOB at given prices. The optimum trading rate is dependent
V (t, q ) = sup (t, x ). on the dynamics of orders’ arrival as well as time to maturity. We
π ∈ approximate the value function with a quantization method.

6.1. Approximation of the value function
5.2. Uniqueness of solution
As we explained before, the solution of the value function
We started to model the optimal liquidation problem as a V of the optimal liquidation problem is obtained by the sum-
stochastic control model Equation 3.3 and used the piecewise- ming rewards of sequential Markovian decisions with correspond-
deterministic Markov process to find an equivalent deterministic ing the Markovian policies π and a set of post jump process of the
model for this problem. We also proved this problem could be con- PMDP:
structed as a summation of the sequence of state processes and
V (t, Q 0 ) = sup Et,q [V (t + 1, Q 0 − q ) + h(t, q )]
corresponding control processes in the set . The formulation has γ ∈A
1060
where Bik is a Borel partition of Rd (see: Bally et al. (2003)).

As previously stated, at each stage, the reward obtained for each
stage of sequential Markovian decision after a jump by applying
strategy πi is:
Vˆk = R(ZˆTk , π (ZˆTk )) (6.3)
By applying dynamic programming principle for n fixed grids
ϒ0≤k≤n , Vˆk satisfies the backward dynamic programming condi-
tion:
Vˆn = R(ZˆTn , π (ZˆTn )) (6.4)
Vˆk = max(R(ZˆTk , π (ZˆTk ), E(Vˆk+1 |ZˆTk )) (6.5)

We have used a particular SDE form of the Hawkes process of
the intensity of the rate of order arrivals to express the impact of
order execution on the market:
dλt = ( f (γt ) − κλt )dt + σ dNt , (6.6)
where f (γt ) is a function of trade rate γt , σ explained the strength
of the incentive to generate the same event, and κ is the exponent
Fig. 4. Quantization of state space (Zi , Ti ). of the decay of market impact.
The Hawkes process can capture both exogenous impacts and
endogenous influence of past events to measure the probability of
∞
occurrences of events. This intensity, with the condition (σ > 0),
= sup Et,q R(Zi , πi (Zi ))1Ti <T + h(t, q )1Ti ≥T
γ ∈A can capture the contribution of past events which amplifies the
i=0
chance of an occurrence of the same type of events (self-exciting
We approximate the value function V with the function Vˆ , such property of the Hawkes process). By using this endogenous mech-
that |V − Vˆ |Lp is minimized for the Lp norm. To approximate the anism, (while σ < 0), the model can explain the significant aspect
continuous state space by a discrete space, we use a technique of the strategic function of market participants known as a ”mar-
that is called Quantization method. Bally, Pages et al. (2003), Bally, ket manipulation” (see: e.g. Cartea et al., 2014). It is often used
Printems et al. (2005) develop quantization methods to compute by traders to submit or cancel orders strategically to detect hid-
the approximation of a value function of the optimal stochastic den liquidity or to manipulate markets. Suppose trading activity
control. De Saporta, Dufour, Gonzalez et al. (2010) explain the im- with reducing possibility occurrence has an adverse influence on
plication of the numerical solution to PMDP, such that transmis- the arrival of orders. In that case, the jump process has an ad-
sion function cannot be computed explicitly from local character- verse impact on its intensity and makes imbalance in supply and
istics of PMDP. De Saporta et al. (2010) express a numerical so- demand of the market exponentially. This damping factor can be
lution for Embedded Markov chain, while the only source of ran- measured with Hawkes model the magnitude of self-exciting and
domness is a set of the post jump process (Tn , Zn ). By quantization the strength of the incentive while it is negative (self-damping
of Zn = (XTn ; Tn ), we can transfer the conditional expectations into property of the Hawkes process).
finite sums, and least upper bound of the value function (sup) into
its maximum value (max) in the discretized space of [0, T ]. We de- 6.3. Result of simulation
fine Vˆ as an approximation of the value function as follows:
In this part, we present the numerical solution of the optimal
∞
liquidation problem. To study the characteristics of the value func-
Vˆ (t, Q ) = max Et,x
0
R(Zî , πi (Zî ))1Tˆ <T + h(t, q )1Tˆ ≥T tion and the level of inventory associated with the control variable
γ ∈A i i
i=0
γ as the rate of trading, we compute some numerical examples
De Saporta et al. (2010) estimate the error and the convergence using different scenarios from empirical studies. Our simulation is
rate of the approximated value function with Lipschitz assumption an abstract of real problems. We consider a trader who wants to
of local characteristics of PMDP, and showed it is bounded by the liquidate Q 0 shares of a risky asset within a short time and fixed
constant rate of quantization error Qe. time horizon T . In an illiquid market, she expects a longer time to
liquidate the whole position. Her goal is to minimize the implicit
|V (t, Q 0 ) − Vˆ (t, Q 0 )| ≤ Qe (6.1) and explicit cost of trading and keep the low-level inventory by
controlling the trading rate.
6.2. Simulation We implement our model in the discrete state space. We choose
time steps small enough to increase the chance of catching orders,
Conditional expectation of value function can be computed with and larger than the usual tick time to make sure that quotes are
some numerical methods in finite dimensional space, such as re- not outside of the market bid-ask spreads. We assess the perfor-
gression method (Carriere, 1996; Longstaff & Schwartz, 2001; Tsit- mance of our model by quantizing the value function at a fixed
siklis & Van Roy, 2001) or on Malliavin calculus (as in Cont & position within the space and time of the mesh refinement.
Fournié, 2010). Bally et al. (2003) propose a quantization method To illustrate how our model behaves, we study the inven-
to approximate the state space of problem; from each time step Tk , tory level associated with the optimal trading rate control for
a sate function Zˆ can be projected to the grid ϒk := {zˆT i }1≤i≤N (see: both types of price impact Hawkes models: self-damping and
k
Fig. 4) self-exciting properties. Concerning our simulation scenarios, we
choose values for the parameters of the model:
ZˆTk = zˆT i 1T i ∈Bk , (6.2)
1≤i≤N
k k i • The parameter σ as the magnitude of self-exciting
1061
Fig. 5. Inventory level of a trader with given the initial inventory Q 0 and time to maturity T.
Fig. 6. Trading rate of liquidation problem with constant order size .
• The parameter κ as the exponent of the decay of market impact strategies. In the case of coming of not favourite offers, the algo-
• The size of the bid-ask spread is used as a proxy to measure rithm reduces the speed of trading (panel I: self-damping prop-
illiquidity erty) and waited for a longer time to find better matching coun-
terparties. In unstable market conditions, are indicated with the
These parameters can be estimated from real market data. higher level of self-damping, the trader should pay for final inven-
Plotting the trading boundaries (Fig. 7) shows that the optimal tory to liquidate the whole position of initial shares. At this point,
trading level depends on the price process, the remainder of the the best strategy is to accept offers in the LOB to avoid never to
inventory (Fig. 5), and time to maturity. face severity penalties at the end of period. If estimated parame-
Due to market conditions and asset characteristics, an agent ters of markets might show that the higher chance of same types
with a higher degree of risk aversion cares more about the exe- of orders’ occurrences (panel II: self-exciting property), the algo-
cution risk and price fluctuations. She starts trading with available rithm reduces the trading rate in the hope of getting better offers.
orders at a deeper level of the LOB to avoid risk execution and lack The second column shows the related entries of the value function
of offers in future. She splits the original order into smaller slices as a result of the implicit and explicit cost of trading. The results
to mitigate price impact. demonstrate that the second type of market characteristics with a
Table 1 summarizes results of simulations by our model, includ- higher level of self-exciting property can be more profitable.
ing the level of inventory (Fig. 5) and its corresponding optimal It is worthwhile to compare our algorithm with some bench-
trading rate (Fig. 6) for different scenarios of implementations of marks, such as the conventional algorithm that trades only the best
1062
Fig. 7. Trading boundary condition for multi-Stopping time and with Poisson arrival.
Fig. 8. Trading rate of liquidation Problem with different specification.
LOB orders. Table 1 (panel III: Conventional method) presents the shares at the given time. The central assumption of the majority of
results of the conventional algorithm simulations. The conventional limit order models is to trade at the best bid and ask prices (see:
optimal execution methods only trade with the best available limit Cao, Hansch, & Wang, 2008). We allow the trading procedure to go
orders in the LOB. We thus replicated the simulation experiments to more in-depth into the LOB to avoid not filling the order and
with both market microstructure conditions: self-exciting and self- face last-minute inventory penalties.
damping properties. Our results showed that the trader faces more
costs at the end of the period and fewer profits than our algorithm 7. Discussion and further works
in both scenarios. The conventional algorithm also considers orders
at the top level of the LOB, and is less flexible when the market is In this study, we proposed an analytical solution to the optimal
illiquid. liquidation problem, taking a dynamic approach and building nu-
Having a look at the graphs (Fig. 8) showing the optimal trading merical boundaries of multi-stopping problems in an illiquid mar-
rate of different scenarios, one can conclude that there are more ket. We simulated optimal splitting order models according to the
fluctuations in trading if the probability of arrival new offers is re- existing liquidity in the order book with different parameters and
duced. price impact models. We used the PDMD to decompose the liqui-
Our numerical results also show that the proportion of the gain dation problem into discrete period problems, and applied Markov
of the liquidation model considerably depends on the specification decision rules to obtain the solution. We examined the uniqueness
of the price impact function. It indicates that market conditions and practical existence of the optimal solution. We showed that
have an effect on the final inventory level and causes a substantial the percentage gain of the liquidation model depends on the mar-
dropping in the final wealth of the trader. An important aspect of ket conditions and specifications of the price impact function. In
the optimal strategies, which we have developed, is to take into ac- contrast to most limit order models for liquidating a market, which
count the execution risk in an illiquid market, i.e. inability to liquid only trade at the best bid and ask prices, our model allows the
1063
trading to go deeper into LOBs to avoid situations in which orders

are not filled until the last time periods.
We believe that a possible area for further research would
be to study the optimal liquidation problem for multiple assets
(Tsoukalas, Wang, & Giesecke, 2019) across trading venues (see:
e.g. Barclay, Hendershott, & McCormick, 2003), known as the op-
timal splitting problem, by considering different trader types (see,
e.g., Brogaard et al., 2019) in lit or dark markets (lit or dark pools)
(see, e.g, Comerton-Forde & Putniņš, 2015). In this framework, a
trader might generate profits by pushing up prices at the tradi-
tional venue and parallel trading in the multi-platform. We could
apply the same modelling approach to the arrival flow of orders
and solve the problem using the interpretable optimal stopping Fig. 9. Hawkes Processes with 31 events.
modelling approach developed by Ciocan and Mišić (2020).
Acknowledgement
Jan Vecer’s work was partially supported by Grant Agency of

the Czech Republic under grant 21-19311S.
Appendix A
A1. Hawkes process
The concept of a point process is fundamental to the stochastic

process. Before we explain the dynamic of the Hawkes process, we
state the following formal definition of a point process, a count-
ing process, and an intensity process. For more explanation about
point process and intensity process, we refer to Giesecke, Kaka-
vand, and Mousavi (2011).
Definition 2 (Point Process). Let ti (i ∈ N) be a sequence of non-

negative random variables, which is measurable on the probability
Fig. 10. Changes in price equilibrium as a result of increasing and decreasing of
space (; F; P ), in such a way that ∀i ∈ N, ti < ti+1 , is defined as a demands.
point process on R.
Definition 3 (Counting Process). The right-continuous process

Nt = i∈N 1ti ≤t ,with given a point process ti (i ∈ N), is a so-called t
counting process if it measures the number of discrete events up = λ0 + α f (t − s )dNs , (A.7)
and including the time point t. 0
where λ0 is a deterministic long run ”base” intensity, which is

Definition 4 (Intensity Process). With given Nt as a point pro-
assumed to be constant and α represents the magnitude of self-
cess adapted to a filtration F, the intensity process λ as a left-
exciting.
continuous process is defined by:
N − Nt
The function f (t ) expresses the impact of the past events on
t+t
λ(t |Ft ) = lim E |Ft (A.1) the current intensity process, and the parameter σ explains the
t→0 t
magnitude of self-exciting and the strength of an incentive to gen-
Equally
N − Nt
erate the same event, see Fig. 9. We define one-dimensional (1D)
t+t
λ(t |Ft ) = lim P |Ft (A.2) Hawkes processes as follows:
t→0 t
Definition 6 (Hawkes Processes). Hawkes (1971) defines Hawkes
To be more precise, a intensity process λt is determined by a Processes as a linear self-exciting process with an exponential ker-
counting process Nt with the following probabilities: nel function f (t ) = α e−β t which is parameterised by constants
P(Nt+t − Nt = 1 ) = λt t + o(t ) (A.3) α , β and β > 0. Simple one-dimensional (1D) Hawkes processes
can be expressed:
ti <t
P(Nt+t − Nt = 0 ) = 1 − λt t + o(t ) (A.4)
λt = λ0 + α e−β (t−ti ) (A.8)
0
P(Nt+t − Nt > 1 ) = o(t ) (A.5) t

Homogeneous Poisson process is a so-called intensity process = λ0 + α e−β (t−s) dNs , (A.9)
0
that is independent of the probability of the occurrence in the
small interval t and a filtration Ft . where β is the exponent of decay which represents the process of
lessening an amount by a constant discount rate over a period.
Definition 5 (Linear self-exciting process). A general form of a lin-
ear self-exciting process can be expressed: Generally, the arrival pattern in a system can be modelled by
ti <t
Hawkes processes as a non-Markovian extension of the Poisson
process. Each arrival in the system instantly changes the arrival in-
λt = λ0 + α f (t − ti ) (A.6)
tensity by α ; then over time, these arrivals impacts decay at rate
0
1064
β . In the self-exciting point process framework, this damping fac- Chavez-Demoulin, V., & McGill, J. A. (2012). High-frequency financial data modeling
tors magnitude indicates the Hawkes processes self-exciting and using Hawkes processes. Journal of Banking & Finance, 36(12), 3415–3426.
Chehrazi, N., Glynn, P. W., & Weber, T. A. (2019). Dynamic credit-collections opti-
self-damping properties. The former property corresponds to the mization. Management Science, 65(6), 2737–2769.
positive value of β , while the latter corresponds to the negative Chen, J., Feng, L., & Peng, J. (2015). Optimal deleveraging with nonlinear temporary
value of β . An alternative choice for the kernel function f (t ) is a price impact. European Journal of Operational Research, 244(1), 240–247.
Chen, J., Feng, L., Peng, J., & Ye, Y. (2014). Analytical results and efficient algo-
power-law function. rithm for optimal portfolio deleveraging with market impact. Operations Re-
As a result of converting the integrated intensity into indepen- search, 62(1), 195–206.
dent components via exponentially distributed variables, we can Chordia, T., Roll, R., & Subrahmanyam, A. (2001). Market liquidity and trading activ-
ity. The Journal of Finance, 56(2), 501–530.
apply most of the analytical methods to analyze these statistical
Chordia, T., Subrahmanyam, A., & Tong, Q. (2014). Have capital market anomalies
random variables. One can calibrate the parameters of the Hawkes attenuated in the recent era of high liquidity and trading activity? Journal of
process with parametric estimation methods like maximum likeli- Accounting and Economics, 58(1), 41–58.
Ciocan, D. F., & Mišić, V. V. (2020). Interpretable optimal stopping. Management Sci-
hood estimation (Ozaki, 1979) or with non-parametric estimators
ence.
like the Expectation-Maximization (EM) algorithm (Bacry, Dayri, & Collin-Dufresne, P., & Fos, V. (2015). Do prices reveal the presence of informed trad-
Muzy, 2012). The goodness of the fit of these models also can be ing? The Journal of Finance, 70(4), 1555–1582.
examined with conventional statistical tests. Comerton-Forde, C., & Putniņš, T. J. (2015). Dark trading and price discovery. Journal
of Financial Economics, 118(1), 70–92.
Cont, R. (2011). Statistical modeling of high-frequency financial data. Signal Process-
ing Magazine, IEEE, 28(5), 16–25. https://doi.org/10.1109/MSP.2011.941548.
References Cont, R., & Fournié, D.-A. (2010). Change of variable formulas for non-anticipative
functionals on path space. Journal of Functional Analysis, 259(4), 1043–1072.
Alfonsi, A., & Blanc, P. (2016). Dynamic optimal execution in a mixed-market-impact Cont, R., Stoikov, S., & Talreja, R. (2010). A stochastic model for order book dynamics.
Hawkes price model. Finance and Stochastics, 20(1), 183–218. Operations Research, 58(3), 549–563. https://doi.org/10.1287/opre.1090.0780.
Alfonsi, A., Fruth, A., & Schied, A. (2008). Constrained portfolio liquidation in a limit Dassios, A., & Zhao, H. (2017). Efficient simulation of clustering jumps with CIR in-
order book model. Banach Center Publications, 83, 9–25. tensity. Operations Research, 65(6), 1494–1515.
Alfonsi, A., & Schied, A. (2010). Optimal trade execution and absence of price ma- Davis, M. H. A. (1984). Piecewise-deterministic Markov processes: A general class of
nipulations in limit order book models. SIAM Journal on Financial Mathematics, non-diffusion stochastic models. Journal of the Royal Statistical Society. Series B
1(1), 490–522. https://doi.org/10.1137/090762786. (Methodological), 353–388.
Almgren, R., & Chriss, N. (2001). Optimal execution of portfolio transactions. Journal De Saporta, B., Dufour, F., Gonzalez, K., et al. (2010). Numerical method for opti-
of Risk, 3, 5–40. mal stopping of piecewise deterministic Markov processes. The Annals of Applied
Almgren, R. F. (2003). Optimal execution with nonlinear impact functions and trad- Probability, 20(5), 1607–1637.
ing-enhanced risk. Applied Mathematical Finance, 10(1), 1–18. Desai, V. V., Farias, V. F., & Moallemi, C. C. (2012). Pathwise optimization for optimal
Ang, A., Papanikolaou, D., & Westerfield, M. M. (2014). Portfolio choice with illiquid stopping problems. Management Science, 58(12), 2292–2308.
assets. Management Science, 60(11), 2737–2761. Diamond, P. A. (1982). Aggregate demand management in search equilibrium. Jour-
Bacry, E., Dayri, K., & Muzy, J. F. (2012). Non-parametric kernel estimation for sym- nal of Political Economy, 90(5), 881–894.
metric Hawkes processes. Application to high frequency financial data. The Eu- Dufour, A., & Engle, R. F. (20 0 0). Time and the price impact of a trade. The Journal
ropean Physical Journal B, 85(5). https://doi.org/10.1140/epjb/e2012-21005-8. of Finance, 55(6), 2467–2498.
Bacry, E., Delattre, S., Hoffmann, M., & Muzy, J. F. (2013). Modelling microstructure Easley, D., & O’hara, M. (1987). Price, trade size, and information in securities mar-
noise with mutually exciting point processes. Quantitative Finance, 13(1), 65–77. kets. Journal of Financial Economics, 19(1), 69–90.
https://doi.org/10.1080/14697688.2011.647054. Engle, R. F., & Lunde, A. (2003). Trades and quotes: A bivariate point process. Journal
Bacry, E., Jaisson, T., & Muzy, J.-F. (2016). Estimation of slowly decreasing Hawkes of Financial Econometrics, 1(2), 159–188.
kernels: Application to high-frequency order book dynamics. Quantitative Fi- Foucault, T., & Menkveld, A. J. (2008). Competition for order flow and smart order
nance, 16(8), 1179–1201. routing systems. The Journal of Finance, 63(1), 119–158.
Bally, V., Pages, G., et al. (2003). A quantization algorithm for solving multidimen- Gabaix, X. (2016). Power laws in economics: An introduction. Journal of Economic
sional discrete-time optimal stopping problems. Bernoulli, 9(6), 1003–1049. Perspectives, 30(1), 185–206.
Bally, V., Printems, J., et al. (2005). A quantization tree method for pricing and hedg- Garman, M. B. (1976). Market microstructure. Journal of Financial Economics, 3(3),
ing multidimensional american options. Mathematical Finance, 15(1), 119–168. 257–275. https://doi.org/10.1016/0304-405X(76)90 0 06-4.
Barclay, M. J., Hendershott, T., & McCormick, D. T. (2003). Competition among trad- Gatheral, J. (2010). No-dynamic-arbitrage and market impact. Quantitative Finance,
ing venues: Information and trading on electronic communications networks. 10(7), 749–759.
The Journal of Finance, 58(6), 2637–2665. Giesecke, K., Kakavand, H., & Mousavi, M. (2011). Exact simulation of point pro-
Bäuerle, N. (2001). Discounted stochastic fluid programs. Mathematics of Operations cesses with stochastic intensities. Operations Research, 59(5), 1233–1245.
Research, 26(2), 401–420. Glosten, L. R., & Milgrom, P. R. (1985). Bid, ask and transaction prices in a specialist
Bäuerle, N., & Rieder, U. (2009). MDP algorithms for portfolio optimization problems market with heterogeneously informed traders. Journal of Financial Economics,
in pure jump markets. Finance and Stochastics, 13(4), 591–611. 14(1), 71–100.
Bäuerle, N., & Rieder, U. (2010). Optimal control of piecewise deterministic Markov Goldstein, M. A., & Kavajecz, K. A. (20 0 0). Eighths, sixteenths, and market depth:
processes with finite time horizon. Modern Trends in Controlled Stochastic Pro- Changes in tick size and liquidity provision on the NYSE. Journal of Financial
cesses: Theory and Applications, 123, 143. Economics, 56(1), 125–149.
Bayraktar, E., Horst, U., & Sircar, R. (2007). Queuing theoretic approaches to financial Grossman, S. J., & Miller, M. H. (1988). Liquidity and market structure. The Journal
price fluctuations. Handbooks in Operations Research and Management Science, 15, of Finance, 43(3), 617–633.
637–677. Gueant, O., & Lehalle, C.-A. (2015). General intensity shapes in optimal liquidation.
Bayraktar, E., & Ludkovski, M. (2014). Liquidation in limit order books with con- Mathematical Finance, 25(3), 457–495.
trolled intensity. Mathematical Finance, 24(4), 627–650. Guéant, O., Lehalle, C.-A., & Fernandez-Tapia, J. (2012). Optimal portfolio liquidation
Bertsekas, D. P., & Shreve, S. E. (1996). Stochastic optimal control: The discrete time with limit orders. SIAM Journal on Financial Mathematics, 3(1), 740–764.
case. Athena Scientific. Ha, Y., & Zhang, H. (2020). Algorithmic trading for online portfolio selection under
Bertsimas, D., & Lo, A. W. (1998). Optimal control of execution costs. Journal of Fi- limited market liquidity. European Journal of Operational Research.
nancial Markets, 1(1), 1–50. Hasbrouck, J. (1991). Measuring the information content of stock trades. The Journal
Biais, B., Hillion, P., & Spatt, C. (1995). An empirical analysis of the limit order book of Finance, 46(1), 179–207.
and the order flow in the paris bourse. The Journal of Finance, 50(5), 1655–1689. Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point pro-
Brogaard, J., Hendershott, T., & Riordan, R. (2019). Price discovery without trading: cesses. Biometrika, 58(1), 83–90. https://doi.org/10.1093/biomet/58.1.83.
Evidence from limit orders. The Journal of Finance, 74(4), 1621–1658. Hendershott, T., Jones, C. M., & Menkveld, A. J. (2011). Does algorithmic trading im-
Brunovskỳ, P., Černỳ, A., & Komadel, J. (2018). Optimal trade execution under en- prove liquidity? The Journal of Finance, 66(1), 1–33.
dogenous pressure to liquidate: Theory and numerical solutions. European Jour- Henderson, V., & Hobson, D. (2013). Risk aversion, indivisible timing options, and
nal of Operational Research, 264(3), 1159–1171. gambling. Operations Research, 61(1), 126–137.
Cai, F., Han, S., Li, D., & Li, Y. (2019). Institutional herding and its price impact: Ev- Horst, U., & Naujokat, F. (2014). When to cross the spread? trading in two-sided
idence from the corporate bond market. Journal of Financial Economics, 131(1), limit order books. SIAM Journal on Financial Mathematics, 5(1), 278–315.
139–167. Huang, Y., & Guo, X. (2011). Finite horizon semi-Markov decision processes with
Cao, C., Hansch, O., & Wang, X. (2008). Order placement strategies in a pure limit application to maintenance systems. European Journal of Operational Research,
order book market. Journal of Financial Research, 31(2), 113–140. 212(1), 131–140.
Carriere, J. F. (1996). Valuation of the early-exercise price for derivative securi- Huang, Y., & Guo, X. (2020). Multiconstrained finite-horizon piecewise determinis-
ties using simulations and splines. Insurance: Mathematics and Economics, 19(1), tic Markov decision processes with unbounded transition rates. Mathematics of
19–30. Operations Research, 45(2), 641–659.
Cartea, A., Jaimungal, S., & Ricci, J. (2014). Buy low, sell high: A high frequency trad- Huberman, G., & Stanzl, W. (2004). Price manipulation and quasi-arbitrage. Econo-
ing perspective. SIAM Journal on Financial Mathematics, 5(1), 415–444. metrica, 72(4), 1247–1275.
1065
Kavajecz, K. A. (1999). A specialist’s quoted depth and the limit order book. The Ozaki, T. (1979). Maximum likelihood estimation of Hawkes’ self-exciting point pro-
Journal of Finance, 54(2), 747–771. cesses. Annals of the Institute of Statistical Mathematics, 31(1), 145–155. https:
Kempf, A., & Korn, O. (1999). Market depth and order size. Journal of Financial Mar- //doi.org/10.1007/BF02480272.
kets, 2(1), 29–48. Peskir, G., & Shiryaev, A. (2006). Optimal stopping and free-boundary problems.
Kirilenko, A., Kyle, A. S., Samadi, M., & Tuzun, T. (2017). The flash crash: High-fre- Springer.
quency trading in an electronic market. The Journal of Finance, 72(3), 967–998. Philip, R. (2020). Estimating permanent price impact via machine learning. Journal
Kogan, L., Ross, S. A., Wang, J., & Westerfield, M. M. (2006). The price impact and of Econometrics, 215(2), 414–449.
survival of irrational traders. The Journal of Finance, 61(1), 195–229. Predoiu, S., Shaikhet, G., & Shreve, S. (2011). Optimal execution of a general
Kyle, A. S. (1985). Continuous auctions and insider trading. Econometrica, one-sided limit-order book. SIAM Journal on Financial Mathematics, 2, 183–212.
1315–1335. Shleifer, A., & Summers, L. H. (1990). The noise trader approach to finance. Journal
Lindley, D. V. (1961). Dynamic programming and decision theory. Journal of the Royal of Economic perspectives, 4(2), 19–33.
Statistical Society. Series C (Applied Statistics), 10, 39–51. Ting, C., Warachka, M., & Zhao, Y. (2007). Optimal liquidation strategies and their
Longstaff, F. A., & Schwartz, E. S. (2001). Valuing american options by simulation: A implications. Journal of Economic Dynamics and Control, 31(4), 1431–1450.
simple least-squares approach. Review of Financial Studies, 14(1), 113–147. Tirole, J. (2011). Illiquidity and all its friends. Journal of Economic Literature, 49(2),
Mamer, J. W. (1986). Successive approximations for finite horizon, semi-Markov de- 287–325.
cision processes with application to asset liquidation. Operations Research, 34(4), Tsitsiklis, J. N., & Van Roy, B. (2001). Regression methods for pricing complex amer-
638–644. ican-style options. IEEE Transactions on Neural Networks, 12(4), 694–703.
Mrázová, M., & Neary, J. P. (2017). Not so demanding: Demand structure and firm Tsoukalas, G., Wang, J., & Giesecke, K. (2019). Dynamic portfolio execution. Manage-
behavior. American Economic Review, 107(12), 3835–3874. ment Science, 65(5), 2015–2040.
Norberg, R. (2004). Vasiček beyond the normal. Mathematical Finance, 14(4), Vayanos, D., & Wang, J. (2013). Market liquidity-theory and empirical evidence. In
585–604. Handbook of the economics of finance: vol. 2 (pp. 1289–1361). Elsevier.
Obizhaeva, A. A., & Wang, J. (2013). Optimal trading strategy and supply/demand Zheng, B., Roueff, F., & Abergel, F. (2014). Modelling bid and ask prices using con-
dynamics. Journal of Financial Markets, 16(1), 1–32. strained Hawkes processes: Ergodicity and scaling limit. SIAM Journal on Finan-
cial Mathematics, 5(1), 99–136.
1066

1 s2.0 S0377221721004483 Main

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

1 s2.0 S0377221721004483 Main

Uploaded by

Copyright:

Available Formats

European Journal of Operational Research 296 (2022) 1050–1066

Contents lists available at ScienceDirect

European Journal of Operational Research

Interfaces with Other Disciplines

Optimal liquidation problem in illiquid marketsR

1. Introduction vated by a real problem faced by traders wanting to liquidate large

κ = 0.6 $50,067 Trade rate 0.36 6.17 12.75 19.28 44.16

3.1. Trading boundaries

Gt (F (y )) = Gs+(t−s ) dqt γ = −γt dNt γ ,

 where FT is the amount of the cash at the end of time horizon

problem; the value function can be deﬁned by:

V (t, q ) = sup Et,q [H γ (t, q )], t ∈ [0, T ], q ∈ [0, Q 0 ]. (3.3)

Assumption 1 (Continuity and concavity of the value func-

The strict concavity of value function V is an essential assump-

Lemma 1. Let V (T , Q 0 ) be the continuation value that is obtained

= sup Et,q e −rt

5.1.1. Markov process components

refer to Equation:(5.3 ) = Et,x U π (X )dNt = lim (T πn )n ( π0 (x0 ))

(Zi deﬁne as Zi = [Ti , XTi ] ) (x ) = sup ( π (x )).

= Et,x Et [ U π (φ (XTi ))dNt |Zi ] Equally

where Bik is a Borel partition of Rd (see: Bally et al. (2003)).

Vˆk = max(R(ZˆTk , π (ZˆTk ), E(Vˆk+1 |ZˆTk )) (6.5)

Fig. 6. Trading rate of liquidation problem with constant order size .

Fig. 8. Trading rate of liquidation Problem with different speciﬁcation.

trading to go deeper into LOBs to avoid situations in which orders

Jan Vecer’s work was partially supported by Grant Agency of

A1. Hawkes process

The concept of a point process is fundamental to the stochastic

Deﬁnition 2 (Point Process). Let ti (i ∈ N) be a sequence of non-

Deﬁnition 3 (Counting Process). The right-continuous process

where λ0 is a deterministic long run ”base” intensity, which is

P(Nt+t − Nt > 1 ) = o(t ) (A.5)  t

You might also like

where FT is the amount of the cash at the end of time horizon

refer to Equation:(5.3 ) = Et,x U π (X )dNt = lim (T πn )n ( π0 (x0 ))

(Zi deﬁne as Zi = [Ti , XTi ] ) (x ) = sup ( π (x )).

Fig. 6. Trading rate of liquidation problem with constant order size .

P(Nt+t − Nt > 1 ) = o(t ) (A.5) t