Professional Documents
Culture Documents
10 1109@tec 2020 2990937 PDF
10 1109@tec 2020 2990937 PDF
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
1
A. Kushwaha and M. Gopal are with the Department of Electrical Engineering, School of Engineering, Shiv Nadar University, Gautam Buddha Nagar, Uttar
Pradesh, 201314, India. (e-mail: ak999@snu.edu.in; mgopal@snu.edu.in).
B. Singh, Jr., is with the Department of Electrical Engineering, Indian Institute of Technology, New Delhi, 110016, India. (e-mail:
bhimsinghiitd61@gmail.com).
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
2
Grid
Squirrel cage Vga Vgb Vgc Iga Igb Igc
DC Link
Induction Generator
Wind turbine
Feedforward
SPWM SPWM
compensation
+ vqref
+ dq- αβ- abc-αβ abc-αβ
Id + vdref αβ abc αβ-abc
Ia Ψr + Vgα Vgβ Igα Igβ θ
Iq θ s
Ib abc- Flux
Ic dq estimator PLL
θ αβ-dq αβ-dq
θ +- PI PI ++ ++
ωr +- dq-αβ
VDCref
igdref vgd vgq igd igq
ωref T *
e Iq* vgd
VDC igd
+- PI G +- PI ωL
Ψr Iq
ωr -ωL
ωr Ψr* Id*
+- PI 1/Lm +- PI igqref ++
+- PI ++
Ψr Id igq vgq
Voltage
Field Oriented Control (FOC) Oriented Control (VOC)
Fig 1: Block Diagram of Proposed System 3.78kW, three phase squirrel cage induction generator based
these include tedious task of offline learning. Assuming WECS.
constant characteristics also give wrong results. With the
purpose of solving problems of HCS, modified HCS techniques II. SYSTEM CONFIGURATION
have been poposed in [14]-[15]. These require detection of
The proposed system is shown in Fig. 1. The system consists of
wind speed change. Moreover, these are not able to learn from
wind turbine coupled to a squirrel cage induction generator
experience because of no memory.
(SCIG) through a gear box. The SCIG is connected to the grid
In this paper, Q-learning based MPPT technique is proposed.
with back-to-back converters. The grid side converter and
Unlike, other MPPT techniques as TSR and PSF, Q-learning is
generator side converter are connected through a DC link
a model-free learning algorithm, which rejects any requirement
capacitor.
of WECS parameter knowledge for learning as wind speed. Q-
learning based controller works like an agent which interacts A. Wind Turbine Model
with the environment and the experience gained by interaction The mathematical model of adopted wind turbine is given as
forms the basis of its learning. In [16], the authors have following. The mechanical power generated by the wind turbine
presented Q-learning based MPPT for constant and step change can be given as,
in wind speeds. The Q-table is being represented by neural 1
network. The authors have presented the online learning of 𝑃𝑚 = 𝜌𝐴𝑣 3 𝐶𝑝 (𝜆, 𝛽) (1)
2
neural network for MPPT. It terminates the requirement of where Pm is mechanical power of turbine in [W], ρ is air density
extensive offline training. However, it has limitation of working in [kg/m3], A is turbine swept area in [m2], v is wind speed in
in varying wind speed conditions and online detection of [m/s], Cp is performance coefficient of turbine, λ is tip-speed
maximum power point. In the proposed algorithm, the ratio and β is the blade pitch angle in degrees.
controller stores its experience in form of look-up table. The The performance coefficient of wind turbine which is a function
technique works in two modes: learning mode and MPP mode. of λ and β is given as,
In learning mode, the controller searches for the MPP as well
as learns through the interaction. As soon as MPP is detected, 𝑐
𝑐
− 6
the controller works in MPP mode where it remains at MPP 𝐶𝑝 (𝜆, 𝛽) = 𝑐1 ( 2 − 𝑐3 𝛽 − 𝑐4 ) 𝑒 𝜆𝑖
+ 𝑐6 𝜆 (2)
𝜆𝑖
until wind speed changes. For any wind speed change, the Where,
controller again works in learning mode. If the controller 1 1 0.035
experiences the wind speed that has been already experienced = − (3)
𝜆𝑖 𝜆 + 0.008𝛽 𝛽 3 + 1
before, it moves faster towards MPP. Therefore, the proposed with c1 = 0.5176, c2 = 116, c3 = 0.4, c4 = 5, c5 = 21 and c6 =
MPPT technique shows improvement in terms of time as 0.0068.
compared to HCS. The expression for the tip-speed ratio is given as,
With the wind speed change detection, the controller works in 𝜔𝑇 𝑅
two modes mode 0 and mode 1. In mode 1, system operates at 𝜆 = (4)
𝑣
optimum point in varying speed conditions. In mode 0, the where ωT is turbine angular speed and R is turbine blade radius
system implements in learning mode. While working in MPP in m.
mode, controller records peak point which is used in generating In wind turbine, there is an optimum value of tip-speed ratio λopt
reference speed in mode 1. at which performance coefficient of turbine is at its maximum
The proposed Q-learning based MPPT technique is verified by value Cpmax. At this point, turbine fetches maximum power from
simulation in MATLAB/SIMULINK environment on a wind. For this turbine, λopt and Cpmax are 8.1 and 0.48
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
3
respectively for pitch angle β =00. Fig. 2 shows the The basis of most of the reinforcement learning framework is
characteristics of turbine power with turbine rotational speed at Markov decision process (MDP). Mathematically, MDP can be
different wind speeds for a pitch angle β=00. This optimum expressed as a set of variables (S, A, P, r) where S is set of
curve shows the maximum power point curve at different wind possible states of the environment, A is set of actions, P is state
speeds. The aim is to vary the generator speed so that tip-speed transition probability, and r is reward function. V(s) represents
ratio remains at optimum value for different wind speeds and the value of state s. For action a at state s, P(s,a,s’) is the
maximum power point tracking (MPPT) occurs. probability of transition from state s to s’, the value of state s
state
B. Generator Side Control Environment (st+1)
The generator side converter control is based on field oriented
action reward
control (FOC). FOC enables the machine to follow the (r(st,at))
(at)
reference speed accurately. Three phase generator currents are
decoupled into d- axis and q-axis in rotor flux oriented frame.
Agent
On the q-axis, the speed control loop is performed and iq is state
proportional to the generator torque. The flux control loop (st)
operates in d-axis Fig. 3 Block Diagram of RL
8000 is shown as,
optimal
𝑉(𝑠) = 𝑟(𝑠) + 𝑚𝑎𝑥 𝑎 𝛶 ∑𝑠′ 𝑃( 𝑠, 𝑎, 𝑠′)𝑉(𝑠 )
′
(5)
Turbine output power (W)
7000 curve
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
4
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
5
In learning mode, the agent observes the state using Pgrid and ωg Take action zero
feedback, and then learns the best action by maximizing Q-
values to achieve the MPP. In each state, the agent can take N Y
ΔPgrid < α
seven actions as defined in (14). Three actions are for increment
in generator speed, three actions are for decrement in generator Fig. 4 Flowchart of Q-learning Based MPPT
speed and one action is for no change. For each action, agent The convergence of Q-learning is based on convergence of Q-
gets positive or negative reward based on increase or decrease values for all the state-action values. In online learning, waiting
in grid power as defined in (15). for all Q-values to converge is not a good option. Another
In Q-learning based MPPT, learning of states occurs online. option is to define the level of learning of each state separately.
Therefore, if during online learning, the system comes across Learning of each state is defined by the number of visits to the
MPP, the system should not wait for learning to be completed. state. The criterion for any state sij to be learned is
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
6
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
7
remains at its maximum value 0.48 in the varying wind speed SENSORS
also.
SCIG
DC
MOTOR
ia
va
Fig 9. Phase ‘a’ voltage and current
The controller first does the exploration by decreasing and
increasing the speed. By gaining the rewards for each action,
the controller reaches to peak point in around 10 to 11 seconds.
The peak point is 340 W, 1230 rpm as shown in Fig. 11.
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
8
NO
|P(k)-P(k-1)|>10?
YES
Nr(k)=N(k)
NO YES
P(k)-P(k-1)>10?
NO
N(k)-N(k-1)>0? N(k)-N(k-1)>0?
NO
YES YES
Nr(k)=N(k)-40 Nr(k)=N(k)+40
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TEC.2020.2990937, IEEE
Transactions on Energy Conversion
9
the end of 25 seconds is 293 watt-seconds. This analysis shows [11] J. Hui and A. Bakhshai, “A new adaptive control algorithm for maximum
power point tracking for wind energy conversion systems,” in Power
the superiority of proposed algorithm over P&O algorithm.
Elect. Specialists Conf., June 15-18, 2008, pp. 4003-4007.
Assuming same rate of energy generated by both methods, 293 [12] W.-M. Lin, C.-M. Hong, F.-S. Cheng, and K. H. Lu, “MPPT control
watt-seconds improvement in 25 seconds leads to 0.042 mega- strategy for wind energy conversion system based on RBF network,” in
Joules of improvement in 1 hour, which makes proposed EnergyTech, May 25-26, 2011, pp. 1-6.
[13] C.-H. Chen, C.-M. Hong, and T.-C. Ou, “WRBF network based control
algorithm significantly superior to existing methods.
strategy for PMSG on smart grid,” in Int. Conf. on Intelligent System
Applications to Power Systems, Sept. 25-28, 2011.
IX. CONCLUSION [14] J. Yaoqin, Y. Zhongqing, and C. Binggang, “A new maximum power
point tracking control scheme for wind generation,” Proc. Int. Conf. on
Q-learning based MPPT algorithm has been proposed for Power System Technology, Oct. 13-17, 2002, pp. 114-148.
variable-speed WECS. This algorithm has been found capable [15] S. M. R. Kazmi, H. Goto, H.-J. Guo, and O. Ichinokura, “A Novel
of searching MPP in real time. The wind speed change detection Algorithm for Fast and Efficient Speed-Sensorless Maximum Power
Point Tracking in Wind Energy Conversion Systems,” IEEE Trans. Ind.
makes the proposed algorithm works in real time with varying
Electron., vol. 58, no. 1, pp. 29–36, jan. 2011.
wind speed conditions. The MPPT technique is model-free and [16] W. Chun, Z. Zhang, W Qiao and L Qu, “Intelligent maximum power
capable to learn the states by interacting with system. The extraction control for wind energy conversion systems based on online
stopping criterion for learning of the system depends on the Q-learning with function approximation.” 2014 IEEE Energy
number of times a particular state has been visited. The state Conversion Congress and Exposition (ECCE) (2014): 4911-4916.
[17] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction
which is learned will work in mode 1. The proposed algorithm Cambridge, Mass.: MIT Press, 1998.
is compared with another popular algorithm P&O which does [18] C.J.C.H Watkins, "Learning from Delayed Rewards,” PhD Dissertation,
not require system information,, and superiority of proposed Cambridge University, Cambridge, England, 1989.
algorithm is shown. As explained in Section-IV, the various [19] Y. Xia, K. H. Ahmed and B. W. Williams, "A New Maximum Power
Point Tracking Technique for Permanent Magnet Synchronous
dynamic parameters makes TSR and PSF algorithms unreliable Generator Based Wind Energy Conversion System," IEEE Transactions
for MPPT. So, the proposed Q-learning based algorithm is on Power Electronics, vol. 26, no. 12, pp. 3609-3620, Dec. 2011.
capable of online learning, identifying MPP even in system is
unlearned and to work at MPP with varying wind speed. Also, X. BIOGRAPHIES
due to one Q-value update at one step, it takes very small Ashish Kushwaha received B.Tech. Degree in electrical
computational time which makes it fast. However, there is enginnering from JSSATE, Noida in 2010, and the
scope of using function-approximation methods to reduce the M.Tech. Degree in electrical engineering from the Motilal
Nehru National Institute of Technology, Allahabad, India,
memory requirement because of use of look-up table.
in 2012. He is currently working toward the Ph.D. degree
in the Department of Electrical Engineering, Shiv Nadar
REFERENCES University, Gautam Buddha Nagar, UP, India.
Her research interests include renewable energy systems,
[1] M. G. Simões, F. A. Farret, “Alternative Energy Systems: Design and
power electronics, electric drives and control, and
Analysis with Induction Generators,” CRC Press, 2007.
machine learning.
[2] S. Musunuri and H. L. Ginn, “Comprehensive review of wind energy
M. Gopal He received the B.E. degree in electrical
maximum power extraction algorithms,” in Power and Energy Society
engineering, and the M.E. and Ph.D. degrees in control
General Meeting, July 24-29, 2011, pp. 1-8.
engineering from Birla Institute of Technology and
[3] L. Y. Pao and K. E. Johnson, “A tutorial on the dynamics and control of
Science, Pilani, India, in 1968, 1970, and 1976,
wind turbines and wind farms,” in American Control Conference, June
respectively. M. Gopal, an Ex-Professor of Department of
10-12, 2009, pp. 2076-2089.
Electrical Engineering, Indian Institute of Technology
[4] S. M. R. Kazmi, H. Goto, H.-J. Guo, and O. Ichinokura, “Review and
(IIT) Delhi, New Delhi, India, is presently associated
critical analysis of the research papers published till date on maximum
with, Shiv Nadar University, Gautam Budh Nagar (U.P.),
power point tracking in wind energy conversion system,” in Energy
India. His teaching and research stints span over three
Conversion Congress & Exposition, Sept. 12-16, 2010, pp. 4075-4082
decades at IITs. He is the author/co-author of six books on control engineering.
[5] S. Morimoto, H. Nakayama, M. Sanada, and Y. Takeda, “Sensorless
His video course on control engineering is available on YouTube. He has a large
output maximization control for variable-speed wind generation system
number of publications in reputed journals. His current research interests
using IPMSG,” IEEE Trans. on Ind. Appl., vol. 41, no. 1, pp. 60-67, Jan-
include pattern recognition, machine learning, soft computing, and intelligent
Feb 2005.
control.
[6] R. I. Putri, M. Pujiantara, A. Priyadi, T. Ise and M. H. Purnomo,
"Maximum power extraction improvement using sensorless controller
based on adaptive perturb and observe algorithm for PMSG wind turbine Bhim Singh (SM’99–F’10) was born in Rahamapur,
application," IET Electric Power Applications, vol. 12, no. 4, pp. 455- Bijnor, India, in 1956. He received the B.E. degree in
462, 4 2018. electrical from the University of Roorkee, Roorkee, India,
[7] H. Fathabadi, "Novel Maximum Electrical and Mechanical Power in 1977, and the M.Tech. degree in power apparatus and
Tracking Controllers for Wind Energy Conversion Systems," IEEE systems and Ph.D. degree in electrical machines from the
Journal of Emerging and Selected Topics in Power Electronics, vol. 5, Indian Institute of Technology (IIT) Delhi, New Delhi,
no. 4, pp. 1739-1745, Dec. 2017. India, in 1979 and 1983, respectively In 1983, he was a
[8] H. Li, K. Shi, and P. Mclaren, “Neural-Network-Based Sensorless Lecturer with the Department of Electrical Engineering,
Maximum Wind Energy Capture With Compensated Power University of Roorkee (now IIT Roorkee), where he became a Reader in 1988.
Coefficient,” IEEE Trans. on Ind. Appl., vol. 41, no. 6, pp. 1548–1556, In December 1990, he joined the Department of Electrical Engineering, IIT
Nov-Dec 2005. Delhi, as an Assistant Professor, where he became an Associate Professor in
[9] M. Cirrincione, M. Pucci, and G. Vitale, “Neural MPPT of Variable- 1994 and a Professor in 1997. He was the Head with the Department of
Pitch Wind Generators With Induction Machines in a Wide Wind Speed Electrical Engineering, IIT Delhi, from July 2014 to August 2016. He is
Range,” IEEE Trans. on Ind. Appl., vol. 49, no. 2, pp. 942–953, March- currently the Dean, Academics with IIT Delhi. He has guided 79 Ph.D.
April 2013. dissertations, 166 M.E./M.Tech./M.S.(R) thesis. He has executed more than 75
[10] Q. Wang and L. Chang, “An Intelligent Maximum Power Extraction sponsored and consultancy projects. His areas of research interests include
Algorithm for Inverter-Based Variable Speed Wind Turbine Systems,” photovoltaic (PV) grid interface systems, micro grid, power quality, and PV
IEEE Trans. Power Elect., vol. 19, no. 5, pp. 1242-1249, Sept. 2004. water pumping systems.
0885-8969 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Canberra. Downloaded on April 28,2020 at 07:42:49 UTC from IEEE Xplore. Restrictions apply.