Model-Free Optimized Tracking Control Heuristic

Wang, Ning

Model-Free Optimized Tracking Control Heuristic

dc.contributor.author	Wang, Ning
dc.contributor.supervisor	Gueaieb, Wail
dc.date.accessioned	2020-09-02T19:01:47Z
dc.date.available	2020-09-02T19:01:47Z
dc.date.issued	2020-09-02	en_US
dc.description.abstract	Tracking control algorithms often target the convergence of a tracking error. However, this can be at the expense of other important system characteristics, such as the control effort used to annihilate the tracking error, transient response, or steady-state characteristics, for example. Furthermore, most tracking control methods assume prior knowledge of the system dynamics, which is not always a realistic assumption, especially in the case of highly complex systems. In this thesis, a model-free optimized tracking control architectural heuristic is proposed. The suggested feedback system is composed of two control loops. The first is the tracking loop. It focuses on the convergence of the tracking error. It is implemented using two different model-free control algorithms for comparison purpose: Reinforcement Learning (RL) and the Nonlinear Threshold Accepting (NLTA) technique. The RL scheme reformulates the tracking error combinations into a form of Markov-Decision-Process (MDP) and applies Q-Learning to build the best tracking control policy for the dynamic system under consideration. On the other hand, the NLTA algorithm is applied to tune the gains of a PID controller. The second control loop is in the form of a nonlinear state feedback loop. It is implemented using a feedforward artificial neural network (ANN) to optimize a system-wide cost function which can be flexible enough to encompass a set of desired design requirements pertaining to the targeted system behavior. This may include, for instance, the target overshoot, settling time, rise time, etc. The proposed architectural heuristic provides a model-free framework to tackle such control problems, in the sense that the plant's dynamic model is not required to be known in advance. Yet, at least a subset of the stability region of the optimized gains has to be known in advance so that it can provide a search space for the optimization algorithms. Simulation results on two dynamic systems demonstrate the superiority of the proposed control scheme.	en_US
dc.identifier.uri	http://hdl.handle.net/10393/40911
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-25137
dc.language.iso	en	en_US
dc.publisher	Université d'Ottawa / University of Ottawa	en_US
dc.subject	Machine Learning	en_US
dc.subject	Tracking Control	en_US
dc.subject	Reinforcement Learning	en_US
dc.subject	Nonlinear Threshold Accepting Heuristic	en_US
dc.subject	Neural Networks	en_US
dc.title	Model-Free Optimized Tracking Control Heuristic	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Génie / Engineering	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	MASc	en_US
uottawa.department	Science informatique et génie électrique / Electrical Engineering and Computer Science	en_US

Fichiers

Trousse originale

Voici les éléments 1 - 1 sur 1

Nom:: Wang_Ning_2020_thesis.pdf
Taille:: 14.47 MB
Format:: Adobe Portable Document Format
Description:: Master thesis of Ning Wang

Télécharger

Trousse de licence

Voici les éléments 1 - 1 sur 1

Nom:: license.txt
Taille:: 6.65 KB
Format:: Item-specific license agreed upon to submission
Description:

Télécharger

Collections

- Thèses, 2011 - // Theses, 2011 -