Learning in the synthesis of data-driven variable-gain controllers Marcel Heertjes, Bram Hunnekens, Nathan van de Wouw, and Henk Nijmeijer Abstract— To deal with performance trade-offs in the control of motion systems, a method is developed for designing variable- gain feedback controllers. The idea is to select a piecewise afﬁne controller structure and, subsequently, to ﬁnd the nonlinear controller parameter values of this structure by data-driven per- formance optimization. Herein an H2 performance objective is minimized. As a result, variable-gain controllers are synthesized using techniques from the ﬁeld of learning and optimization. The method is applied to a wafer stage simulation model. Index Terms— data-driven optimization, gradient methods, Lur’e systems, motion control, self-tuning, wafer scanners I. INTRODUCTION In the wafer scanning industry [4] variable-gain control is used to deal with performance trade-offs otherwise occurring under linear feedback; see also [9], [12], [13], [20]. The rationale is that low-frequency vibrations induced by the motion set-points of wafer scanners are better suppressed under high-gain feedback. Contrarily, high-frequency noise encountered during constant (scanning) velocity, thus in the absence of said set-points, becomes less ampliﬁed under low-gain feedback. Exploiting the property of continuously varying the controller gain, for example by using a deadzone nonlinearity in the controller structure [6], variable-gain control provides the means to suppress set-point induced vibrations under high-gain feedback in one part of the scan while keeping a low-gain noise response in another part. Tuning of the variable gain controller generally refers to frequency-domain loop shaping of the underlying linear sys- tem [6]. This is done by splitting up the nonlinear system into a linear system in feedback connection with a nonlinearity, i.e. adopt a Lur’e system formulation [21]. The linear system represents a closed-loop motion system, whose characteris- tics are the result of frequency-domain tunings [18]. On the one hand, these tunings should render the closed-loop system robustly stable, i.e. being able to deal with the effect of plant resonances (and the uncertainty thereabout) on the closed- loop stability properties. On the other hand, the tunings aim at improved closed-loop performance properties in view of the trade-off between low-frequency disturbance suppression and high-frequency noise ampliﬁcation. Stability of the nonlinear closed-loop system can be guar- anteed by frequency-domain evaluation through the circle criterion [6]. Performance, however, very much depends on the choice of the variable gains, the plant characteristics, and the (unknown) disturbances acting on the system, which vary from machine to machine but also from ﬁeld (wafer area during exposure) to ﬁeld. It therefore makes sense to All authors are with the Department of Mechanical Engineering, Eind- hoven University of Technology, 5600 MB Eindhoven, The Nether- lands. M.F. Heertjes is also with ASML, Veldhoven, The Netherlands. marcel.heertjes@asml.com This work is ﬁnancially supported by the Dutch Technology Foundation STW. assess (servo) performance per machine or ﬁeld and in time domain, the latter in view of the nonlinearity in the loop. This paper explicitly deals with the performance-based design of the nonlinear part of the Lur’e system. Observing that an accurate model description of the plant and the disturbances (suited for performance evaluation) is often lacking, an early selection of a ﬁxed nonlinear structure, e.g. deadzone or saturation [1], is not likely to induce best time-domain performances on individual machines during later machine qualiﬁcations. In [7], this problem was (only partly) tackled by self-tuning of the parameters of a deadzone nonlinearity, which was done per machine. With an itera- tive data-driven approach, see [2], [3], [5], [14] for other approaches, sampled data provided the information to ﬁnd an updated set of parameters on the basis of least-squares optimization; for self-tuning in the nonlinear context, see also [8], [11], [16]. In this paper, the nonlinearity is given by piecewise afﬁne functions having neither pre-deﬁned gains nor switching lengths. Adopting the iterative approach from [7] gives a method for data-driven variable-gain controller synthesis in which the gains and switching lengths can be tuned per machine or even (in the learning sense) from ﬁeld to ﬁeld. The latter has strong similarities with learning approaches having varying learning gains and/or Q-ﬁlters [19]. In contrast with these approaches, however, the method presented here considers (nonlinear) ﬁlter design rather than signal design. Also, it classiﬁes under feedback instead of under feedforward control. In a companion paper [10], a model-based approach is considered which is more suited for (large-scale) parameter studies conducted in the design phase when no machine measurements are yet available. De- pending on the speciﬁc disturbances and plant characteristics, the nonlinearity can become deadzone, saturation, or any other structure the piecewise afﬁne function supports; this is different from [7] where a ﬁxed deadzone structure is used. Dedicated optimization of machine performance is obtained with guaranteed (robust) stability properties. This is favorable for the motion industry in dealing with performance variation among machines, and for the wafer scanning industry in particular. The remainder of the paper is organized as follows. In Section 2, the Lur’e-type system description of the variable- gain motion control system is discussed. This includes the introduction of a three-parameter piecewise afﬁne (variable- gain) function, a stability analysis using the circle criterion, and a lifted system description describing the closed-loop dynamics of a sampled-data implementation of the nonlinear controller. In Section 3, a data-driven H 2 optimization ap- proach is presented that is used to ﬁnd the optimal parameters of the piecewise afﬁne function. Section 4 addresses a gen- eralization of the optimization method toward more arbitrary 2013 American Control Conference (ACC) Washington, DC, USA, June 17-19, 2013 978-1-4799-0176-0/$31.00 ©2013 AACC 6700