Autonomous Decision Making Process Supporting Cognitive Waveform Design Nicolas Colson *† , Apostolos Kountouris * * France Telecom R&D, Orange Labs 28, Chemin du Vieux Chˆ ene, 38243 Meylan, France Email: nicolas.colson@orange-ftgroup.com Armelle Wautier † , Lionel Husson † † Department of Telecommunications, SUPELEC 91192 Gif-sur-Yvette, France Abstract—In this paper, we propose a cognitive decision making process driving the dynamic reconﬁguration of a radio. The solution results from an original modeling of the cognitive design task based on the deﬁnition of two scales characterizing the solution space. It exploits the predictive capabilities of evolving connectionist systems improving their reliability through incremental learning as the radio interacts with its environment. The whole algorithm has been named RALFE for Reason And Learn From Experience based on its trial/error approach of the problem. This cognitive algorithm allows autonomous decision making with regard to multiple, possibly conﬂicting, operational objectives in a time-varying environment. The proposed approach is validated on a case of cognitive waveform design. I. I NTRODUCTION Cognitive radio [1], [2] is a technological concept pushing for the introduction of intelligent radio operation that goes beyond system adaptation and reconﬁguration on the basis of simple criteria and rules. To become fully operational, a Cognitive Radio requires three main conditions. First, the device must be built on top of a Software Deﬁned Radio platform [3] to beneﬁt from its ﬂexibility and agility. This condition makes the adaptation possible. Second, the de- vice must have a broad sensory activity in order to collect enough information to identify the context of the current communication: available services, radio link quality, service- dependent Quality of Service (QoS) requirements, regulation environment, remaining battery lifetime, etc. This condition is required to build a model of the context to adapt to. Third, the device must integrate a cognitive engine (CE) in order to solve, at runtime, the design problems formulated by the time- varying radio environment and the operational objectives set according to the context. This condition is central for cognitive behavior since it maps the context model to an appropriate action and thus corresponds to the intelligence per se. The cognitive behavior of a radio manifests itself through context awareness, multi-objective optimization capability, expertise based decision-making augmented by learning capabilities and last but not least transparent and computationally efﬁcient operation. These are, in a nutshell, the high-level requirements of the decision making approach proposed in this paper. To our knowledge, the only comprehensive approach with similar scope is that of Virginia Tech [2, Chap. 7]. VTech’s CE makes use of advanced genetic algorithms techniques [4] to control the cognitive radio reconﬁguration. A genetic al- gorithm is an efﬁcient metaheuristic for solving optimization problems. Its success in optimization results from the parallel search of multiple solutions within the problem space. How- ever, it is often balanced by a slow convergence or a heavy computational load inherent to the evaluation of multiple solutions over several generations. The paper is organized as follows. Section II provides a general description of the proposed solution. Section III applies the solution to a simple case study of cognitive waveform design and presents experimental results validating the approach. Finally, Section IV concludes the paper and discusses some future work perspectives. II. THE PROPOSED SOLUTION A. Modeling of the cognitive design task Design objectives may be classiﬁed into two categories. Constrained objectives limit the search space by forbidding some alternatives violating one or several imposed constraints to meet (e.g. QoS requirements, regulation environment). Optimization objectives guide the decision making process toward an appropriate solution among the ones compatible with the constraints (e.g. download faster, save the battery energy). These two kinds of objectives are speciﬁed through the deﬁnition of bounding/optimization policies deﬁned dy- namically in relation to the operational context evolution. This distinction between objectives lead us to the deﬁnition of two scales. The ﬁrst scale, called the Performance Scale (PS), ranks the conﬁgurations according to their robustness toward the constraints. This scale is set from expert knowledge acquired beforehand through simulation and stored into the CE. For example, the robustness toward transmission errors may be deduced from the relative position of the Bit Error Rate (BER) curves produced for some predeﬁned typical channel models. The second scale, called the Optimality Scale (OS), ranks the conﬁgurations according to their potential optimality with regard to the secondary objectives. This scale is set thanks to a grading system relying on conﬁguration-speciﬁc performance indicators to assign grades according to the objectives impor- tance. For example, a conﬁguration will be more satisfying in terms of throughput if it can transmit many information bits per symbol. However, attractive conﬁgurations are also more liable to violate the constraints due to their reduced robustness 978-1-4244-2644-7/08/$25.00 © 2008 IEEE