— 1 — An Investigation of Machine Learning Based Prediction Systems Carolyn Mair, Gada Kadoda, Martin Lefley, Keith Phalp Chris Schofield 1 , Martin Shepperd and Steve Webster Empirical Software Engineering Research Group Bournemouth University Talbot Campus Poole, BH12 5BB, UK {cmair, gkadoda, mlefley, kphalp, mshepper, swebster}@bmth.ac.uk http://dec.bmth.ac.uk/ESERG 9 July, 1999 Abstract Traditionally, researchers have used either off-the-shelf models such as COCOMO, or developed local models using statistical techniques such as stepwise regression, to obtain software effort estimates. More recently, attention has turned to a variety of machine learning methods such as artificial neural networks (ANNs), case-based reasoning (CBR) and rule induction (RI). This paper outlines some comparative research into the use of these three machine learning methods to build software effort prediction systems. We briefly describe each method and then apply the techniques to a dataset of 81 software projects derived from a Canadian software house in the late 1980s. We compare the 1 Chris Schofield is now with Nortel (cscho@nortelnetworks.com).