Satisfiability as a Classification Problem ⋆

David Devlin and Barry O’Sullivan

Cork Constraint Computation Centre
Department of Computer Science, University College Cork, Ireland
{d.devlin|b.osullivan}@4c.ucc.ie

Abstract. Given a Boolean formula, the classic satisfiability problem is to decide whether there is a truth assignment to its variables such that the formula evaluates to true. The satisfiability problem was the first decision problem proven to be NP-Complete. Therefore, it is very unlikely that there exists an algorithm for solving the satisfiability problem that has good worst-case performance. However, the satisfiability problem is ubiquitous in artificial intelligence. In this paper, we view the satisfiability problem as a classification task. Based on easy to compute structural features of instances of large satisfiability problems we use a variety of standard classifier learners to classify previously unseen instances of the satisfiability problem as either satisfiable or unsatisfiable. We show that standard learning techniques can very reliably perform this task, with accuracy in excess of 99% for hard 3-SAT problems, and usually in excess of 90% for large industrial benchmarks. These results are surprising, and suggest that machine learning techniques can be very effective at revealing the significant structural characteristics that are important in satisfiability testing.

1 Introduction

The satisfiability problem (Sat) is, informally, defined as: given a Boolean formula, decide whether there is a truth assignment to each of its variables such that the formula evaluates to true. For example, consider the Boolean function And over variables A and B:

$\varphi_{\mathrm{And}}(A, B) = A \cdot B$

This formula evaluates to true if both A and B are assigned true. We say that this formula is, therefore, satisfiable.

Sat is important from both theoretical and practical perspectives. Sat was the first problem proved to be NP-Complete.
The proof, Cook’s Theorem, was published in a 1971 paper by Stephen A. Cook [3]. That was a breakthrough result in computational complexity. Leonid Levin independently discovered the proof of Sat’s NP-Completeness, although he did not formally publish it until 1973. For this reason the proof is usually referred to as the Cook-Levin Theorem.

⋆ This work was supported by Science Foundation Ireland (Grant No. 05/IN/I886).
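To make the informal definition of Sat given in the Introduction concrete, the following minimal Python sketch decides satisfiability by exhaustive enumeration of truth assignments. It is an illustration only and is not the classification approach studied in this paper; the function name `find_satisfying_assignment` and the representation of a formula as a Python callable over a dictionary of variable values are assumptions made for this example.

```python
from itertools import product
from typing import Callable, Dict, Optional, Sequence

# Hypothetical representation: an assignment maps variable names to truth values.
Assignment = Dict[str, bool]


def find_satisfying_assignment(
    variables: Sequence[str],
    formula: Callable[[Assignment], bool],
) -> Optional[Assignment]:
    """Return a satisfying assignment if one exists, otherwise None.

    Tries all 2^n truth assignments over the n given variables.
    """
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if formula(assignment):
            return assignment
    return None


# The And function from the Introduction: true only when A and B are both true.
def phi_and(a: Assignment) -> bool:
    return a["A"] and a["B"]


# A formula with no satisfying assignment: A and (not A).
def contradiction(a: Assignment) -> bool:
    return a["A"] and not a["A"]


print(find_satisfying_assignment(["A", "B"], phi_and))   # {'A': True, 'B': True}
print(find_satisfying_assignment(["A"], contradiction))  # None
```

The loop examines up to 2^n assignments for n variables, so this sketch is practical only for very small formulae. That exponential growth is the kind of poor worst-case behaviour the abstract refers to.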