Chemical Combinatorics for Alkane-Isomer Enumeration and More L. Bytautas ² and D. J. Klein* Texas A&M University at Galveston, Galveston, Texas 77553-1675 Received May 18, 1998 Standard combinatorial enumeration techniques for alkanes are considered with a view to the extension to a widened range of chemically interesting features. As one brief point it is noted that these standard techniques naturally associate to generational schemes and thence have nomenclatural interpretations, which may be made to achieve some similarity to the standard IUPAC nomenclature. Our primary focus is the illustration that such combinatorial techniques are sufficient to enable computation of several graph-theoretic structural invariants averaged over (different types of) isomer classes. Such averages (and associated isomer counts) are tabulated for structural isomers for up to N ) 40 carbons, where there are 10 14 isomers (though the computational methodology should rather readily extend to at least N ) 80 where there should be 10 28 isomers). The averages for invariants are utilized to estimate several physicochemical properties averaged over these same isomer classes. The properties currently so considered are heat of formation, index of refraction, and magnetic susceptibility. Further, various asymptotic results for counts, mean invariants, and mean properties are noted, so that the exact graph-theoretic data are extrapolated with high accuracy to arbitrarily large alkanes. 1. INTRODUCTION One of the classic areas of chemistry concerns isomerism, starting in the 1860s, when it was realized (as in ref 1) that there were different possible structural formulas for the same atomic composition. That such different structural formulas corresponded to different compounds provided crucial evi- dence for the validity of classical structural formulas and the existence of structured molecules. The problem of formal isomer enumeration was initiated by Cayley 2 in 1874 for the case of alkanes, and this continued as a topic of interest for some time, 3 particularly with the group of Henze 4 making several enumerations, for alkanes and homologous sequences of various derivatives. Then in 1935 motivated by the chemical isomer problem Po ´lya 5,6 developed a powerful combinatorial theory for the enumeration of symmetry- mediated equivalence classes of “colorings”, and this enu- meration theory has now become standard fare in combina- torics texts. Beyond the formal mathematical theory, Po ´lya applied his theory to a few chemical problems, but in particular the problem of alkane isomer enumeration, and following Po ´lya there have been further (often very formal) refinements for this chemical problem in numerous papers, e.g., as refs 7-16. Work has been done emphasizing 7,8,10,13 the generality of the method in dealing with additional systems beyond alkanes, and there has been (a lesser amount of) work 5,9-12 dealing with the asymptotic behavior of the isomer counts, primarily for alkanes. The pre-1986 work on Po ´lya enumeration for chemical purposes is nicely reviewed by Read 7 who also gives a translation (made by D. Aeppli) of Po ´lya’s foundational paper, and Fujita 13 considers in detail many recent theoretical extensions (primarily concerning chirality and symmetry questions not only for isomers but also for reaction processes). Yeh 17 and Cyvin and co-workers 18,19 have recently repeatedly noted that much of Po ´lya’s general mathematical formalism can be foregone in many specific isomer enumerationssthat is, whole chapters often occurring in standard combinatorics texts concerning formal consideration of permutation groups, general “cycle index” functions, and “pattern inventories” as well as general theory of implicit and holomorphic functions often need not be gone through for many specific cases. Here we adopt this straightforward point of view for the classic case of alkanes, thereby refining some earlier results, and we develop possibilities for some extensions to new chemically oriented applications beyond simple isomer enumeration. Initially in section 2 the classical enumeration problem is reconsidered using such a straightforward ap- proach. It is noted in passing that there is a potential relation of these mathematical enumeration procedures to structure generation and associated nomenclatural systems, and this is detailed in the Appendix. The primary aim here of developing a more widely useful combinatoric chemistry is pursued in sections 3 and 4 where we show how the standard combinatorial enumerative techniques can be readily ex- tended to compute average values of certain graph-theoretic invariants, the averages being over the members of isomer classes. A few averages of some interest in “polymer statistics” are considered, including average numbers of conformations per isomer, the average through-bond diameter of a class of isomers, and the average through-bond (or shortest-path) distance between carbons in a class of isomers. This last quantity is obtained via a computation of the average so-called 20,21 “Wiener number” for isomeric alkanes. Also average counts of primary, secondary, tertiary, and quaternary carbons are made. Such graph-theoretic invari- ² On leave from Institute of Theoretical Physics & Astronomy, Gostauto 12, 2600 Vilnius, Lithuania. 1063 J. Chem. Inf. Comput. Sci. 1998, 38, 1063-1078 10.1021/ci980095c CCC: $15.00 © 1998 American Chemical Society Published on Web 10/17/1998