Biomedical Signal Processing and Control 44 (2018) 229–236 Contents lists available at ScienceDirect Biomedical Signal Processing and Control journal homepage: www.elsevier.com/locate/bspc Estimation of the second heart sound split using windowed sinusoidal models Rasmus G. Sæderup a, , Poul Hoang a, , Simon Winther c,d , Morten Bøttcher d , Johannes Struijk b , Samuel Schmidt b , Jan Østergaard a a Department of Electronic Systems, Aalborg University, Fredrik Bajers Vej 7B, 9220 Aalborg Øst, Denmark b Department of Health Science and Technology, Aalborg University, Fredrik Bajers Vej 7D, 9220 Aalborg Øst, Denmark c Department of Internal Medicine, Hospital Unit West, Gl. Landevej 61, 7400 Herning, Denmark d Department of Cardiology, Aarhus University Hospital, Aarhus, Denmark a r t i c l e i n f o Article history: Received 17 October 2017 Received in revised form 19 March 2018 Accepted 9 April 2018 a b s t r a c t Knowing the time difference between the onsets of the aortic part (A2) and the pulmonic part (P2) of the second heart sound (S2), also referred to as the time split (TS) of S2, can assist in the diagnosis of a variety of heart diseases. However, estimating the TS is a non-trivial task due to the potential overlap between A2 and P2. In this paper, a model-based approach is proposed where both A2 and P2 are modeled as windowed sinusoids with their sum forming the S2 signal. Estimation of the model parameters and the S2 split form a non-convex optimization problem, where a local minimum is obtained using a sequential optimization procedure. First, the window parameters are found as the solution to a regularized least squares problem. Then, the frequencies and phases of the sinusoids are found by locating the maximal peaks of the heart signals’ frequency magnitudes, and using the corresponding phases. Finally, the TS is estimated as the time difference between the peaks of the cross-correlations between the measured S2 signal and the modeled A2/P2 signals. The algorithm is able to estimate the TS for synthetic signals with a root-mean-square error (RMSE) of 7.6 ms for equidistantly placed TSs between 70 ms and 70 ms. The RMSE increases for small TSs in the interval 30 ms to 10 ms, and at increased noise levels. The algorithm was applied to phonocardiograms recorded from 146 patients, where the average estimated TS was 29.6 ms, in conformance with the average TS of healthy subjects as found in the literature. © 2018 Elsevier Ltd. All rights reserved. 1. Introduction The second heart sound, also known as the S2 sound, is com- prised of two sounds, namely the sound from the aortic valve closure (A2) and the sound from the pulmonary valve closure (P2). The time difference between the onsets of A2 and P2 is referred to as the S2 split [1] or the time split (TS). Determining the TS is of great importance, as it can assist in the diagnosis of a wide variety of heart diseases such as left bundle branch block (LBBB), right bundle branch block (RBBB), severe congestive heart failure, atrial septal defect or pulmonic stenosis [2]. Pulmonary hypertension (PH) is a disease where the patient has an increased pulmonary arterial pressure (PAP) and it is difficult to diagnose non-invasively [3]. A strong relation between the normalized splitting interval of S2 and Corresponding authors. E-mail addresses: rsader13@student.aau.dk (R.G. Sæderup), phoang13@student.aau.dk (P. Hoang). the systolic PAP has been found, hence making it possible to esti- mate the PAP non-invasively by estimating the TS using acoustical recordings of the heart also known as phonocardiograms (PCG) [4]. A measured S2 signal from a PCG recording is shown in Fig. 1, having a TS of 38 ms. Normally, A2 precedes P2 by 20–80 ms (30–40 ms on average) when inhaling [2], which means that A2 and P2 can be distinguished as two separate sounds. When exhaling, P2 rarely occurs after A2 by more than 30 ms, often yielding that the A2 and P2 sounds are perceived as occurring simultaneously and can therefore not be distinguished by humans [5]. Various methods related to estimation of the TS have been found in the literature, and can overall be put in three categories: model- ing the S2 sound, estimating the TS directly using signal processing methods, and source separating S2 into A2 and P2. For S2 modeling, the methods used include nonlinear chirps [6] and exponentially decaying sinusoids [7]. Estimating the TS directly from the data can be done by using the Hilbert transform to find the envelope of the signal [1], and https://doi.org/10.1016/j.bspc.2018.04.006 1746-8094/© 2018 Elsevier Ltd. All rights reserved.