Temporal convolutional neural network-based feature extraction and asynchronous channel information fusion method for heart abnormality detection in phonocardiograms
- Authors
- Shin, Jae-Man; Park, Seongyong; Shin, Keewon; Seo, Woo-Young; Kim, Hyun-Seok; Kim, Dong-Kyu; Moon, Baehun; Cha, Seul-Gi; Shin, Won-Jung; Kim, Sung-Hoon
- Issue Date
- Sep-2025
- Publisher
- Elsevier BV
- Keywords
- Auscultation; Murmur; Phonocardiogram (PCG); Temporal convolutional neural networks
- Citation
- Computer Methods and Programs in Biomedicine, v.269
- Indexed
- SCIE
SCOPUS
- Journal Title
- Computer Methods and Programs in Biomedicine
- Volume
- 269
- URI
- https://scholarworks.korea.ac.kr/kumedicine/handle/2021.sw.kumedicine/77691
- DOI
- 10.1016/j.cmpb.2025.108871
- ISSN
- 0169-2607
1872-7565
- Abstract
- Background and Objective: Auscultation-based cardiac abnormality detection is valuable screening approach in pediatric populations, particularly in resource-limited settings. However, its clinical utility is often limited by phonocardiogram (PCG) signal variability and a difficulty in distinguishing between pathological and innocent murmurs. Methods: We proposed a framework that leverages temporal convolutional network (TCN)-based feature extraction and information fusion to integrate asynchronously acquired PCG recordings at the patient level. A probabilistic representation of the pathological state was first extracted from segmented PCG signals using a TCN-based model. These segment-level representations were subsequently averaged to generate record- or patient-level features. The framework was designed to accommodate recordings of varying durations and different auscultation locations. Furthermore, we addressed domain adaptation challenges in cardiac abnormality detection by incorporating transfer learning techniques. Results: The proposed method was evaluated using two large, independent public PCG datasets, demonstrating robust performance at both record and patient levels. While its initial performance on an unseen external dataset was modest, likely due to demographic characteristics and signal acquisition, transfer learning significantly improved the model's performance, yielding an area under the receiver operating characteristic curve of 0.931 +0.027 and an area under the precision-recall curve of 0.867+0.064 in external validation. Combining internal and external datasets further enhanced model generalizability. Conclusion: This proposed framework accommodates multi-channel, variable-length PCG recordings, making it a flexible and accurate solution for detecting pediatric cardiac abnormalities, particularly in low-resource settings. The source code is publicly available on Github (https://github.com/baporlab/pcg_pathological_murmur_detect ion).
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 5. Others > ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.