Evaluating artificial intelligence-enabled medical tests in cardiology: Best practice

Jonas L. Isaksen*, Malene Norregaard, Martin Manninger, Dobromir Dobrev, Thomas Jespersen, Ben Hermans, Jordi Heijman, Gernot Plank, Daniel Scherr, Thomas Pock, Vajira Thambawita, Michael A. Riegler, Jorgen K. Kanters, Dominik Linz

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Machine learning methods are increasingly used in cardiovascular research. In order to highlight opportunities and challenges of the evaluation of studies applying machine learning, we use examples from cardiac electrophysiology, a field characterized by large and often imbalanced amounts of data. We provide recommendations and guidance on evaluating and presenting supervised machine learning studies. We recommend proper cohort selection, keeping training and testing data strictly separate, and comparing results to a reference model without machine learning as basic principles to ensure the quality of studies using machine learning methods. We furthermore recommend specific metrics and plots when reporting on machine learning including on models for multi-channel time series or images. This Best Practice paper represents a possible blueprint to help evaluate machine learning-based medical tests in cardiac electrophysiology and beyond.
Original languageEnglish
Article number101783
Number of pages9
JournalIJC Heart and Vasculature
Volume60
Early online date1 Aug 2025
DOIs
Publication statusPublished - 1 Oct 2025

Keywords

  • Artificial Intelligence
  • Machine Learning
  • Deep Learning
  • Medical Test
  • Evaluation
  • ATRIAL-FIBRILLATION
  • AI HALLUCINATIONS
  • PREDICTING STROKE
  • RISK
  • CLASSIFICATION
  • MORTALITY
  • DESIGN

Fingerprint

Dive into the research topics of 'Evaluating artificial intelligence-enabled medical tests in cardiology: Best practice'. Together they form a unique fingerprint.

Cite this