Diversity and inclusion within datasets in heart failure: a systematic review
Abstract
Background: Heart failure (HF) is a life-threatening disease affecting 64 million people worldwide. Artificial intelligence (AI) technologies are being developed for use in HF to support early diagnosis and stratification of treatment. The performance characteristics of AI technologies are influenced by whether the data used during the AI lifecycle reflects the populations for which the AI is used.
Objectives: The aim of the study was to identify and characterize datasets used across the lifecycle of AI technologies for HF, focusing on data diversity and inclusivity.
Methods: MEDLINE and Embase were systematically searched from January 1, 2012, until August 30, 2022, to identify articles relating to the development of AI in HF. Articles were independently screened by 2 reviewers to identify datasets. Dataset documentation was analyzed with a focus on accessibility, geographical origin, relevant metadata reporting, and dataset composition.
Results: The 72 datasets identified represented 23 countries and over 2 million individuals. In total, 62 (86%) datasets reported "age," 61 (85%) reported sex or gender, 21 (29%) reported race and/or ethnicity, and 8 (11%) reported socioeconomic status. In the 21 datasets that reported race and/or ethnicity, 89% of individuals represented were reported within the "White" or "Caucasian" category. Only 20 (28%) datasets were fully accessible.
Conclusions: Reporting of sex, gender, and socioeconomic status in HF datasets is inconsistent. There is a need to generate datasets that are transparently reported and accessible. Although collecting and reporting demographic attributes is complex and needs to be undertaken with appropriate safeguards, it is also an essential step toward building equitable AI-based health technologies.
Author
Laws, Elinor
Charalambides, Maria
Vadera, Sonam
Keller, Eva
Alderman, Joseph
Blackboro, Breanna
Hogg, Jeffry
Salisbury, Thomas
Palmer, Joanne
Calvert, Melanie
Mackintosh, Maxine
Matin, Rubeta
Sapey, Elizabeth
Ordish, Johan
McCradden, Melissa
Mateen, Bilal
Gath, Jacqui
Adebajo, Adewale
Kuku, Stephanie
Bradlow, William
Denniston, Alastair K
Liu, Xiaoxuan
Charalambides, Maria
Vadera, Sonam
Keller, Eva
Alderman, Joseph
Blackboro, Breanna
Hogg, Jeffry
Salisbury, Thomas
Palmer, Joanne
Calvert, Melanie
Mackintosh, Maxine
Matin, Rubeta
Sapey, Elizabeth
Ordish, Johan
McCradden, Melissa
Mateen, Bilal
Gath, Jacqui
Adebajo, Adewale
Kuku, Stephanie
Bradlow, William
Denniston, Alastair K
Liu, Xiaoxuan
Date
2025-03-26
Type
Article
Subject
Artificial intelligence, Biomedical technology, Heart failure, Equality, diversity and inclusion
Collections
Citation
Laws E, Charalambides M, Vadera S, Keller E, Alderman J, Blackboro B, Hogg J, Salisbury T, Palmer J, Calvert M, Mackintosh M, Matin R, Sapey E, Ordish J, McCradden M, Mateen B, Gath J, Adebajo A, Kuku S, Bradlow W, Denniston AK, Liu X. Diversity and Inclusion Within Datasets in Heart Failure: A Systematic Review. JACC Adv. 2025 Mar;4(3):101610. doi: 10.1016/j.jacadv.2025.101610.
Journal / Source Title
JACC. Advances
DOI
10.1016/j.jacadv.2025.101610
PMID
40155187
Publisher
Elsevier
Publisher’s URL
https://www.sciencedirect.com/journal/jacc-advances
