Evaluation of Artificial Intelligence-Based Gleason Grading Algorithms "in the Wild"

Khrystyna Faryna; Leslie Tessier; Juan Retamero; Saikiran Bonthu; Pranab Samanta; Nitin Singhal; Solène-Florence Kammerer-Jacquet; Camelia Radulescu; Vittorio Agosti; Alexandre Collin; Xavier Farré; Jacqueline Fontugne; Rainer Grobholz; Agnes Marije Hoogland; Katia Ramos Moreira Leite; Murat Oktay; Antonio Polonia; Paromita Roy; Paulo Guilherme; Theodorus H. van der Kwast; Jolique van Ipenburg; Jeroen van Der Laak; Geert Litjens

doi:10.1016/j.modpat.2024.100563

Journal article, Modern Pathology, Year: 2024

Khrystyna Faryna

Role: Corresponding author

Radboud University [Nijmegen]

Leslie Tessier

Role: Author

Juan Retamero

Role: Author

Saikiran Bonthu

Role: Author

Pranab Samanta

Role: Author

Nitin Singhal

Role: Author

Solène-Florence Kammerer-Jacquet

Role: Author
PersonId: 1020252
ORCID: 0000-0002-6623-741X

Centre Hospitalier Universitaire de Rennes [CHU Rennes] = Rennes University Hospital [Pontchaillou]

Institut de recherche en santé, environnement et travail

Camelia Radulescu

Role: Author
PersonId: 1218540
ORCID: 0000-0002-3704-8679

Hôpital Foch [Suresnes]

Vittorio Agosti

Role: Author

Università degli Studi di Brescia = University of Brescia

Alexandre Collin

Role: Author

Centre Hospitalier Universitaire d'Angers

Xavier Farré

Role: Author

Jacqueline Fontugne

Role: Author
PersonId: 1187454
ORCID: 0000-0001-9919-1641

Institut Curie [Paris]

Rainer Grobholz

Role: Author

Agnes Marije Hoogland

Role: Author

Katia Ramos Moreira Leite

Role: Author

Universidade de São Paulo = University of São Paulo

Murat Oktay

Role: Author

Antonio Polonia

Role: Author
PersonId: 817044
ORCID: 0000-0001-8312-1681

Instituto de Patologia e Imunologia Molecular da Universidade do Porto

Paromita Roy

Role: Author

Paulo Guilherme

Role: Author

Theodorus H. van der Kwast

Role: Author
PersonId: 1247291
ORCID: 0000-0001-8640-5786

Princess Margaret Hospital

Jolique van Ipenburg

Role: Author

Radboud University [Nijmegen]

Jeroen van Der Laak

Role: Author

Radboud University [Nijmegen]

Geert Litjens

Role: Author
PersonId: 1356980
ORCID: 0000-0003-1554-1291

Radboud University [Nijmegen]

Abstract

The biopsy Gleason score is an important prognostic marker for prostate cancer patients. It is, however, subject to substantial variability among pathologists. Artificial intelligence (AI)-based algorithms employing deep learning have shown their ability to match pathologists' performance in assigning Gleason scores, with the potential to enhance pathologists' grading accuracy. The performance of Gleason AI algorithms in research is mostly reported on common benchmark data sets or within public challenges. In contrast, many commercial algorithms are evaluated in clinical studies, for which data are not publicly released. As commercial AI vendors typically do not publish performance on public benchmarks, comparison between research and commercial AI is difficult. The aims of this study are to evaluate and compare the performance of top-ranked public and commercial algorithms using real-world data. We curated a diverse data set of whole-slide prostate biopsy images through crowdsourcing, containing images with a range of Gleason scores and from diverse sources. Predictions were obtained from 5 top-ranked public algorithms from the Prostate cANcer graDe Assessment (PANDA) challenge and 2 commercial Gleason grading algorithms. Additionally, 10 pathologists (A.C., C.R., J.v.I., K.R.M.L., P.R., P.G.S., R.G., S.F.K.J., T.v.d.K., X.F.) evaluated the data set in a reader study. Overall, the pairwise quadratic weighted kappa among pathologists ranged from 0.777 to 0.916. Both public and commercial algorithms showed high agreement with pathologists, with quadratic weighted kappa ranging from 0.617 to 0.900. Commercial algorithms performed on par with or outperformed top public algorithms.

© 2024 THE AUTHORS. Published by Elsevier Inc. on behalf of the United States and Canadian Academy of Pathology. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
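For readers unfamiliar with the agreement metric reported in the abstract, the following is a minimal sketch of quadratic weighted kappa for two raters on an ordinal scale. The function name, the integer encoding of grades (0 for benign, 1 to 5 for grade groups), and the example ratings are illustrative assumptions; this is not the study's evaluation code or data.

```python
from collections import Counter


def quadratic_weighted_kappa(rater_a, rater_b, num_classes):
    """Cohen's kappa with quadratic weights for two raters.

    Ratings are integers in 0..num_classes-1 (an assumed encoding).
    Returns 1.0 for perfect agreement, about 0.0 for chance-level
    agreement, and negative values for systematic disagreement.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    if num_classes < 2:
        raise ValueError("num_classes must be at least 2")

    n = len(rater_a)
    observed = Counter(zip(rater_a, rater_b))  # joint counts of (a, b) pairs
    hist_a = Counter(rater_a)                  # marginal counts for rater A
    hist_b = Counter(rater_b)                  # marginal counts for rater B
    max_sq_dist = (num_classes - 1) ** 2

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(num_classes):
        for j in range(num_classes):
            # Penalty grows with the squared distance between the two grades.
            weight = (i - j) ** 2 / max_sq_dist
            weighted_observed += weight * observed[(i, j)] / n
            # Expected joint frequency if the raters were independent.
            weighted_expected += weight * hist_a[i] * hist_b[j] / (n * n)

    if weighted_expected == 0.0:
        # Both raters used one and the same category: kappa is undefined.
        return float("nan")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical ratings for eight biopsies (not taken from the study).
pathologist = [0, 1, 2, 3, 5, 4, 1, 0]
algorithm = [0, 1, 3, 3, 4, 4, 2, 0]
print(quadratic_weighted_kappa(pathologist, algorithm, num_classes=6))
```

Because the weights are quadratic, a disagreement of two grade groups is penalized four times as heavily as a disagreement of one, which suits ordinal scales where near misses matter less than large discrepancies.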

Keywords

artificial intelligence; computational pathology; deep learning; Gleason grading

Domains

Public health and epidemiology

Main file

1-s2.0-S0893395224001431-main.pdf (5.36 MB)

Origin: Publication funded by an institution

Contributor: Laurent Jonchère

https://hal.science/hal-04721197

Submitted on: Friday, October 4, 2024, 11:46:26

Last modified on: Wednesday, November 13, 2024, 17:02:04

Dates and versions

hal-04721197, version 1 (04-10-2024)

License

Attribution - NonCommercial - NoDerivatives

Identifiers

HAL Id: hal-04721197, version 1
DOI: 10.1016/j.modpat.2024.100563
PubMed: 39025402

Cite

Khrystyna Faryna, Leslie Tessier, Juan Retamero, Saikiran Bonthu, Pranab Samanta, et al. Evaluation of Artificial Intelligence-Based Gleason Grading Algorithms "in the Wild". Modern Pathology, 2024, 37 (11), pp.100563. ⟨10.1016/j.modpat.2024.100563⟩. ⟨hal-04721197⟩

Collections

UNIV-RENNES1 UNIV-ANGERS FNCLCC CURIE UNAM IRSET UR1-UFR-SVE PSL UNIV-RENNES UR1-BIO-SA
