Development and retrospective validation of an artificial intelligence system for diagnostic assessment of prostate biopsies: Study protocol

  • Nita Mulliqi
  • , Anders Blilie
  • , Xiaoyi Ji
  • , Kelvin Szolnoky
  • , Henrik Olsson
  • , Matteo Titus
  • , Geraldine Martinez Gonzalez
  • , Sol Erika Boman
  • , Masi Valkonen
  • , Einar Gudlaugsson
  • , Svein Reidar Kjosavik
  • , José Asenjo
  • , Marcello Gambacorta
  • , Paolo Libretti
  • , Marcin Braun
  • , Radzislaw Kordek
  • , Roman Łowicki
  • , Kristina Hotakainen
  • , Päivi Väre
  • , Bodil Ginnerup Pedersen
  • Karina Dalsgaard Sørensen, Benedicte Parm Ulhøi, Mattias Rantalainen, Pekka Ruusuvuori, Brett Delahunt, Hemamali Samaratunga, Toyonori Tsuzuki, Emilius Adrianus Maria Janssen, Lars Egevad, Kimmo Kartasalo*, Martin Eklund
*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

4 Citations (Scopus)
1 Downloads (Pure)

Abstract

Introduction Histopathological evaluation of prostate biopsies using the Gleason scoring system is critical for prostate cancer diagnosis and treatment selection. However, grading variability among pathologists can lead to inconsistent assessments, risking inappropriate treatment. Similar challenges complicate the assessment of other prognostic features like cribriform cancer morphology and perineural invasion. Many pathology departments are also facing an increasingly unsustainable workload due to rising prostate cancer incidence and a decreasing pathologist workforce coinciding with increasing requirements for more complex assessments and reporting. Digital pathology and artificial intelligence (AI) algorithms for analysing whole slide images show promise in improving the accuracy and efficiency of histopathological assessments. Studies have demonstrated AI's capability to diagnose and grade prostate cancer comparably to expert pathologists. However, external validations on diverse data sets have been limited and often show reduced performance. Historically, there have been no well-established guidelines for AI study designs and validation methods. Diagnostic assessments of AI systems often lack preregistered protocols and rigorous external cohort sampling, essential for reliable evidence of their safety and accuracy. Methods and analysis This study protocol covers the retrospective validation of an AI system for prostate biopsy assessment. The primary objective of the study is to develop a high-performing and robust AI model for diagnosis and Gleason scoring of prostate cancer in core needle biopsies, and at scale evaluate whether it can generalise to fully external data from independent patients, pathology laboratories and digitalisation platforms. The secondary objectives cover AI performance in estimating cancer extent and detecting cribriform prostate cancer and perineural invasion. This protocol outlines the steps for data collection, predefined partitioning of data cohorts for AI model training and validation, model development and predetermined statistical analyses, ensuring systematic development and comprehensive validation of the system. The protocol adheres to Transparent Reporting of a multivariable prediction model of Individual Prognosis Or Diagnosis+AI (TRIPOD+AI), Protocol Items for External Cohort Evaluation of a Deep Learning System in Cancer Diagnostics (PIECES), Checklist for AI in Medical Imaging (CLAIM) and other relevant best practices. Ethics and dissemination Data collection and usage were approved by the respective ethical review boards of each participating clinical laboratory, and centralised anonymised data handling was approved by the Swedish Ethical Review Authority. The study will be conducted in agreement with the Helsinki Declaration. The findings will be disseminated in peer-reviewed publications (open access).

Original languageEnglish
Article numbere097591
Number of pages19
JournalBMJ Open
Volume15
Issue number7
DOIs
Publication statusPublished - 7 Jul 2025
Publication typeA1 Journal article-refereed

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Artificial Intelligence
  • HISTOPATHOLOGY
  • Prostate
  • Retrospective Studies

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'Development and retrospective validation of an artificial intelligence system for diagnostic assessment of prostate biopsies: Study protocol'. Together they form a unique fingerprint.

Cite this