Microsoft collaboration launches whole-slide AI model for digital pathology

By Conor Hale May 22, 2024 11:15am

A digital pathology collaboration between Microsoft, the University of Washington and Providence health network aims to overcome a few of the obstacles to fully implementing artificial intelligence in the field of cancer diagnostics—and in some cases, through sheer scale.

The team of researchers put forward a machine learning model that, according to Providence, is built upon one of the largest AI training efforts to date in real-world, whole-slide tissue analysis.

That includes 1.3 billion pathology images derived from more than 171,000 scanned slides provided by the health system—which pegs the dataset’s size as five to 10 times larger than other curated collections, such as The Cancer Genome Atlas.

The slides were taken from more than 30,000 patients and span 31 major tissue types, while the project as a whole also includes radiology scans, genomics results and patient health records.

“This transformative work is the result of focused efforts to overcome three major challenges that have stymied previous computational pathology models from widely being applied in the clinical setting: shortage of real-world data, inability to incorporate whole-slide modeling and lack of accessibility,” Ari Robicsek, Providence’s chief analytics and research officer, said in the health system’s blog post.

To digest it all, researchers adapted Microsoft’s LongNet program, which operates similarly to large language models, but with the ability to tackle much longer sequences of data. For example, a written prompt to an AI chatbot may be read by the computer as a sequence made of dozens of interconnected tokens—while LongNet is built to handle as many as 1 billion tokens at once.

The result is Prov-GigaPath, an AI pathology model designed to read patterns across the entire slide, with the goal of improving predictions about a patient’s particular cancer mutations and their subtypes as well as the effects the tumor microenvironment may have on different therapies.

Microsoft collaboration launches whole-slide AI model for digital pathology

Related

Related