Paracelsus Medizinische Privatuniversität (PMU)

Forschung & Innovation
Publikationen

MixUp-MIL: Novel Data Augmentation for Multiple Instance Learning and a Study on Thyroid Cancer Diagnosis

#2023

PMU Autor*innen
Christina Erhardt-Kreutzer, Sébastien Couillard-Després

Alle Autor*innen
Michael Gadermayr, Lukas Koller, Maximilian Tschuchnig, Lea Maria Stangassigner, Christina Erhardt-Kreutzer, Sébastien Couillard-Després, Gertie Janneke Oostingh, Anton Hittmair

Kurzfassung

Multiple instance learning is a powerful approach for whole slide image-based diagnosis in the absence of pixel- or patch-level annotations. In spite of the huge size of whole slide images, the number of individual slides is often rather small, leading to a small number of labeled samples. To improve training, we propose and investigate novel data augmentation strategies for multiple instance learning based on the idea of linear and multilinear interpolation of feature vectors within and between individual whole slide images. Based on state-of-the-art multiple instance learning architectures and two thyroid cancer data sets, an exhaustive study was conducted considering a range of common data augmentation strategies. Whereas a strategy based on to the original MixUp approach showed decreases in accuracy, a novel multilinear intra-slide interpolation method led to consistent increases in accuracy.