MitoEM Dataset: Large-scale 3D Mitochondria Instance Segmentation from EM Images


Donglai Wei1  Zudi Lin1  Daniel Franco-Barranco2,3  Nils Wendt4*  Xingyu Liu5*  Wenjie Yin1*  Xin Huang6*  Aarush Gupta7* 
Won-Dong Jang1     Xueying Wang1     Ignacio Arganda-Carreras2,3,8     Jeff W. Lichtman1     Hanspeter Pfister1    
1Harvard University     2Donostia International Physics Center     3University of the Basque Country
4Technical University of Munich     5Shanghai Jiao Tong University     6Northeastern University
7Indian Institute of Technology Roorkee     8Ikerbasque, Basque Foundation for Science
* Works were done as interns at Harvard University
MICCAI 2020 / ISBI 2021 Challenge

[Paper (Updated Results)]      [Code]      [Dataset]



Abstract


    Electron microscopy (EM) allows the identification of intracellular organelles such as mitochondria, providing insights for clinical and scientific studies. However, public mitochondria segmentation datasets only contain hundreds of instances with simple shapes. It is unclear if existing methods achieving human-level accuracy on these small datasets are robust in practice. To this end, we introduce the MitoEM dataset, a 3D mitochondria instance segmentation dataset with two 30x30x30 μm³ volumes from human and rat cortices respectively, 3,600x larger than previous benchmarks. With around 40K instances, we find a great diversity of mitochondria in terms of shape and density. For evaluation, we tailor the implementation of the average precision (AP) metric for 3D data with a 45x speedup. On MitoEM, we find existing instance segmentation methods often fail to correctly segment mitochondria with complex shapes or close contacts with other instances. Thus, our MitoEM dataset poses new challenges to the field.
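    To make the AP metric mentioned above concrete, here is a minimal, unoptimized Python sketch of average precision for 3D instance segmentation. It is not the accelerated implementation described in the paper. The set-of-voxels representation, the per-instance confidence scores, the greedy matching, the helper names (`cube`, `iou_3d`, `average_precision`) and the default IoU threshold of 0.75 are all assumptions made for illustration.

```python
from typing import Dict, Set, Tuple

Voxel = Tuple[int, int, int]  # (z, y, x) coordinate of one voxel


def cube(z0: int, y0: int, x0: int, size: int) -> Set[Voxel]:
    """Hypothetical helper: a solid cube of voxels used as a toy instance."""
    return {(z, y, x)
            for z in range(z0, z0 + size)
            for y in range(y0, y0 + size)
            for x in range(x0, x0 + size)}


def iou_3d(a: Set[Voxel], b: Set[Voxel]) -> float:
    """Intersection over union of two voxel sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def average_precision(gt: Dict[int, Set[Voxel]],
                      pred: Dict[int, Set[Voxel]],
                      scores: Dict[int, float],
                      iou_thresh: float = 0.75) -> float:
    """AP at a single IoU threshold (the threshold value is an assumption)."""
    # Visit predictions from most to least confident.
    order = sorted(pred, key=lambda pid: scores[pid], reverse=True)
    matched = set()   # ground-truth ids already claimed by a prediction
    hits = []         # True for a true positive, False for a false positive
    for pid in order:
        best_gid, best_iou = None, 0.0
        for gid, voxels in gt.items():
            if gid in matched:
                continue
            iou = iou_3d(pred[pid], voxels)
            if iou > best_iou:
                best_gid, best_iou = gid, iou
        if best_gid is not None and best_iou >= iou_thresh:
            matched.add(best_gid)
            hits.append(True)
        else:
            hits.append(False)

    # Precision and recall after each ranked prediction.
    precisions, recalls, tp = [], [], 0
    for rank, hit in enumerate(hits, start=1):
        tp += hit
        precisions.append(tp / rank)
        recalls.append(tp / len(gt) if gt else 0.0)

    # Make precision non-increasing from right to left (interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Area under the interpolated precision-recall curve.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


# Toy example: two ground-truth cubes, three predictions.
gt = {1: cube(0, 0, 0, 2), 2: cube(10, 0, 0, 2)}
pred = {1: cube(0, 0, 0, 2),    # exact match of instance 1
        2: cube(11, 0, 0, 2),   # shifted copy of instance 2, IoU = 1/3
        3: cube(10, 0, 0, 2)}   # exact match of instance 2
scores = {1: 0.9, 2: 0.8, 3: 0.5}
ap = average_precision(gt, pred, scores)
print(round(ap, 4))  # 0.8333
```

    Comparing every prediction with every ground-truth instance, as done here, scales poorly to tens of thousands of instances, which is the kind of cost an efficient 3D implementation has to avoid.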


Dataset

(Left) We plot the length versus volume of mitochondria instances for both volumes, where the length of a mitochondrion is approximated by the number of voxels in its 3D skeleton. (Right) There is a strong linear correlation between the volume and the length of mitochondria in both volumes, and the slope of the fit reflects the average thickness of the instances. While MitoEM-H has more small instances, MitoEM-R has more large instances with complex morphologies. We sample mitochondria of different lengths along the regression line and find instances with shapes similar to mitochondria-on-a-string (MOAS) in both volumes.
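The length-volume relationship described in the caption can be illustrated with an ordinary least-squares fit. The Python sketch below assumes that each instance's skeleton length and volume (both in voxels) have already been computed, for example with a 3D skeletonization tool. The function name `fit_line` and the sample numbers are hypothetical and are not taken from the dataset.

```python
from typing import Sequence, Tuple


def fit_line(lengths: Sequence[float],
             volumes: Sequence[float]) -> Tuple[float, float, float]:
    """Least-squares fit volume = slope * length + intercept.

    Returns (slope, intercept, pearson_r). The slope is the volume gained
    per skeleton voxel, a proxy for how thick the instances are on average.
    """
    n = len(lengths)
    mean_x = sum(lengths) / n
    mean_y = sum(volumes) / n
    sxx = sum((x - mean_x) ** 2 for x in lengths)
    syy = sum((y - mean_y) ** 2 for y in volumes)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(lengths, volumes))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    pearson_r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, pearson_r


# Made-up measurements for four instances (illustration only).
skeleton_lengths = [10, 20, 30, 40]          # voxels in the 3D skeleton
instance_volumes = [500, 1000, 1500, 2000]   # voxels in the instance mask

slope, intercept, r = fit_line(skeleton_lengths, instance_volumes)
print(slope, intercept, r)
```

A Pearson coefficient close to 1 corresponds to the strong linear correlation described above, and instances can then be sampled along the fitted line at different lengths.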


Updates from the Proceedings Version

  • Sec. 2 "Dataset Acquisition": the human data is from the temporal lobe instead of the frontal lobe. The entire human dataset was introduced in [Shapson-Coe et al. 2021].
  • After the proceedings were published, we did another round of annotation cleaning and improved the training of our 3D model. If you plan to use this dataset, please cite and compare with the following numbers. [Descriptions] [Leaderboard Results]
    Method       MitoEM-H                        MitoEM-R
                 Small   Med     Large   All     Small   Med     Large   All
    U2D-B v2     0.106   0.592   0.563   0.566   0.057   0.450   0.300   0.335
    U3D-BC v2    0.426   0.838   0.798   0.804   0.311   0.845   0.803   0.816

Citation

@inproceedings{wei2020mitoem,
  title={MitoEM Dataset: Large-Scale 3D Mitochondria Instance Segmentation from EM Images},
  author={Wei, Donglai and Lin, Zudi and Franco-Barranco, Daniel and Wendt, Nils and Liu, Xingyu and 
  Yin, Wenjie and Huang, Xin and Gupta, Aarush and Jang, Won-Dong and Wang, Xueying and others},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={66--76},
  year={2020},
  organization={Springer}
}

Acknowledgement

This work has been partially supported by NSF award IIS-1835231 and NIH award 5U54CA225088-03.