skip to main content
research-article

From Annotation to Computer-Aided Diagnosis: Detailed Evaluation of a Medical Multimedia System

Authors Info & Claims
Published:31 May 2017Publication History
Skip Abstract Section

Abstract

Holistic medical multimedia systems covering end-to-end functionality from data collection to aided diagnosis are highly needed, but rare. In many hospitals, the potential value of multimedia data collected through routine examinations is not recognized. Moreover, the availability of the data is limited, as the health care personnel may not have direct access to stored data. However, medical specialists interact with multimedia content daily through their everyday work and have an increasing interest in finding ways to use it to facilitate their work processes. In this article, we present a novel, holistic multimedia system aiming to tackle automatic analysis of video from gastrointestinal (GI) endoscopy. The proposed system comprises the whole pipeline, including data collection, processing, analysis, and visualization. It combines filters using machine learning, image recognition, and extraction of global and local image features. The novelty is primarily in this holistic approach and its real-time performance, where we automate a complete algorithmic GI screening process. We built the system in a modular way to make it easily extendable to analyze various abnormalities, and we made it efficient in order to run in real time. The conducted experimental evaluation proves that the detection and localization accuracy are comparable or even better than existing systems, but it is by far leading in terms of real-time performance and efficient resource consumption.

References

  1. Martın Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A system for large-scale machine learning. Proc. of USENIX OSDI. 265--283. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Zeno Albisser, Michael Riegler, Pål Halvorsen, Jiang Zhou, Carsten Griwodz, IIangko Balasingham, and Cathal Gurrin. 2015. Expert driven semi-supervised elucidation tool for medical endoscopic videos. In Proc. of ACM MMSys. 73--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Luís A. Alexandre, Joao Casteleiro, and Nuno Nobreinst. 2007. Polyp detection in endoscopic video using SVMs. In Proc. of PKDD. 358--365.Google ScholarGoogle ScholarCross RefCross Ref
  4. Stefan Ameling, Stephan Wirth, Dietrich Paulus, Gerard Lacey, and Fernando Vilarino. 2009. Texture-based polyp detection in colonoscopy. In Bildverarbeitung für die Medizin. Springer, 346--350.Google ScholarGoogle Scholar
  5. Hermann Brenner, Matthias Kloor, and Christian Peter Pox. 2016. Colorectal cancer. The Lancet (2016), 1490--1502.Google ScholarGoogle Scholar
  6. Antoni Buades, Bartomeu Coll, and Jean-Michel Morel. 2011. Non-local means denoising. IPOL 1 (2011), 208--212.Google ScholarGoogle ScholarCross RefCross Ref
  7. Da-Chuan Cheng, Wen-Chien Ting, Yung-Fu Chen, Qin Pu, and Xiaoyi Jiang. 2008. Colorectal polyps detection using texture features and support vector machine. In Advances in Mass Data Analysis of Images and Signals in Medicine, Biotechnology, Chemistry and Food Industry. Springer, 62--72. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Christine Chin and David E. Brown. 2000. Learning in science: A comparison of deep and surface approaches. Journal of Research in Science Teaching 37, 2 (2000), 109--138.Google ScholarGoogle ScholarCross RefCross Ref
  9. Francesco Ciompi, Kaman Chung, Sarah J. van Riel, Arnaud Arindra Adiyoso Setio, Paul K. Gerke, Colin Jacobs, Ernst Th. Scholten, Cornelia Schaefer-Prokop, Mathilde M. W. Wille, Alfonso Marchiano, and others. 2016. Towards automatic pulmonary nodule management in lung cancer screening with deep learning. arXiv preprint arXiv:1610.09157 (2016).Google ScholarGoogle Scholar
  10. Yang Cong, Shuai Wang, Ji Liu, Jun Cao, Yunsheng Yang, and Jiebo Luo. 2015. Deep sparse feature selection for computer aided endoscopy diagnosis. Pattern Recognition 48, 3 (2015), 907--917. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Thomas de Lange, Stig Larsen, and Lars Aabakken. 2005. Image documentation of endoscopic findings in ulcerative colitis: Photographs or video clips? Gastrointestinal Endoscopy 61, 6 (2005), 715--720.Google ScholarGoogle ScholarCross RefCross Ref
  12. Ayso H. de Vries, Shandra Bipat, Evelien Dekker, Marjolein H. Liedenbaum, Jasper Florie, Paul Fockens, Roel van der Kraan, Elizabeth M. Mathus-Vliegen, Johannes B. Reitsma, Roel Truyen, and others. 2010. Polyp measurement based on CT colonography and colonoscopy: Variability and systematic differences. European Radiology 20, 6 (2010), 1404--1413.Google ScholarGoogle ScholarCross RefCross Ref
  13. Bradley Efron and Robert Tibshirani. 1997. Improvements on cross-validation: The .632+ bootstrap method. Journal of the American Statistical Association 92, 438 (1997), 548--560.Google ScholarGoogle Scholar
  14. Hugo Jair Escalante, Carlos A. Hérnadez, Luis Enrique Sucar, and Manuel Montes. 2008. Late fusion of heterogeneous methods for multimedia image retrieval. In Proc. of ICMR. 172--179. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Ferlay, E. Steliarova-Foucher, J. Lortet-Tieulent, S. Rosso, J. W. Coebergh, H. Comber, D. Forman, and F. Bray. 2013. Cancer incidence and mortality patterns in Europe: Estimates for 40 countries in 2012. European Journal of Cancer 49, 6 (2013), 1374--1403.Google ScholarGoogle ScholarCross RefCross Ref
  16. Andrew W. Fitzgibbon and Robert B. Fisher. 1995. A buyer’s guide to conic fitting. Proc. of (BMVC). 513--522. http://dl.acm.org/citation.cfm?id=243124.243148. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. The Apache Software Foundation. 2013. Apache Lucene - Index File Formats. Retrieved from https://lucene.apache.org/.Google ScholarGoogle Scholar
  18. B. Giritharan, Xiaohui Yuan, Jianguo Liu, B. Buckles, JungHwan Oh, and Shou Jiang Tang. 2008. Bleeding detection from capsule endoscopy videos. In Proc. of EMBS.Google ScholarGoogle ScholarCross RefCross Ref
  19. O. Holme, M. Bretthauer, A. Fretheim, J. Odgaard-Jensen, and G. Hoff. 2013. Flexible sigmoidoscopy versus faecal occult blood testing for colorectal cancer screening in asymptomatic individuals. The Cochrane Library.Google ScholarGoogle Scholar
  20. Sae Hwang, JungHwan Oh, W. Tavanapong, J. Wong, and P. C. de Groen. 2007. Polyp detection in colonoscopy video using elliptical shape feature. In Proc. of ICIP. 465--468.Google ScholarGoogle ScholarCross RefCross Ref
  21. H. Inoue, H. Kashida, S. Kudo, M. Sasako, T. Shimoda, H. Watanabe, S. Yoshida, M. Guelrud, C. J. Lightdale, K. Wang, and others. 2003. The Paris endoscopic classification of superficial neoplastic lesions: Esophagus, stomach, and colon: November 30 to December 1, 2002. Gastrointest Endosc 58, 6 Suppl (2003), S3--43.Google ScholarGoogle Scholar
  22. Menglin Jiang, Shaoting Zhang, Hongsheng Li, and Dimitris N. Metaxas. 2015. Computer-aided diagnosis of mammographic masses using scalable image retrieval. IEEE Transactions on Biomedical Engineering 62, 2 (2015), 783--792.Google ScholarGoogle ScholarCross RefCross Ref
  23. M. F. Kaminski, J. Regula, E. Kraszewska, M. Polkowski, U. Wojciechowska, J. Didkowska, M. Zwierko, M. Rupinski, M. P. Nowacki, and E. Butruk. 2010. Quality indicators for colonoscopy and the risk of interval cancer. New England Journal of Medicine 362, 19 (2010), 1795--1803.Google ScholarGoogle ScholarCross RefCross Ref
  24. J. Kang and R. Doraiswami. 2003. Real-time image processing system for endoscopic applications. In Proc. of IEEE CCECE.Google ScholarGoogle Scholar
  25. A. Khaleghi and I. Balasingham. 2015. Wireless communication link for capsule endoscope at 600 MHz. In Proc. of IEEE EMBC. 4081--4084.Google ScholarGoogle Scholar
  26. Donald Ervin Knuth. 1998. The Art of Computer Programming: Sorting and Searching. Vol. 3. Pearson Education.Google ScholarGoogle Scholar
  27. S. Kudo, S. Hirota, T. Nakajima, S. Hosobe, H. Kusaka, T. Kobayashi, M. Himori, and A. Yagyuu. 1994. Colorectal tumours and pit pattern. Journal of Clinical Pathology 47, 10 (1994), 880--885.Google ScholarGoogle ScholarCross RefCross Ref
  28. Baopu Li and M. Q.-H. Meng. 2012. Tumor recognition in wireless capsule endoscopy images using textural features and SVM-based feature selection. IEEE Transactions on Information Technology in Biomedicine 16, 3 (2012), 323--329. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Baopu Li and Max Q. H. Meng. 2009. Computer-based detection of bleeding and ulcer in wireless capsule endoscopy images by chromaticity moments. CBM 39, 2 (2009), 141--147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Xirong Li, Cees G. M. Snoek, and Marcel Worring. 2010. Unsupervised multi-feature tag relevance learning for social image retrieval. In Proc. of ACM ICMR. 10--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Danyu Liu, Yu Cao, Kihwan Kim, Sean Stanek, Bancha Doungratanaex-Chai, Kungen Lin, Wallapak Tavanapong, Johnny S. Wong, Jung-Hwan Oh, and Piet C. de Groen. 2007. Arthemis: Annotation software in an integrated capturing and analysis system for colonoscopy. Computer Methods and Programs in Biomedicine 88, 2 (2007), 152--163. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Mathias Lux. 2013. LIRE: Open source image retrieval in Java. In Proc. of the 21st ACM MM. ACM, 843--846. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Mathias Lux and Michael Riegler. 2013. Annotation of endoscopic videos on mobile devices: A bottom-up approach. In Proc. of ACM MMSys’13. ACM, 141--145. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Shawn Mallery and Jacques Van Dam. 2000. Advances in diagnostic and therapeutic endoscopy. Medical Clinics of North America 84, 5 (2000), 1059--1083.Google ScholarGoogle ScholarCross RefCross Ref
  35. A. V. Mamonov, I. N. Figueiredo, P. N. Figueiredo, and Y.-H. R. Tsai. 2014. Automated polyp detection in colon capsule endoscopy. IEEE Transactions on Medical Imaging 33, 7 (2014), 1488--1502.Google ScholarGoogle ScholarCross RefCross Ref
  36. Ruwan Nawarathna, JungHwan Oh, Jayantha Muthukudage, Wallapak Tavanapong, Johnny Wong, Piet C. De Groen, and Shou Jiang Tang. 2014. Abnormal image detection in endoscopy videos using a filter bank and local binary patterns. NC 144 (2014), 70--91. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Anh Nguyen, Jason Yosinski, and Jeff Clune. 2014. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. arXiv preprint arXiv:1412.1897 (2014).Google ScholarGoogle Scholar
  38. S. Oba, S. Tanaka, Y. Sano, S. Oka, and K. Chayama. 2011. Current status of narrow-band imaging magnifying colonoscopy for colorectal neoplasia in Japan. Digestion 83, 3 (2011), 167--172.Google ScholarGoogle ScholarCross RefCross Ref
  39. Sungheon Park, Myunggi Lee, and Nojun Kwak. 2015. Polyp detection in colonoscopy videos using deeply-learned hierarchical features. Proc. of (ISBI).Google ScholarGoogle Scholar
  40. Konstantin Pogorelov, Sigrun Losada, Carsten Griwodz, Thomas de Lange, Kristin Ranheim Randel, Duc Tien Dang Nguyen, Håkon Kvale Stensland, Francesco De Natale, Dag Johansen, Michael Riegler, and Pål Halvorsen. 2017. A holistic multimedia system for gastrointestinal tract disease detection. In Proc. of MMSys. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2015. You only look once: Unified, real-time object detection. arXiv preprint arXiv:1506.02640 (2015).Google ScholarGoogle Scholar
  42. Michael Riegler, Martha Larson, Mathias Lux, and Christoph Kofler. 2014. How ‘how’ reflects what’s what: Content-based exploitation of how users frame social images. In Proc. of ACM MM. 397--406. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Michael Riegler, Mathias Lux, Vincent Charvillat, Axel Carlier, Raynor Vliegendhart, and Martha Larson. 2014b. VideoJot: A multifunctional video annotation tool. In Proc. of ACM ICMR. 534--537. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Michael Riegler, Mathias Lux, Carsten Griwodz, Concetto Spampinato, Thomas de Lange, Sigrun L. Eskeland, Konstantin Pogorelov, Wallapak Tavanapong, Peter T. Schmidt, Cathal Gurrin, Dag Johansen, Håvard Johansen, and Pål Halvorsen. 2016. Multimedia and medicine: Teammates for better disease detection and survival. In Proc. of ACM MM. 968--977. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Michael Riegler, Konstantin Pogorelov, Pål Halvorsen, Thomas de Lange, Carsten Griwodz, Peter Thelin Schmidt, Sigrun Losada Eskeland, and Dag Johansen. 2016. EIR - efficient computer aided diagnosis framework for gastrointestinal endoscopies. In Proc. of CBMI.Google ScholarGoogle ScholarCross RefCross Ref
  46. Jürgen Schmidhuber. 2015. Deep learning in neural networks: An overview. Neural Networks 61 (2015), 85--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Joe V. Selby, Gary D. Friedman, Charles P. Quesenberry Jr, and Noel S. Weiss. 1992. A case--control study of screening sigmoidoscopy and mortality from colorectal cancer. New England Journal of Medicine 326, 10 (1992), 653--657.Google ScholarGoogle ScholarCross RefCross Ref
  48. Theodoros Semertzidis, Dimitrios Rafailidis, Eleftherios Tiakas, Michael G. Strintzis, and Petros Daras. 2013. Multimedia indexing, search, and retrieval in large databases of social networks. In Social Media Retrieval. Springer, 43--63.Google ScholarGoogle Scholar
  49. Cees G. M. Snoek, Marcel Worring, and Arnold W. M. Smeulders. 2005. Early versus late fusion in semantic video analysis. In Proc. of ACM MM. 399--402. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Jingkuan Song. 2013. Effective hashing for large-scale multimedia search. In Proc. of Sigmod/PODS PhD Symp. 55--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Donald F. Specht. 1990. Probabilistic neural networks. Neural Networks 3, 1 (1990), 109--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Nima Tajbakhsh, Suryakanth Gurudu, and Jianming Liang. 2016. Automated polyp detection in colonoscopy videos using shape and context information. IEEE Transactions on Medical Imaging 35, 2 (Feb. 2016), 630--644.Google ScholarGoogle ScholarCross RefCross Ref
  53. Nima Tajbakhsh, Suryakanth R. Gurudu, and Jianming Liang. 2015. Automatic polyp detection in colonoscopy videos using an ensemble of convolutional neural networks. In Proc. of IEEE ISBI.Google ScholarGoogle ScholarCross RefCross Ref
  54. Kyosuke Tanaka, Carlos A. Rubio, Aldona Dlugosz, Kotryna Truskaite, Ragnar Befrits, Greger Lindberg, and Peter T. Schmidt. 2013. Narrow-band imaging magnifying endoscopy in adult patients with eosinophilic esophagitis/esophageal eosinophilia and lymphocytic esophagitis. Gastrointestinal Endoscopy 78, 4 (2013), 659--664.Google ScholarGoogle ScholarCross RefCross Ref
  55. The New York Times. 2013. The $2.7 Trillion Medical Bill. Retrieved from http://goo.gl/CuFyFJ.Google ScholarGoogle Scholar
  56. L. von Karsa, J. Patnick, and N. Segnan. 2012. European guidelines for quality assurance in colorectal cancer screening and diagnosis. First edition--executive summary. Endoscopy 44 Suppl 3 (2012), SE1--8.Google ScholarGoogle Scholar
  57. Dayong Wang, Aditya Khosla, Rishab Gargeya, Humayun Irshad, and Andrew H. Beck. 2016b. Deep learning for identifying metastatic breast cancer. arXiv preprint arXiv:1606.05718 (2016).Google ScholarGoogle Scholar
  58. Shuai Wang, Yang Cong, Huijie Fan, Lianqing Liu, Xiaoqiu Li, Yunsheng Yang, Yandong Tang, Huaici Zhao, and Haibin Yu. 2016. Computer-aided endoscopic diagnosis without human-specific labeling. Transactions on BME 63, 11 (2016).Google ScholarGoogle Scholar
  59. Yi Wang, Wallapak Tavanapong, Johnny Wong, JungHwan Oh, and Piet C. de Groen. 2011. Computer-aided detection of retroflexion in colonoscopy. In Proc. of IEEE CBMS. 1--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Yi Wang, Wallapak Tavanapong, Johnny Wong, JungHwan Oh, and Piet C. de Groen. 2013. Near real-time retroflexion detection in colonoscopy. IEEE Journal of Biomedical and Health Informatics 17, 1 (2013), 143--152.Google ScholarGoogle ScholarCross RefCross Ref
  61. Yi Wang, Wallapak Tavanapong, Johnson Wong, JungHwan Oh, and Piet C. de Groen. 2014. Part-based multiderivative edge cross-sectional profiles for polyp detection in colonoscopy. Journal of BMHI 18, 4 (2014), 1379--1389.Google ScholarGoogle Scholar
  62. Yi Wang, Wallapak Tavanapong, Johnny Wong, Jung Hwan Oh, and Piet C. de Groen. 2015. Polyp-alert: Near real-time feedback during colonoscopy. Computer Methods and Programs in Biomedicine 120, 3 (2015), 164--179. Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Yi Wang, Wallapak Tavanapong, Johnny S. Wong, JungHwan Oh, and Piet C. de Groen. 2010. Detection of quality visualization of appendiceal orifices using local edge cross-section profile features and near pause detection. IEEE Transactions on Biomedical Engineering 57, 3 (2010), 685--695.Google ScholarGoogle ScholarCross RefCross Ref
  64. World Health Organization - International Agency for Research on Cancer. 2012. Estimated Cancer Incidence, Mortality and Prevalence Worldwide in 2012. Retrieved from http://globocan.iarc.fr/Pages/fact_sheets_popula tion.aspx.Google ScholarGoogle Scholar
  65. Mingda Zhou, Guanqun Bao, Yishuang Geng, B. Alkandari, and Xiaoxi Li. 2014. Polyp detection and radius measurement in small intestine using video capsule endoscopy. In Proc. of BMEI. 237--241.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. From Annotation to Computer-Aided Diagnosis: Detailed Evaluation of a Medical Multimedia System

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 13, Issue 3
      August 2017
      233 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3104033
      Issue’s Table of Contents

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 31 May 2017
      • Accepted: 1 April 2017
      • Revised: 1 March 2017
      • Received: 1 March 2016
      Published in tomm Volume 13, Issue 3

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader