skip to main content
10.1145/2500863.2500869acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

MFMS: maximal frequent module set mining from multiple human gene expression data sets

Published:11 August 2013Publication History

ABSTRACT

Advances in genomic technologies have allowed vast amounts of gene expression data to be collected. Protein functional annotation and biological module discovery that are based on a single gene expression data suffers from spurious coexpression. Recent work have focused on integrating multiple independent gene expression data sets. In this paper, we propose a two-step approach for mining maximally frequent collection of highly connected modules from coexpression graphs. We first mine maximal frequent edge-sets and then extract highly connected subgraphs from the edge-induced subgraphs. Experimental results on the collection of modules mined from 52 Human gene expression data sets show that coexpression links that occur together in a significant number of experiments have a modular topological structure. Moreover, GO enrichment analysis shows that the proposed approach discovers biologically significant frequent collections of modules.

References

  1. Gary D. Bader and Christopher W. V. Hogu. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics, 4(2), 2003.Google ScholarGoogle Scholar
  2. Imre Derenyi, Gergely Palla, and Tamas Vicsek. Clique percolation in random networks. Phys. Rev. Lett., 94(16):160202, 2005.Google ScholarGoogle Scholar
  3. Audrey P Gasch and Michael B Eisen. Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biology, 3(11):research0059.1--0059.22, 2002.Google ScholarGoogle Scholar
  4. Karam Gouda and Mohammed J. Zaki. GenMax: An efficient algorithm for mining maximal frequent itemsets. Data Mining and Knowledge Discovery: An International Journal, 11 (3):223--242, Nov 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Haiyan Hu, Xifeng Yan, Yu Huang, and Xianghong Jasmine Zhou. Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics, 21 Suppl 1:i213--i221, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Yu Huang, Haifeng Li, Haiyan Hu, Xifeng Yan, Michael S. Waterman, Haiyan Huang, and Xianghong Jasmine Zhou. Systematic discovery of functional modules and context-specific functional annotation of human genome. Bioinformatics, 23(13):i222--i229, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Daxin Jiang and Jian Pei. Mining frequent cross-graph quasi-cliques. ACM Trans. Knowl. Discov. Data, 2(4):16:1--16:42, jan 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Mehmet Koyuturk, Ananth Grama, and Wojciech Szpankowski. An efficient algorithm for detecting frequent subgraphs in biological networks. Bioinformatics, 20(Suppl 1): i200--i207, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Homin K. Lee, Amy K. Hsu, Jon Sajdak, Jie Qin, and Paul Pavlidis. Coexpression analysis of human genes across many microarray data sets. Genome Res., 14(6):1085--1094, 2004.Google ScholarGoogle Scholar
  10. Pierre-Nicolas Mougel, Mark Plantevit, Christophe Rigotti, Olivier Gandrillon, and Jean-Francois Boulicaut. Constraint-based mining of sets of cliques sharing vertex properties. In In: Workshop on Analysis of Complex NEtworks (ACNE 2010) co-located with ECML/PKDD 2010, 2010.Google ScholarGoogle Scholar
  11. Jian Pei, Daxin Jiang, and Aidong Zhang. On mining cross-graph quasi-cliques. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD '05, pages 228--238, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Ahsanur Rahman, Christopher L Poirel, David J Badger, and TM Murali. Reverse engineering molecular hypergraphs. In Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine, pages 68--75. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Xifeng Yan, Xianghong Jasmine Zhou, and Jiawei Han. Mining closed relational graphs with connectivity constraints. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD '05, pages 324--333, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Barry R. Zeeberg, Weimin Feng, Geoffrey Wang, May D. Wang, Anthony T. Fojo, Margot Sunshine, Sudarshan Narasimhan, David W. Kane, William C. Reinhold, Samir Lababidi, and Kimberly. Gominer: A resource for biological interpretation of genomic and proteomic data. Genome Biology, 4(4):R28, 2003.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    BioKDD '13: Proceedings of the 12th International Workshop on Data Mining in Bioinformatics
    August 2013
    64 pages
    ISBN:9781450323277
    DOI:10.1145/2500863
    • General Chairs:
    • Jake Chen,
    • Mohammed Zaki,
    • Program Chairs:
    • Gaurav Pandey,
    • Huzefa Rangwala,
    • George Karypis

    Copyright © 2013 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 11 August 2013

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    BioKDD '13 Paper Acceptance Rate7of16submissions,44%Overall Acceptance Rate7of16submissions,44%

    Upcoming Conference

    KDD '24

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader