skip to main content
10.1145/3109859.3109895acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
short-paper

Chemical Reactant Recommendation Using a Network of Organic Chemistry

Published:27 August 2017Publication History

ABSTRACT

This paper focuses on the overall task of recommending to the chemist candidate molecules (reactants) necessary to synthesize a given target molecule (product), which is a novel application as well as an important step for the chemist to find a synthesis route to generate the product. We formulate this task as a link-prediction problem over a so-called Network of Organic Chemistry (NOC) that we have constructed from 8 million chemical reactions described in the US patent literature between 1976 and 2013. We leverage state-of-the-art factorization algorithms for recommender systems to solve this task. Our empirical evaluation demonstrates that Factorization Machines, trained with chemistry-specific knowledge, outperforms current methods based on similarity of chemical structures.

References

  1. Kyle J. M. Bishop, Rafal Klajn, and Bartosz A. Grzybowski. 2006. The core and most useful molecules in organic chemistry. Angewandte Chemie - International Edition 45, 32 (2006), 5348--5354.Google ScholarGoogle ScholarCross RefCross Ref
  2. E. J. Corey and W. T. Wipke. 1969. Computer-assisted design of complex organic syntheses. Science 166, 3902 (October 1969), 178--92.Google ScholarGoogle ScholarCross RefCross Ref
  3. Joseph L. Durant, Burton A. Leland, Douglas R. Henry, and James G. Nourse. 2002. Reoptimization of MDL Keys for Use in Drug Discovery. Journal of Chemical Information and Modeling 42, 6 (November 2002), 1273--1280.Google ScholarGoogle Scholar
  4. David K. Duvenaud, Dougal Maclaurin, Jorge Aguilera-Iparraguirre, Rafael Gómez-Bombarell, Timothy Hirzel, Alán Aspuru-Guzik, and Ryan P. Adams. 2015. Convolutional Networks on Graphs for Learning Molecular Fingerprints. In Proceeedings of the 28th Annual Conference on Neural Information Processing Systems. 2224--2232. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Kerry Dwan, Douglas G. Altman, Juan A. Arnaiz, Jill Bloom, An-Wen Chan, Eugenia Cronin, Evelyne Decullier, Philippa J. Easterbrook, Erik Von Elm, Carrol Gamble, Davina Ghersi, John P. A. Ioannidis, John Simes, and Paula R. Williamson. 2008. Systematic review of the empirical evidence of study publication bias and outcome reporting bias. PloS one 3, 8 (January 2008), e3081.Google ScholarGoogle ScholarCross RefCross Ref
  6. Zeno Gantner, Lucas Drumond, Christoph Freudenthaler, Steffen Rendle, and Lars Schmidt-Thieme. 2010. Learning attribute-to-feature mappings for cold-start recommendations. In Proceedings of the 10th IEEE International Conference on Data Mining. 176--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Zeno Gantner, Steffen Rendle, Christoph Freudenthaler, and Lars Schmidt-Thieme. 2011. MyMediaLite: A Free Recommender System Library. In 5th ACM International Conference on Recommender Systems (RecSys 2011). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Chris M. Gothard, Siowling Soh, Nosheen A. Gothard, Bartlomiej Kowalczyk, YanhuWei, Bilge Baytekin, and Bartosz A. Grzybowski. 2012. Rewiring Chemistry: Algorithmic Discovery and Experimental Validation of One-Pot Reactions in the Network of Organic Chemistry. Angewandte Chemie International Edition 51, 32 (2012), 7922--7927.Google ScholarGoogle ScholarCross RefCross Ref
  9. Bartosz A. Grzybowski, Kyle J. M. Bishop, Bartlomiej Kowalczyk, and Christopher E. Wilmer. 2009. The "wired" universe of organic chemistry. Nature Chemistry 1, 1 (April 2009), 31--6.Google ScholarGoogle ScholarCross RefCross Ref
  10. Abraham Heifets and Igor Jurisica. 2012. Construction of New Medicines via Game Proof Search. In Proceedings of the 26th AAAI Conference on Artificial Intelligence, Vol. 2. 1564--1570. http://www.scopus.com/inward/record.url?eid=2-s2.0-84868287208 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Matthew A. Kayala and Pierre Baldi. 2011. A Machine Learning Approach to Predict Chemical Reactions. In Proceeding of the 25th Annual Conference on Advances in Neural Information Processing Systems. 747--755. http://papers.nips.cc/paper/4356-a-machine-learning-approach-to-predict-chemical-reactions.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix Factorization Techniques for Recommender Systems. IEEE Computer 42, 8 (2009), 30--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Yehuda Koren and Robert M. Bell. 2015. Recommender Systems Handbook. Springer, Chapter Advances in Collaborative Filtering, 77--118.Google ScholarGoogle Scholar
  14. Mikolaj Kowalik, Chris M. Gothard, Aaron M. Drews, Nosheen A. Gothard, Alex Weckiewicz, Patrick E. Fuller, Bartosz A. Grzybowski, and Kyle J. M. Bishop. 2012. Parallel optimization of synthetic pathways within the network of organic chemistry. Angewandte Chemie (International ed. in English) 51, 32 (August 2012), 7928--32.Google ScholarGoogle Scholar
  15. James Law, Zsolt Zsoldos, Aniko Simon, Darryl Reid, Yang Liu, Sing Yoong Khew, A. Peter Johnson, Sarah Major, Robert A.Wade, and Howard Y. Ando. 2009. Route Designer: A Retrosynthetic Analysis Tool Utilizing Automated Retrosynthetic Rule Generation. Journal of Chemical Information and Modeling 49, 3 (2009), 593--602.Google ScholarGoogle ScholarCross RefCross Ref
  16. Daniel M. Lowe. 2012. Extraction of Chemical Structures and Reactions from the Literature. Ph.D. Dissertation. University of Cambridge.Google ScholarGoogle Scholar
  17. Aditya Krishna Menon and Charles Elkan. 2011. Link prediction via matrix factorization. Machine Learning and Knowledge Discovery in Databases 6912 (2011), 437--452. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Noel M. O'Boyle, Michael Banck, Craig A. James, Chris Morley, Tim Vandermeersch, and Geoffrey R. Hutchison. 2011. Open Babel: An open chemical toolbox. Journal of Cheminformatics 3, 1 (January 2011), 33.Google ScholarGoogle ScholarCross RefCross Ref
  19. Steffen Rendle. 2010. Factorization Machines. In Proceedings of the 10th IEEE International Conference on Data Mining. 995--1000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Steffen Rendle. 2012. Factorization Machine with libFM. ACM Transactions on Intelligent Systems and Technology 3, 3 (2012), 1--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence, Vol. cs.LG. 452--461. arXiv:1205.2618 http://dl.acm.org/citation.cfm?id=1795167 Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Nadine Schneider, Daniel M. Lowe, Roger A. Sayle, and Gregory A. Landrum. 2015. Development of a novel fingerprint for chemical reactions and its application to large-scale reaction classification and similarity. Journal of Chemical Information and Modeling 55, 1 (2015), 39--53.Google ScholarGoogle ScholarCross RefCross Ref
  23. Marwin H. S. Segler and Mark P. Waller. 2017. Modelling Chemical Reasoning to Predict and Invent Reactions. Chemistry - A European Journal 1521, 3765 (2017).Google ScholarGoogle Scholar
  24. Marwin H. S. Segler and Mark P.Waller. 2017. Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. Chemistry - A European Journal 1521, 3765 (2017).Google ScholarGoogle Scholar
  25. Vladimir N. Vapnik. 1998. Statisical Learning Theory. Wiley.Google ScholarGoogle Scholar
  26. Peter Willett, John M. Barnard, and Geoffrey M. Downs. 1998. Chemical Similarity Searching. Journal of Chemical Information and Modeling 38, 6 (November 1998), 983--996.Google ScholarGoogle Scholar
  27. Yoshihiro Yamanishi, Jean-Philippe Vert, and Minoru Kanehisa. 2005. Supervised enzyme network inference from the integration of genomic data and chemical information. Bioinromatics 21 (2005), 468--477. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Chemical Reactant Recommendation Using a Network of Organic Chemistry

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      RecSys '17: Proceedings of the Eleventh ACM Conference on Recommender Systems
      August 2017
      466 pages
      ISBN:9781450346528
      DOI:10.1145/3109859

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 August 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      RecSys '17 Paper Acceptance Rate26of125submissions,21%Overall Acceptance Rate254of1,295submissions,20%

      Upcoming Conference

      RecSys '24
      18th ACM Conference on Recommender Systems
      October 14 - 18, 2024
      Bari , Italy

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader