skip to main content
research-article

Supporting the Design of Machine Learning Workflows with a Recommendation System

Published:22 February 2016Publication History
Skip Abstract Section

Abstract

Machine learning and data analytics tasks in practice require several consecutive processing steps. RapidMiner is a widely used software tool for the development and execution of such analytics workflows. Unlike many other algorithm toolkits, it comprises a visual editor that allows the user to design processes on a conceptual level. This conceptual and visual approach helps the user to abstract from the technical details during the development phase and to retain a focus on the core modeling task. The large set of preimplemented data analysis and machine learning operations available in the tool, as well as their logical dependencies, can, however, be overwhelming in particular for novice users.

In this work, we present an add-on to the RapidMiner framework that supports the user during the modeling phase by recommending additional operations to insert into the currently developed machine learning workflow. First, we propose different recommendation techniques and evaluate them in an offline setting using a pool of several thousand existing workflows. Second, we present the results of a laboratory study, which show that our tool helps users to significantly increase the efficiency of the modeling process. Finally, we report on analyses using data that were collected during the real-world deployment of the plug-in component and compare the results of the live deployment of the tool with the results obtained through an offline analysis and a replay simulation.

References

  1. Rakesh Agrawal, Tomasz Imieliński, and Arun Swami. 1993. Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data (SIGMOD’93). 207--216. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Szymon Bobek, Mateusz Baran, Krzysztof Kluza, and Grzegorz J. Nalepa. 2013. Application of Bayesian networks to recommendations in business process modeling. In Proceedings of the 2013 Workshop AI Meets Business Processes (AIBP’13). 41--50.Google ScholarGoogle Scholar
  3. Nguyen Ngoc Chan, Walid Gaaloul, and Samir Tata. 2011. Composition context matching for web service recommendation. In Proceedings of the 2011 IEEE International Conference on Services Computing (SCC’11). 624--631. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Nguyen Ngoc Chan, Walid Gaaloul, and Samir Tata. 2012. A recommender system based on historical usage data for web service discovery. Serv. Orient. Comput. Appl. 6, 1 (2012), 51--63. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Nguyen Ngoc Chan, Karn Yongsiriwit, Walid Gaaloul, and Jan Mendling. 2014. Mining event logs to assist the development of executable process variants. In Proceedings 26th International Conference on Advanced Information Systems Engineering. 548--563.Google ScholarGoogle ScholarCross RefCross Ref
  6. Paolo Cremonesi, Yehuda Koren, and Roberto Turrin. 2010. Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the 2010 ACM Conference on Recommender Systems (RecSys’10). 39--46. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Brian D. Davison and Haym Hirsh. 1998. Predicting sequences of user actions. In Proceedings of the AAAI/ICML ’98 Workshop on Predicting the Future: AI Approaches to Time Series Analysis (AAAI’98). 5--12.Google ScholarGoogle Scholar
  8. Remco Dijkman, Marlon Dumas, Boudewijn van Dongen, Reina Käärik, and Jan Mendling. 2011. Similarity of business process models: Metrics and evaluation. Inform. Syst. 36, 2 (2011), 498--516. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Xin Dong, Alon Halevy, Jayant Madhavan, Ema Nemes, and Jun Zhang. 2004. Similarity search for web services. In Proceedings of the 30th International Conference on Very Large Data Bases (VLDB’04). 372--383. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Philippe Fournier-Viger, Usef Faghihi, Roger Nkambou, and Engelbert Mephu Nguifo. 2012. CMRules: Mining sequential rules common to several sequences. Knowledge-Based Syst. 25, 1 (2012), 63--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jiawei Han, Jian Pei, and Yiwen Yin. 2000. Mining frequent patterns without candidate generation. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD’00). 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Thomas Hornung, Agnes Koschmider, and Georg Lausen. 2008. Recommendation based process modeling support: Method and user experience. In Proceedings of the 27th International Conference on Conceptual Modeling (ER’08). 265--278. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ya-Han Hu and Yen-Liang Chen. 2006. Mining association rules with multiple minimum supports: A new mining algorithm and a support tuning mechanism. Decision Support Syst. 42, 1 (2006), 1--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Dietmar Jannach and Simon Fischer. 2014. Recommendation-based modeling support for data mining processes. In Proceedings of the 8th ACM Conference on Recommender Systems (RecSys’14). 334--340. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Dietmar Jannach, Michael Jugovac, and Lukas Lerche. 2015. Adaptive recommendation-based modeling support for data analysis workflows. In Proceedings of the 20th International Conference on Intelligent User Interfaces (IUI’15). 252--262. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Dietmar Jannach, Lukas Lerche, Fatih Gedikli, and Geoffray Bonnin. 2013. What recommenders recommend - An analysis of accuracy, popularity, and sales diversity effects. In Proceedings of the 21st International Conference on User Modeling, Adaptation and Personalization (UMAP 2013). Rome, Italy.Google ScholarGoogle ScholarCross RefCross Ref
  17. Krzysztof Kluza, Mateusz Baran, Szymon Bobek, and Grzegorz J. Nalepa. 2013. Overview of recommendation techniques in business process modeling. In Proceedings of 9th Workshop on Knowledge Engineering and Software Engineering (KESE9). 46--57.Google ScholarGoogle Scholar
  18. Agnes Koschmider, Thomas Hornung, and Andreas Oberweis. 2011. Recommendation-based editor for business process modeling. Data Knowledge Eng. 70, 6 (2011), 483--503. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Henrik Leopold, Jan Mendling, and Hajo A. Reijers. 2011. On the automatic labeling of process models. In Advanced Information Systems Engineering. Lecture Notes in Computer Science, Vol. 6741. 512--520. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ying Li, Bin Cao, Lida Xu, Jianwei Yin, Shuiguang Deng, Yuyu Yin, and Zhaohui Wu. 2014. An efficient recommendation method for improving business process modeling. IEEE Trans. Indust. Inform. 10, 1 (2014), 502--513.Google ScholarGoogle ScholarCross RefCross Ref
  21. Justin Matejka, Wei Li, Tovi Grossman, and George W. Fitzmaurice. 2009. CommunityCommands: Command recommendations for software applications. In Proceedings of the 22th Annual ACM Symposium on User Interface Software and Technology (UIST’09). 193--202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Steffen Mazanek and Mark Minas. 2009. Business process models as a showcase for syntax-based assistance in diagram editors. In Model Driven Engineering Languages and Systems. Lecture Notes in Computer Science, Vol. 5795. 322--336. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Mirjam Minor, Ralph Bergmann, Sebastian Görg, and Kirstin Walter. 2010. Towards case-based adaptation of workflows. In Case-Based Reasoning. Research and Development. Lecture Notes in Computer Science, Vol. 6176. 421--435. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. David Piorkowski, Scott Fleming, Christopher Scaffidi, Christopher Bogart, Margaret Burnett, Bonnie John, Rachel Bellamy, and Calvin Swart. 2012. Reactive information foraging: An empirical investigation of theory-based recommender systems for programmers. In Proceedings of the 2012 Conference on Human Factors in Computing Systems (CHI’12). 1471--1480. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mohamed Sellami, Samir Tata, Zakaria Maamar, and Bruno Defude. 2009. A recommender system for web services discovery in a distributed registry environment. In Proceedings of the 4th International Conference on Internet and Web Applications and Services (ICIW’09). 418--423. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Supporting the Design of Machine Learning Workflows with a Recommendation System

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in

                  Full Access

                  • Published in

                    cover image ACM Transactions on Interactive Intelligent Systems
                    ACM Transactions on Interactive Intelligent Systems  Volume 6, Issue 1
                    Special Issue on New Directions in Eye Gaze for Interactive Intelligent Systems (Part 2 of 2), Regular Articles and Special Issue on Highlights of IUI 2015 (Part 1 of 2)
                    May 2016
                    219 pages
                    ISSN:2160-6455
                    EISSN:2160-6463
                    DOI:10.1145/2896319
                    Issue’s Table of Contents

                    Copyright © 2016 ACM

                    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                    Publisher

                    Association for Computing Machinery

                    New York, NY, United States

                    Publication History

                    • Published: 22 February 2016
                    • Revised: 1 November 2015
                    • Accepted: 1 November 2015
                    • Received: 1 July 2015
                    Published in tiis Volume 6, Issue 1

                    Permissions

                    Request permissions about this article.

                    Request Permissions

                    Check for updates

                    Qualifiers

                    • research-article
                    • Research
                    • Refereed

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader