skip to main content
10.1145/1622176.1622215acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
research-article

Mining web interactions to automatically create mash-ups

Published: 04 October 2009 Publication History

Abstract

The deep web contains an order of magnitude more information than the surface web, but that information is hidden behind the web forms of a large number of web sites. Metasearch engines can help users explore this information by aggregating results from multiple resources, but previously these could only be created and maintained by programmers. In this paper, we explore the automatic creation of metasearch mash-ups by mining the web interactions of multiple web users to find relations between query forms on different web sites. We also present an implemented system called TX2 that uses those connections to search multiple deep web resources simultaneously and integrate the results in context in a single results page. TX2 illustrates the promise of constructing mash-ups automatically and the potential of mining web interactions to explore deep web resources.

References

[1]
Adar, E., Teevan, J., and Dumais, S. Large scale analysis of web revisitation patterns. In Proc. of the SIGCHI Conf. on Human factors in Comp. Sys. (CHI '09). Boston, Massachusetts, USA, 2009.
[2]
Bigham, J., Cavender, A.C., Kaminsky, R.S., Prince, C.M., and Robison, T.S. Transcendence: Enabling a personal view of the deep web. In Proc. of the 13th Intl. Conf. on Intelligent User Interfaces (IUI '08). Gran Canaria, Spain, 2008.
[3]
Bolin, M., Webber, M., Rha, P., Wilson, T., and Miller, R.C. Automation and customization of rendered web pages. In Proc. of the 18th ACM Symp. on User Interface Soft. and Tech. (UIST '05). Seattle, WA, USA, 2005, 163--172.
[4]
Chang, K. C.-C. and He, B. Toward large scale integration: Building a metaquerier over databases on the web. In Proc. of the 2nd Conf. on Innovative Data Sys. Research. 2005.
[5]
Doan, A., Domingos, P., and Halevy, A.Y. Reconciling schemas of disparate data sources: a machine-learning approach. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of data (SIGMOD '01). 2001, 509--520.
[6]
Dontcheva, M., Drucker, S.M., Wade, G., Salesin, D., and Cohen, M.F. Summarizing personal web browsing sessions. In Proc. of the 19th ACM Symp. on User Interface Soft. and Tech. (UIST '06). New York, NY, USA, 2006, 115--124.
[7]
Faaborg, A. and Lieberman, H. A goal-oriented web browser. In Proc. of the SIGCHI Conf. on Human Factors in Comp. Sys. (CHI '06). Montreal, Quebec, Canada, 2006, 751--760.
[8]
Fujima, J., Lunzer, A., Hornbk, K., and Tanaka, Y. Clip, connect, clone: combining application elements to build custom Interfaces for information access. In Proc. of the 17th ACM Symp. on User Interface Soft. and Tech. (UIST '04). ACM Press, New York, NY, USA, 2004, 175--184.
[9]
Hartmann, B., Wu, L., Collins, K., and Klemmer, S. Programming by a sample: Rapidly prototyping web applications with d.mix. In Proc. of the 20th Symp. on User Interface Soft. and Tech. (UIST '07). Newport, RI, USA, 2007.
[10]
Huynh, D.F., Miller, R.C., and Karger, D. Enabling web browsers to augment web sites' filtering and sorting functionalities. In Proc. of the 19th ACM Symp. on User Interface Soft. and Tech. (UIST '06). ACM Press, New York, NY, USA, 2006, 125--134.
[11]
Jung, H., Allen, J., Chambers, N., Galescu, L., Swift, M., and Taysom,W. One-shot procedure learning from instruction and observation. In Proc. of the Intl. FLAIRS Conf.: Special Track on Natural Language and Knowledge Representation.
[12]
Lin, J., Wong, J., Nichols, J., Cypher, A., and Lau, T.A. Enduser programming of mashups with vegemite. In Proc. of the 13th Intl. Conf. on Intelligent user Interfaces (IUI '09). Sanibel Island, Florida, USA, 2009, 97--106.
[13]
Little, G., Lau, T., Cypher, A., Lin, J., Haber, E.M., and Kandogan, E. Koala: capture, share, automate, personalize business processes on the web. In Proc. of the SIGCHI Conf. on Human factors in Comp. Sys. (CHI 2007). 2007, 943--946.
[14]
Madhavan, J., Halevy, A., Cohen, S., Dong, X., Jeffrey, S.R., Ko, D., and Yu, C. Structured data meets the web: A few observations. IEEE Computer Society: Bulletin of the Technical Committee on Data Engineering, 31, 4 (2006), 10--18.
[15]
Miller, R.C. and Myers, B. Creating dynamic world wide web pages by demonstration (1997).
[16]
Mukherjee, S., Yang, G., Tan, W., and Ramakrishnan, I. Automatic discovery of semantic structures in html documents. In Proc. of the Intl. Conf. on Document Analysis and Recognition (ICDAR '03). 2003.
[17]
Piggy bank. http://simile.mit.edu/piggy-bank/. Accessed April 2009.
[18]
Pilgrim, M., ed. Greasemonkey Hacks: Tips&Tools for Remixing the Web with Firefox. O'Reilly Media, 2005.
[19]
Raghavan, S. and Garcia-Molina, H. Crawling the hidden web. In Proc. of the Twenty-seventh Intl. Conf. on Very Large Databases (VLDB '01). 2001.
[20]
Selberg, E. and Etzioni, O. Multi-service search and comparison using the metacrawler. In Proc. of the 4th Intl. World Wide Web Conf. Darmstadt, Germany, 1995.
[21]
Solvent. http://simile.mit.edu/solvent. Accessed April 2009.
[22]
Toomim, M., Drucker, S.M., Dontcheva, M., Rahimi, A., Thomson, B., and Landay, J.A. Attaching UI enhancements to websites with end users. In Proc. of the ACM Conf. on Human Factors in Comp. Sys. (CHI 2009). Boston, MA, USA, 2009.
[23]
Wong, J. and Hong, J.I. Making mashups with marmite: towards end-user programming for the web. In Proc. of the SIGCHI Conf. on Human factors in Comp. Sys. (CHI '07). San Jose, CA, USA, 2007, 1435--1444.
[24]
Yahoo! pipes. Yahoo! Inc. http://pipes.yahoo.com/. Accessed February 2009.

Cited By

View all
  • (2017)mashpoint: Surfing the web in a data-oriented wayIEEE EUROCON 2017 -17th International Conference on Smart Technologies10.1109/EUROCON.2017.8011076(50-55)Online publication date: Jul-2017
  • (2011)MixerProceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I10.5555/2042053.2042099(426-443)Online publication date: 5-Sep-2011
  • (2011)Services as materialsProceedings of the 2nd international workshop on Research in the large10.1145/2025528.2025532(9-12)Online publication date: 18-Sep-2011
  • Show More Cited By

Index Terms

  1. Mining web interactions to automatically create mash-ups

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    UIST '09: Proceedings of the 22nd annual ACM symposium on User interface software and technology
    October 2009
    278 pages
    ISBN:9781605587455
    DOI:10.1145/1622176
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 October 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep web
    2. mash-ups
    3. meta-search
    4. programming-by-example
    5. web forms

    Qualifiers

    • Research-article

    Conference

    UIST '09

    Acceptance Rates

    Overall Acceptance Rate 561 of 2,567 submissions, 22%

    Upcoming Conference

    UIST '25
    The 38th Annual ACM Symposium on User Interface Software and Technology
    September 28 - October 1, 2025
    Busan , Republic of Korea

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)mashpoint: Surfing the web in a data-oriented wayIEEE EUROCON 2017 -17th International Conference on Smart Technologies10.1109/EUROCON.2017.8011076(50-55)Online publication date: Jul-2017
    • (2011)MixerProceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I10.5555/2042053.2042099(426-443)Online publication date: 5-Sep-2011
    • (2011)Services as materialsProceedings of the 2nd international workshop on Research in the large10.1145/2025528.2025532(9-12)Online publication date: 18-Sep-2011
    • (2011)Mixer: Mixed-Initiative Data Retrieval and Integration by ExampleHuman-Computer Interaction – INTERACT 201110.1007/978-3-642-23774-4_36(426-443)Online publication date: 2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media