DOI: 10.1145/1145581.1145609
Article

Chronica: a temporal web search engine

Published: 11 July 2006

Abstract

Search engines regularly crawl the web, taking vast snapshots of site content. Because previous crawls are not archived, however, search results pertain only to a single, recent instant in time. Search engine users are unable to request pages discussing UK politics in 2001, for example. The Internet Archive, an organization dedicated to maintaining such snapshots of the Internet, provides access to many previous web crawls but lacks a search facility. Users of the "Way Back Machine" must provide a specific URL for which they want a list of snapshots organized by date. This short paper describes Chronica, a temporal search engine that indexes Internet Archive crawl data in order to provide search results spanning user-specified time ranges. Chronica can generate graphs showing query result hit counts across a given time span, and even side-by-side comparisons of different query results. These graphs can be used, among other things, to track a term's popularity over time for marketing or academic research purposes.
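
The abstract does not describe Chronica's internal data structures, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a toy inverted index in which every posting carries the crawl date of the snapshot it came from, so that a query can be restricted to a user-specified time range and matches can be counted per year (the kind of data a hit-count graph would plot). The class name `TemporalIndex`, its methods, and the example URLs are all hypothetical.

```python
from collections import defaultdict
from datetime import date


class TemporalIndex:
    """Toy inverted index whose postings carry the snapshot's crawl date.

    Hypothetical illustration only; not Chronica's actual design.
    """

    def __init__(self):
        # term -> list of (crawl_date, url) postings
        self._postings = defaultdict(list)

    def add_snapshot(self, url, crawl_date, text):
        """Index one archived snapshot of a page."""
        for term in set(text.lower().split()):
            self._postings[term].append((crawl_date, url))

    def search(self, query, start, end):
        """Return sorted (crawl_date, url) snapshots in [start, end]
        that contain every term of the query."""
        terms = query.lower().split()
        if not terms:
            return []
        result = None
        for term in terms:
            hits = {p for p in self._postings.get(term, [])
                    if start <= p[0] <= end}
            result = hits if result is None else result & hits
        return sorted(result)

    def hits_per_year(self, query, start, end):
        """Count matching snapshots per calendar year."""
        counts = defaultdict(int)
        for crawl_date, _url in self.search(query, start, end):
            counts[crawl_date.year] += 1
        return dict(sorted(counts.items()))


# Made-up snapshots, purely for illustration.
index = TemporalIndex()
index.add_snapshot("http://example.org/a", date(2001, 6, 1),
                   "UK politics election coverage")
index.add_snapshot("http://example.org/a", date(2003, 6, 1),
                   "UK sports coverage")
index.add_snapshot("http://example.org/b", date(2001, 9, 1),
                   "politics in the UK")

# Pages discussing UK politics in 2001.
print(index.search("uk politics", date(2001, 1, 1), date(2001, 12, 31)))
# Hit counts per year for one term across a wider span.
print(index.hits_per_year("uk", date(2000, 1, 1), date(2005, 12, 31)))
```

Calling `hits_per_year` once per query over the same time span would give the series needed for a side-by-side comparison of two queries. A real system would also need tokenization, ranking, and a far more compact index, none of which this sketch attempts.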

References

[1] Sheahan, Ryan. Improving Query Retrieval Times in the Temporal Search Engine. Masters Thesis, University of Kansas, 2003.
[2] Temporal Search Engine website. http://www.ittc.ku.edu/temporal/
[3] Heritrix. http://crawler.archive.org
[4] Parr, Terence. Enforcing Strict Model-View Separation in Template Engines. In WWW2004 Conference Proceedings, p. 224, May 17-20, 2004, New York City.

    Published In

    ICWE '06: Proceedings of the 6th international conference on Web engineering
    July 2006
    384 pages
    ISBN:1595933522
    DOI:10.1145/1145581
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. crawling
    2. indexing
    3. search
    4. search engine
    5. temporal search

    Cited By

    • (2015) Modified pagerank for concept based search. Journal of Web Engineering 14:5-6 (503-524). DOI: 10.5555/2871274.2871281. Online publication date: 1-Nov-2015
    • (2012) Handling temporal information in web search engines. ACM SIGMOD Record 41:3 (15-23). DOI: 10.1145/2380776.2380780. Online publication date: 5-Oct-2012
    • (2011) Hybrid index structures for temporal-textual web search. Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications (271-277). DOI: 10.5555/1996794.1996829. Online publication date: 18-Apr-2011
    • (2011) Hybrid Index Structures for Temporal-Textual Web Search. Web Technologies and Applications (271-277). DOI: 10.1007/978-3-642-20291-9_28. Online publication date: 2011
    • (2010) NTLM. Proceedings of the 2010 international conference on Web information systems engineering (156-170). DOI: 10.5555/2044492.2044509. Online publication date: 12-Dec-2010
    • (2010) Exploiting time-based synonyms in searching document archives. Proceedings of the 10th annual joint conference on Digital libraries (79-88). DOI: 10.1145/1816123.1816135. Online publication date: 21-Jun-2010
    • (2010) BT+-tree: A New Index for Temporal Information in Web Pages. Database Theory and Application, Bio-Science and Bio-Technology (68-78). DOI: 10.1007/978-3-642-17622-7_8. Online publication date: 2010
    • (2008) Representing Spatiotemporal Information for Web Pages. Proceedings of the 2008 Fourth International Conference on Networked Computing and Advanced Information Management - Volume 02 (621-624). DOI: 10.1109/NCM.2008.34. Online publication date: 2-Sep-2008
