skip to main content
10.1145/2187980.2188213acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
tutorial

Full-text search in email archives using social evaluation, attached and linked resources

Published:16 April 2012Publication History

ABSTRACT

Emails are important tools for communication and cooperation, they contain large amount of information and connections to knowledge and data sources. Because of this, it is very important to improve the efficiency of their processing. This paper describes an email search system which integrates full-text search with social search while processing also the attached and linked resources. The project described in this paper is still in progress. Due to this fact, some proposed parts of the system are not implemented and also not proven yet. The proposed equation for determining the social importance of an email has also to be tuned during the last phases of the development and the evaluation phase. The already implemented part of the system includes content extraction from the email messages, attached and linked resources and also the textual search and social relation extraction is implemented. The next phase of the development includes tuning of the social evaluation and it's integration with textual search.

References

  1. Jeffrey Jones, Gallup: Almost All E-Mail Users Say Internet, E-Mail Have Made Lives Better, http://www.gallup.com/poll/4711/Almost-All-EMail-Users-Say-Internet-EMailMade-Lives-Better.aspx, 2001Google ScholarGoogle Scholar
  2. Jiangong Zhang, Torsten Suel: Efficient Search in Large Textual Collections with Redundancy. WWW 2007 (Banff, Alberta, Canada, 2007) Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Karp-Rabin algorithm, Available at: http://www-igm.univ-mlv.fr/~lecroq/string/node5.html (2011)Google ScholarGoogle Scholar
  4. Saul Schleimer, Daniel S. Wilkerson, Alex Aiken: Winnowing: Local Algorithms for Document Fingerprinting. SIGMOD 2003 (San Diego, CA, 2003) Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Lampert, A., Dale, R., Paris, C.: Segmenting Email Message Text into Zones. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (Singapore, 2009) Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Henry Tirri, Jukka Perkiö, Ville Tuulos, Wray Buntine: Multi-Faced Information Retrieval System for Large Scale Email Archives. Proceedings of the 2005 IEEE/WICI/ACM International Conference on Web Intelligence (2005) Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Laclavík, M. 'eleng, M. Ciglan, M. Hluchý, L.: Ontea: Platform for Pattern Based Automated Semantic Annotation. Computing and Informatics, Vol. 28, 2009, pp. 555--579.Google ScholarGoogle Scholar
  8. Shinjae Yoo, Yiming Yang, Frank Lin, Il-Chul Moon: Mining Social Networks for Personalized Email Prioritization. KDD'09 (Paris, France, 2009) Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Apache Lucene: Overview. Available at: http://lucene.apache.org/java/docs/index.html. 2011.Google ScholarGoogle Scholar
  10. Sqlite3. Available at: http://www.sqlite.org. 2011.Google ScholarGoogle Scholar
  11. Vitor R. Carvalho, William W. Cohen: Learning to Extract Signature and Reply Lines from Email. CEAS-2004 (Conference on Email and Anti-Spam), Mountain View, CA, July 2004Google ScholarGoogle Scholar
  12. Shinjae Yoo , Yiming Yang , Frank Lin , Il-Chul Moon, Mining social networks for personalized email prioritization, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Monica Cahill McJunkin, Precision and recall in title keyword searches, Information Technology and Libraries, v.14 n.3, p.161--171, Sept. 1995 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Full-text search in email archives using social evaluation, attached and linked resources

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
          April 2012
          1250 pages
          ISBN:9781450312301
          DOI:10.1145/2187980

          Copyright © 2012 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 16 April 2012

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • tutorial

          Acceptance Rates

          Overall Acceptance Rate1,899of8,196submissions,23%
        • Article Metrics

          • Downloads (Last 12 months)2
          • Downloads (Last 6 weeks)1

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader