skip to main content
10.1145/2479787.2479795acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

Semantics-based news recommendation with SF-IDF+

Authors Info & Claims
Published:12 June 2013Publication History

ABSTRACT

Content-based news recommendations are usually made by employing the cosine similarity and the TF-IDF weighting scheme for terms occurring in news messages and user profiles. Recent developments, such as SF-IDF, have elevated news recommendation to a new level of abstraction by additionally taking into account term meaning through the exploitation of synsets from semantic lexicons and the cosine similarity. Other state-of-the-art semantic recommenders, like SS, make use of semantic lexicon-driven similarities. A shortcoming of current semantic recommenders is that they do not take into account the various semantic relationships between synsets, providing only for a limited understanding of news semantics. Therefore, we extend the SF-IDF weighting technique by additionally considering the synset semantic relationships from a semantic lexicon. The proposed recommendation method, SF-IDF+, as well as SF-IDF and several semantic similarity lexicon-driven methods have been implemented in Ceryx, an extension to the Hermes news personalization service. An evaluation on a data set containing financial news messages shows that overall (by accounting for all considered cut-off values) SF-IDF+ outperforms TF-IDF, SS, and SF-IDF in terms of F1-scores.

References

  1. Banerjee, S., Pedersen, T.: An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet. In: Gelbukh, A. F. (ed.) 4th International Conference on Computational Linguistics and Intelligent Text Processing (CICLING 2002). pp. 136--145. Springer-Verlag (2002) Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bharat, K., Kamba, T., Albers, M.: Personalized, Interactive News on the Web. Multimedia Systems 6(5), 349--358 (1998) Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Billsus, D., Pazzani, M. J.: A Personal News Agent that Talks, Learns and Explains. In: Etzioni, O., Müller, J. P., Bradshaw, J. M. (eds.) 3rd Annual Conference on Autonomous Agents (AGENTS 1999). pp. 268--275. ACM (1999) Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Borsje, J., Levering, L., Frasincar, F.: Hermes: a Semantic Web-Based News Decision Support System. In: 23rd Annual ACM Symposium on Applied Computing (SAC 2008). pp. 2415--2420. ACM (2008) Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cantador, I., Bellogín, A., Castells, P.: Ontology-Based Personalised and Context-Aware Recommendations of News Items. In: 2008 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2008). pp. 562--565. IEEE Computer Society (2008) Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Capelle, M., Moerland, M., Frasincar, F., Hogenboom, F.: Semantics-Based News Recommendation. In: Akerkar, R., Bădică, C., Dan Burdescu, D. (eds.) 2nd International Conference on Web Intelligence, Mining and Semantics (WIMS 2012). ACM (2012) Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Carreira, R., Crato, J. M., Gonçalves, D., Jorge, J. A.: Evaluating Adaptive User Profiles for News Classification. In: 9th International Conference on Intelligent User Interfaces (IUI 2004). pp. 206--212. ACM (2004) Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: 40th Anniversary Meeting of the Association for Computational Linguistics (ACL 2002). pp. 168--175. Association for Computational Linguistics (2002)Google ScholarGoogle Scholar
  9. Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press (1998)Google ScholarGoogle Scholar
  10. Frasincar, F., Borsje, J., Levering, L.: A Semantic Web-Based Approach for Building Personalized News Services. International Journal of E-Business Research 5(3), 35--53 (2009)Google ScholarGoogle Scholar
  11. Frasincar, F., IJntema, W., Goossen, F., Hogenboom, F.: Business Intelligence Applications and the Web: Models, Systems and Technologies, chap. A Semantic Approach for News Recommendation, pp. 102--121. IGI Global (2011)Google ScholarGoogle Scholar
  12. Getahun, F., Tekli, J., Chbeir, R., Viviani, M., Yetongnon, K.: Relating RSS News/Items. In: Gaedke, M., Grossniklaus, M., Díaz, O. (eds.) 9th International Conference on Web Engineering (ICWE 2009). pp. 442--452. Springer-Verlag (2009) Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. IJntema, W., Goossen, F., Frasincar, F., Hogenboom, F.: Ontology-Based News Recommendation. In: Daniel, F., Delcambre, L. M. L., Fotouhi, F., Garrigós, I., Guerrini, G., Mazón, J. N., Mesiti, M., Müller-Feuerstein, S., Trujillo, J., Truta, T. M., Volz, B., Waller, E., Xiong, L., Zimányi, E. (eds.) International Workshop on Business intelligencE and the WEB (BEWEB 2010) at 13th International Conference on Extending Database Technology and Thirteenth International Conference on Database Theory (EDBT/ICDT 2010). ACM (2010) Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jensen, A. S., Boss, N. S.: Textual Similarity: Comparing Texts in Order to Discover How Closely They Discuss the Same Topics. Bachelor's Thesis, Technical University of Denmark (2008)Google ScholarGoogle Scholar
  15. Jiang, J. J., Conrath, D. W.: Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In: 10th International Conference on Research in Computational Linguistics (ROCLING 1997). pp. 19--33 (1997)Google ScholarGoogle Scholar
  16. Lang, K.: NewsWeeder: Learning to Filter Netnews. In: 12th International Conference on Machine Learning (ICML 1995). pp. 331--339. Morgan Kaufmann (1995)Google ScholarGoogle ScholarCross RefCross Ref
  17. Leacock, C., Chodorow, M.: WordNet: An Electronic Lexical Database, chap. Combining Local Context and WordNet Similarity for Word Sense Identification, pp. 265--283. MIT Press (1998)Google ScholarGoogle Scholar
  18. Lextek: Onix Text Retrieval Toolkit -- API Reference. http://www.lextek.com/manuals/onix/stopwords1.html (2012)Google ScholarGoogle Scholar
  19. Lin, D.: An Information-Theoretic Definition of Similarity. In: Shavlik, J. W. (ed.) 15th International Conference on Machine Learning (ICML 1998). pp. 296--304. Morgan Kaufmann (1998) Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: 14th International Joint Conference on Artificial Intelligence (IJCAI 1995). pp. 448--453. Morgan Kaufmann (1995) Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Salton, G., Buckley, C.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513--523 (1988) Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Spell, B.: Java API for WordNet Searching (JAWS). http://lyle.smu.edu/~tspell/jaws/index.html (2012)Google ScholarGoogle Scholar
  23. Toutanova, K., Klein, D., Manning, C. D., Singer, Y.: Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network. In: Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLTNAACL 2003). pp. 252--259 (2003) Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Wu, Z., Palmer, M. S.: Verb Semantics and Lexical Selection. In: 32nd Annual Meeting of the Association for Computational Linguistics (ACL 1994). pp. 133--138. Association for Computational Linguistics (1994) Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Semantics-based news recommendation with SF-IDF+

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            WIMS '13: Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
            June 2013
            408 pages
            ISBN:9781450318501
            DOI:10.1145/2479787

            Copyright © 2013 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 12 June 2013

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            WIMS '13 Paper Acceptance Rate28of72submissions,39%Overall Acceptance Rate140of278submissions,50%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader