skip to main content
10.1145/1401890.1402002acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Spotting out emerging artists using geo-aware analysis of P2P query strings

Published:24 August 2008Publication History

ABSTRACT

Record label companies would like to identify potential artists as early as possible in their careers, before other companies approach the artists with competing contracts. The vast number of candidates makes the process of identifying the ones with high success potential time consuming and laborious. This paper demonstrates how datamining of P2P query strings can be used in order to mechanize most of this detection process. Using a unique intercepting system over the Gnutella network, we were able to capture an unprecedented amount of geographically identified (geo-aware) queries, allowing us to investigate the diffusion of music related queries in time and space. Our solution is based on the observation that emerging artists, especially rappers, have a discernible stronghold of fans in their hometown area, where they are able to perform and market their music. In a file sharing network, this is reflected as a delta function spatial distribution of content queries. Using this observation, we devised a detection algorithm for emerging artists, that looks for performers with sharp increase in popularity in a small geographic region though still unnoticable nation wide. The algorithm can suggest a short list of artists with breakthrough potential, from which we showed that about 30% translate the potential to national success.

References

  1. L. A. N. Amaral, A. Scala, M. Barthelemy, and H. E. Stanley. Classes of small-world networks. PNAS, 97(21):11149--11152, Sept. 2000.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Bhattacharjee, R. D. Gopal, K. Lertwachara, and J. R. Marsden. Using P2P sharing activity to improve business decision making: proof of concept for estimating product life-cycle. Electronic Commerce Research and Applications, 4(1):14--20, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Domingos and M. Richardson. Mining the network value of customers. In KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 57--66. ACM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. T. Garber, J. Goldenberg, B. Libai, and E. Muller. From density to destiny: Using spatial dimension of sales data for early prediction of new product success. Marketing Science, 23(3):419--428, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. S. Gish, Y. Shavitt, and T. Tankel. Geographical statistics and characteristics of p2p query strings. In The 6th International Workshop on Peer-to-Peer Systems (IPTPS'07), Feb. 2007.Google ScholarGoogle Scholar
  6. M. S. Granovetter. The strength of weak ties. The American Journal of Sociology, 78(6):1360--1380, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  7. C. X. Ling and C. Li. Data mining for direct marketing: Problems and solutions. In Knowledge Discovery and Data Mining, pages 73--79, 1998.Google ScholarGoogle Scholar
  8. G. N. Noren, R. Orre, and A. Bate. A hit-miss model for duplicate detection in the who drug safety database. In KDD, pages 459--468. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. H. Rasti, D. Stutzbach, and R. Rejaie. On the long-term evolution of the two-tier gnutella overlay. In IEEE Global Internet Symposium, Barcelona, Spain, Apr. 2006.Google ScholarGoogle ScholarCross RefCross Ref
  10. J. Shepherd. Ghost rider fallout haunts Mistah F.A.B., Mar. 2007. Featured on The VIBE Magazine website. Last Accessed December 2007.Google ScholarGoogle Scholar
  11. F. Usama, P.-S. Gregory, and S. Padhraic. The kdd process for extracting useful knowledge from volumes of data. In Communication of the ACM, volume 29, pages 27--34, Nov. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. J. Watts and S. H. Strogatz. Collective dynamics of small-world networks. Nature, 393:440--442, June 1998.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Spotting out emerging artists using geo-aware analysis of P2P query strings

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2008
        1116 pages
        ISBN:9781605581934
        DOI:10.1145/1401890
        • General Chair:
        • Ying Li,
        • Program Chairs:
        • Bing Liu,
        • Sunita Sarawagi

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 August 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        KDD '08 Paper Acceptance Rate118of593submissions,20%Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader