ABSTRACT
This paper addresses several key issues in extraction and mining of an academic social network: 1) extraction of a researcher social network from the existing Web; 2) integration of the publications from existing digital libraries; 3) expertise search on a given topic; and 4) association search between researchers. We developed a social network system, called ArnetMiner, based on proposed methods to the above problems. In total, 448,470 researcher profiles and 981,599 publications were extracted/integrated after the system having been in operation for two years. The paper describes the architecture and main features of the system. It also briefly presents the experimental results of the proposed methods.
- D. Blei, A. Ng, and M. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 2003, 3, 993--1022. Google ScholarDigital Library
- D. Brickley and L. Miller, FOAF vocabulary specification, namespace document, 2004. http://xmlns.com/foaf/0.1/.Google Scholar
- J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: probabilistic models for segmenting and labeling sequence data. Proc. of ICML'2001. Google ScholarDigital Library
- Y. F. Tan, M. Kan, and D. Lee. Search engine driven author disambiguation. Proc. of JCDL'2006. pp. 314--315. Google ScholarDigital Library
- J. Tang, D. Zhang, and L. Yao. Social network extraction of academic researchers. Proc. of ICDM'2007. pp. 292--301 Google ScholarDigital Library
- D. Zhang, J. Tang, J. Li, and K. Wang. A constraint-based probabilistic framework for name disambiguation. Proc. of CIKM'2007. pp. 1019--1022 Google ScholarDigital Library
- J. Zhang, J. Tang, L. Liu, and J. Li. A mixture model for expert finding. Proc. of PAKDD'2008. (to appear) Google ScholarDigital Library
Index Terms
- Extraction and mining of an academic social network
Recommendations
ArnetMiner: extraction and mining of academic social networks
KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data miningThis paper addresses several key issues in the ArnetMiner system, which aims at extracting and mining academic social networks. Specifically, the system focuses on: 1) Extracting researcher profiles automatically from the Web; 2) Integrating the ...
Constructing and mining a semantic-based academic social network
Knowledge integration and management in autonomous systemsA number of studies have focused on how to constructan academic social network. The relational ties among researchers are either co-authorship or shared keywords that are captured from scientific journals. The problem with such a network is that ...
Looking deeper into academic citations through network analysis: popularity, influence and impact
Google Scholar (GS) has progressively emerged as a tool which "provides a simple way to broadly search for scholarly literature across many disciplines and sources." As a free tool that provides citation metrics, GS has opened the academic word to a ...
Comments