ACM Home Page
Please provide us with feedback. Feedback
POLYPHONET: an advanced social network extraction system from the web
Full text PdfPdf (482 KB)
Source International World Wide Web Conference archive
Proceedings of the 15th international conference on World Wide Web table of contents
Edinburgh, Scotland
SESSION: Social networks table of contents
Pages: 397 - 406  
Year of Publication: 2006
ISBN:1-59593-323-9
Authors
Yutaka Matsuo  National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
Junichiro Mori  University of Tokyo, Tokyo, Japan
Masahiro Hamasaki  National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
Keisuke Ishida  National Institute of Advanced Industrial Science and Technology (AIST)
Takuichi Nishimura  AIST
Hideaki Takeda  NII
Koiti Hasida  AIST
Mitsuru Ishizuka  University of Tokyo, Tokyo, Japan
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 30,   Downloads (12 Months): 227,   Citation Count: 10
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1135777.1135837
What is a DOI?

ABSTRACT

Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, detect groups of persons, and obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents.Several studies have used search engines to extract social networks from the Web, but our research advances the following points: First, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social networking mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Super Social Network Mining is proposed; it utilizes simple modules using Google and is characterized by scalability and Relate-Identify processes: Identification of each entity and extraction of relations are repeated to obtain a more precise social network.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. A. Adamic and E. Adar. Friends and neighbors on the web. Social Networks, 25(3):211--230, 2003.
2
3
 
4
D. Bollegara, Y. Matsuo, and M. Ishizuka. Extracting key phrases to disambiguate personal names on the web. In Proc. CICLing 2006, 2006.
 
5
R. S. Burt. Structural Holes: The Social Structure of Competition. Harvard University Press, Cambridge, MA, 1992.
6
 
7
 
8
9
10
11
 
12
A. Culotta, R. Bekkerman, and A. McCallum. Extracting social networks and contact information from email and the web. In CEAS-1, 2004.
 
13
I. Davis and E. V. Jr. RELATIONSHIP: A vocabulary for describing relationships between people. http://vocab.org/relationship/.
14
15
 
16
J. Golbeck and J. Hendler. Accuracy of metrics for inferring trust and reputation in semantic web-based social networks. In Proc. EKAW 2004, 2004.
 
17
R. Guha and A. Garg. Disambiguating entities in web search. TAP project, http://tap.stanford.edu/PeopleSearch.pdf.
 
18
M. Hamasaki, H. Takeda, I. Ohmukai, and R. Ichise. Scheduling support system for academic conferences based on interpersonal networks. In Proc. ACM Hypertext 2004, 2004.
19
 
20
Y. Jin, Y. Matsuo, and M. Ishizuka. Extracting inter-business relationship from world wide web. In Workshop Notes, Web Community Structure and Network Analysis Workshop, 2005.
 
21
H. Kautz, B. Selman, and M. Shah. The hidden Web. AI magazine, 18(2):27--35, 1997.
 
22
P. Knees, E. Pampalk, and G. Widmer. Artist classification with web-based data. In 5th International Conf. on Music Information Retrieval(ISMIR), 2004.
 
23
 
24
J. Leskovec, L. A. Adamic, and B. A. Huberman. The dynamics of viral marketing, 2005. http://www.hpl.hp.com/research/idl/papers/viral/viral.pdf.
 
25
 
26
L. Lloyd, V. Bhagwan, D. Gruhl, and A. Tomkins. Disambiguation of references to individuals. Technical Report RJ10364(A0410-011), IBM Research, 2005.
 
27
B. Malin. Unsupervised name disambiguation via social network similarity. In Workshop Notes on Link Analysis, Counterterrorism, and Security, 2005.
 
28
G. S. Mann and D. Yarowsky. Unsupervised personal name disambiguation. In Proc. CoNLL, 2003.
 
29
 
30
Y. Matsuo, H. Tomobe, K. Hasida, and M. Ishizuka. Finding social network for trust calculation. In Proc. 16th European Conference on Artificial Intelligence (ECAI2004), pp. 510--514, 2004.
 
31
Y. Matsuo, H. Tomobe, K. Hasida, and M. Ishizuka. Social network extraction from the web information. Journal of the Japanese Society for Artificial Intelligence, 20(1E):46--56, 2005. in Japanese.
 
32
P. Mika. Flink: Semantic web technology for the extraction and analysis of social networks. Journal of Web Semantics, 3(2), 2005.
 
33
P. Mika. Ontologies are us: A unified model of social networks and semantics. In Proc. ISWC2005, 2005.
 
34
35
 
36
J. Mori, Y. Matsuo, and M. Ishizuka. Finding user semantics on the web using word co-occurrence information. In Proc. Int'l. Workshop on Personalization on the Semantic Web (PersWeb05), 2005.
 
37
H. Nakagawa, A. Maeda, and H. Kojima. Automatic term recognition system TermExtract. http://gensen.dl.itc.utokyo.ac.jp/gensenweb eng.html.
 
38
 
39
 
40
G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435:814, 2005.
 
41
 
42
M. Sahami and T. Heilman. A web-based kernel function for matching short text snippets. In International Workshop on Learning in Web Search (LWS2005), pp. 2--9, 2005.
 
43
 
44
 
45
 
46
S. Wasserman and K. Faust. Social network analysis. Methods and Applications. Cambridge University Press, Cambridge, 1994.

CITED BY  10
 
 

Collaborative Colleagues:
Yutaka Matsuo: colleagues
Junichiro Mori: colleagues
Masahiro Hamasaki: colleagues
Keisuke Ishida: colleagues
Takuichi Nishimura: colleagues
Hideaki Takeda: colleagues
Koiti Hasida: colleagues
Mitsuru Ishizuka: colleagues