|
ABSTRACT
Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, detect groups of persons, and obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents.Several studies have used search engines to extract social networks from the Web, but our research advances the following points: First, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social networking mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Super Social Network Mining is proposed; it utilizes simple modules using Google and is characterized by scalability and Relate-Identify processes: Identification of each entity and extraction of relations are repeated to obtain a more precise social network.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. A. Adamic and E. Adar. Friends and neighbors on the web. Social Networks, 25(3):211--230, 2003.
|
 |
2
|
|
 |
3
|
|
| |
4
|
D. Bollegara, Y. Matsuo, and M. Ishizuka. Extracting key phrases to disambiguate personal names on the web. In Proc. CICLing 2006, 2006.
|
| |
5
|
R. S. Burt. Structural Holes: The Social Structure of Competition. Harvard University Press, Cambridge, MA, 1992.
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
 |
11
|
|
| |
12
|
A. Culotta, R. Bekkerman, and A. McCallum. Extracting social networks and contact information from email and the web. In CEAS-1, 2004.
|
| |
13
|
I. Davis and E. V. Jr. RELATIONSHIP: A vocabulary for describing relationships between people. http://vocab.org/relationship/.
|
 |
14
|
|
 |
15
|
|
| |
16
|
J. Golbeck and J. Hendler. Accuracy of metrics for inferring trust and reputation in semantic web-based social networks. In Proc. EKAW 2004, 2004.
|
| |
17
|
R. Guha and A. Garg. Disambiguating entities in web search. TAP project, http://tap.stanford.edu/PeopleSearch.pdf.
|
| |
18
|
M. Hamasaki, H. Takeda, I. Ohmukai, and R. Ichise. Scheduling support system for academic conferences based on interpersonal networks. In Proc. ACM Hypertext 2004, 2004.
|
 |
19
|
|
| |
20
|
Y. Jin, Y. Matsuo, and M. Ishizuka. Extracting inter-business relationship from world wide web. In Workshop Notes, Web Community Structure and Network Analysis Workshop, 2005.
|
| |
21
|
H. Kautz, B. Selman, and M. Shah. The hidden Web. AI magazine, 18(2):27--35, 1997.
|
| |
22
|
P. Knees, E. Pampalk, and G. Widmer. Artist classification with web-based data. In 5th International Conf. on Music Information Retrieval(ISMIR), 2004.
|
| |
23
|
|
| |
24
|
J. Leskovec, L. A. Adamic, and B. A. Huberman. The dynamics of viral marketing, 2005. http://www.hpl.hp.com/research/idl/papers/viral/viral.pdf.
|
| |
25
|
|
| |
26
|
L. Lloyd, V. Bhagwan, D. Gruhl, and A. Tomkins. Disambiguation of references to individuals. Technical Report RJ10364(A0410-011), IBM Research, 2005.
|
| |
27
|
B. Malin. Unsupervised name disambiguation via social network similarity. In Workshop Notes on Link Analysis, Counterterrorism, and Security, 2005.
|
| |
28
|
G. S. Mann and D. Yarowsky. Unsupervised personal name disambiguation. In Proc. CoNLL, 2003.
|
| |
29
|
|
| |
30
|
Y. Matsuo, H. Tomobe, K. Hasida, and M. Ishizuka. Finding social network for trust calculation. In Proc. 16th European Conference on Artificial Intelligence (ECAI2004), pp. 510--514, 2004.
|
| |
31
|
Y. Matsuo, H. Tomobe, K. Hasida, and M. Ishizuka. Social network extraction from the web information. Journal of the Japanese Society for Artificial Intelligence, 20(1E):46--56, 2005. in Japanese.
|
| |
32
|
P. Mika. Flink: Semantic web technology for the extraction and analysis of social networks. Journal of Web Semantics, 3(2), 2005.
|
| |
33
|
P. Mika. Ontologies are us: A unified model of social networks and semantics. In Proc. ISWC2005, 2005.
|
| |
34
|
|
 |
35
|
|
| |
36
|
J. Mori, Y. Matsuo, and M. Ishizuka. Finding user semantics on the web using word co-occurrence information. In Proc. Int'l. Workshop on Personalization on the Semantic Web (PersWeb05), 2005.
|
| |
37
|
H. Nakagawa, A. Maeda, and H. Kojima. Automatic term recognition system TermExtract. http://gensen.dl.itc.utokyo.ac.jp/gensenweb eng.html.
|
| |
38
|
|
| |
39
|
|
| |
40
|
G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435:814, 2005.
|
| |
41
|
|
| |
42
|
M. Sahami and T. Heilman. A web-based kernel function for matching short text snippets. In International Workshop on Learning in Web Search (LWS2005), pp. 2--9, 2005.
|
| |
43
|
Steffen Staab , Pedro Domingos , Peter Mika , Jennifer Golbeck , Li Ding , Tim Finin , Anupam Joshi , Andrzej Nowak , Robin R. Vallacher, Social Networks Applied, IEEE Intelligent Systems, v.20 n.1, p.80-93, January 2005
[doi> 10.1109/MIS.2005.16]
|
| |
44
|
|
| |
45
|
|
| |
46
|
S. Wasserman and K. Faust. Social network analysis. Methods and Applications. Cambridge University Press, Cambridge, 1994.
|
CITED BY 10
|
|
|
|
|
Denilson Alves Pereira , Berthier Ribeiro-Neto , Nivio Ziviani , Alberto H. F. Laender, Using web information for creating publication venue authority files, Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries, June 16-20, 2008, Pittsburgh PA, PA, USA
|
|
Ido Guy , Michal Jacovi , Elad Shahar , Noga Meshulam , Vladimir Soroka , Stephen Farrell, Harvesting with SONAR: the value of aggregating social network information, Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems, April 05-10, 2008, Florence, Italy
|
|
|
|
|
Yutaka Matsuo , Junichiro Mori , Masahiro Hamasaki , Takuichi Nishimura , Hideaki Takeda , Koiti Hasida , Mitsuru Ishizuka, POLYPHONET: An advanced social network extraction system from the Web, Web Semantics: Science, Services and Agents on the World Wide Web, v.5 n.4, p.262-278, December, 2007
|
|
|
|
|
|
|
|
|
Pedro DeRose , Warren Shen , Fei Chen , AnHai Doan , Raghu Ramakrishnan, Building structured web community portals: a top-down, compositional, and incremental approach, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|