| Semantic analysis of web site audience |
| Full text |
Pdf
(254 KB)
|
| Source
|
Symposium on Applied Computing
archive
Proceedings of the 2006 ACM symposium on Applied computing
table of contents
Dijon, France
SESSION: Data mining (DM)
table of contents
Pages: 525 - 529
Year of Publication: 2006
ISBN:1-59593-108-2
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 18, Downloads (12 Months): 173, Citation Count: 0
|
|
|
ABSTRACT
With the emergence of the World Wide Web, analyzing and improving Web communication has become essential to adapt the Web content to the visitors' expectations. Web communication analysis is traditionally performed by Web analytics software, which produce long lists of page-based audience metrics. These results suffer from page synonymy, page polysemy, page temporality, and page volatility. In addition, the metrics contain little semantics and are too detailed to be exploited by organization managers and chief editors, who need summarized and conceptual information to take high-level decisions. To obtain such metrics, we mine the content of the Web pages output by the Web server. For a given taxonomy covering the Web site knwoledge domain, we compute the term weights in the output pages and we aggregate them using OLAP tools, in order to obtain concept-based metrics representing the audience of the Web site topics. To demonstrate how our approach solves the cited problems, we actually compute concept-based metrics with SQL Server OLAP Analysis Service and our prototype WASA for a number of case studies. Finally, we validate our results against a popular Web analytics tool.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Ed H. Chi , Peter Pirolli , Kim Chen , James Pitkow, Using information scent to model user information needs and actions and the Web, Proceedings of the SIGCHI conference on Human factors in computing systems, p.490-497, March 2001, Seattle, Washington, United States
[doi> 10.1145/365024.365325]
|
| |
3
|
|
| |
4
|
A. Lozano-Tello and A. Gómez-Pérez. Ontometric: A method to choose the appropriate ontology. J. Database Manag., 15(2):1--18, 2004.
|
| |
5
|
|
| |
6
|
E. Malinowski and E. Zimányi. OLAP hierarchies: A conceptual perspective. In Proc. of the 16th Int. Conf. on Advanced Information Systems Engineering, CAiSE'04, LNCS 3084, pages 477--491. Springer-Verlag, 2004.
|
| |
7
|
J. G. March, H. A. Simon, and H. S. Guetzkow. Organizations. Cambridge Mass. Blackwell, second edition, 1983.
|
| |
8
|
|
| |
9
|
J. P. Norguet, E. Zimányi, and R. Steinberger. Improving web sites with web usage mining, web content mining, and semantic analysis. In Proc. of the 32nd Int. Conf. on Current Trends in Theory and Practice of Computer Science, SOFSEM, 2006.
|
| |
10
|
S. A. Ríos, J. D. Velásquez, E. S. Vera, H. Yasuda, and T. Aoki. Using SOFM to improve web site text content. In Proc. of First Int. Conf. on Advances in Natural Computation, ICNC 2005, Part II, pages 622--626, 2005.
|
 |
11
|
|
| |
12
|
R. Steinberger, B. Pouliquen, and C. Ignat. Exploiting multilingual nomenclatures and language-independent text features as an interlingua for cross-lingual text analysis applications. In Proc. B of the 7th Int. Multiconference on Language Technologies, IS 2004, 2004.
|
| |
13
|
R. Steinberger, B. Pouliquen, and C. Ignat. Navigating multilingual news collection using automatically extracted information. In Proc. of the 27th Int. Conf. on Information Technology Interfaces, ITI 2005, 2005.
|
| |
14
|
G. Stumme and A. Maedche. Fca-merge: Bottom-up merging of ontologies. In Proc. of the 17th Int. Joint Conf. on Artificial Intelligence, IJCAI, pages 225--234, 2001.
|
| |
15
|
|
|