Web Content Mining-A Study

M.Vanathi

Citation:

M.Vanathi, "Web Content Mining-A Study," International Journal of Electrical and Electronics Engineering, vol. 1, no. 1, pp. 23-27, 2014. Crossref, https://doi.org/10.14445/23488379/IJEEE-V1I1P105

Abstract

Data mining is the process of extracting the information a user needs through a sequence of steps. The Web is a huge collection of potentially useful information. Web mining is the part of data mining in which users find the information they need on the Web. There are three types of Web mining: Web content mining, Web structure mining, and Web usage mining. This paper focuses on Web content mining, in particular the techniques available for it.

Keywords

Web content mining, NLP, Information retrieval
