AN EFFICIENT METHOD FOR WEB DATA EXTRACTION USING PARTIAL TREE ALIGNMENT ALGORITHM
Abstract
With the explosion of the World Wide Web, a wealth of data on many different subjects has become available online. Usually, users retrieve Web data by browsing and keyword searching. But, these traditional methods have their limitations and disadvantages. Search engine helps to retrieve the relevant web sites based on the keyword specified by the user. It performs various operations such as crawling, indexing etc. It displays thousands of links as a result of the web search, but there are many road blocks that can make this process difficult or even impossible. So, the proposed system mainly aims to eradicate the disadvantages of search engines by exploring the contents of a web page to a maximum extent. It finds the exact keywords that match a page. When the search engine searches for web pages related to exact keyword, it can return only a few pages which are highly focused, specific and relevant to the topic. By this, the end-user gets the required information related to the search. Experiment shows that new approach is feasible and effective.Downloads
Published
Issue
Section
License
COPYRIGHT AGREEMENT AND AUTHORSHIP RESPONSIBILITY
 All paper submissions must carry the following duly signed by all the authors:
“I certify that I have participated sufficiently in the conception and design of this work and the analysis of the data (wherever applicable), as well as the writing of the manuscript, to take public responsibility for it. I believe the manuscript represents valid work. I have reviewed the final version of the manuscript and approve it for publication. Neither has the manuscript nor one with substantially similar content under my authorship been published nor is being considered for publication elsewhere, except as described in an attachment. Furthermore I attest that I shall produce the data upon which the manuscript is based for examination by the editors or their assignees, if requested.â€