Sunday, 6 October 2013

Challenges in Effective Web Data Mining

Data collection and web data mining are critical processes for many businesses and marketing companies today. The techniques commonly used include search engines, topic-based searches and directories. Web data mining is necessary for any business that wants to build data warehouses by harvesting data from the internet, because high-quality, intelligent information cannot be harvested from the internet easily. Such information is critical because it enables you to get the desired results and the business intelligence that is in demand.
Keyword-based searches are important in the marketing of company products. They are usually affected by the following factors:
• Irrelevant pages. The use of common, general keywords on search engines yields millions of web pages. Some of these pages may be irrelevant and of no help to the user.
• Ambiguous results. This is usually caused by multi-variant or similar keyword semantics. A name could be an animal, a movie or even a sports accessory. This results in web pages that are different from what you are actually searching for.
• Possibility of missing some web pages. There is a great possibility of missing the most relevant information when it is contained on web pages that are not indexed under a given keyword.
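The three factors above can be illustrated with a small keyword index. This is only a minimal sketch: the page addresses, the page text and the helper names build_index and keyword_search are all invented for illustration, and real search engines are far more sophisticated.

```python
from collections import defaultdict

# Hypothetical toy corpus; page addresses and text are invented for illustration.
PAGES = {
    "zoo.example/jaguar": "the jaguar is a big cat found in the americas",
    "cars.example/jaguar": "jaguar builds luxury cars and sports saloons",
    "cats.example/panthera-onca": "panthera onca is the largest cat in the americas",
    "news.example/cars": "new cars were announced this week",
}


def build_index(pages):
    """Map each keyword to the set of pages whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def keyword_search(index, query):
    """Return the pages that contain every keyword in the query."""
    words = query.lower().split()
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


index = build_index(PAGES)

# Ambiguous results: the animal page and the car page both match "jaguar".
ambiguous = keyword_search(index, "jaguar")

# Irrelevant pages: a general keyword returns pages of mixed relevance.
broad = keyword_search(index, "cars")

# Missed pages: the panthera-onca page is about the same animal,
# but it is not indexed under the keyword "jaguar".
narrow = keyword_search(index, "jaguar cat")
```

In this toy corpus the last query returns only the zoo page, although the panthera-onca page describes the same animal under a different name.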
One of the factors that limit the use of web data mining is the effectiveness of search engine crawlers. This is widely evidenced by the fact that search engine crawlers and bots cannot access the entire web, which can be attributed partly to bandwidth limitations. It is important to understand that there are thousands of databases on the internet that can deliver well-maintained, high-quality information and are not easily accessed by crawlers.
In web data mining it is important to understand that the majority of search engines offer limited choices for keyword query combination. For instance, Yahoo and Google offer options such as phrase and exact matches, which may limit the search results even further. It usually demands more effort and time to get the most important and relevant information. Human behavior and the available alternatives usually change over time. This implies that web pages need to be updated frequently to reflect emerging trends. It is important to realize that there is limited room for web data mining. This is so because the information that currently exists relies heavily on keyword-based indices. This does not apply to the real data.
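As a rough illustration of how a phrase or exact-match option can narrow the results, the sketch below compares a plain keyword match with an exact phrase match. The documents and the helper names matches_all_keywords and matches_phrase are invented for illustration, and this is not how Yahoo or Google implement these options.

```python
# Hypothetical documents, invented for illustration.
DOCS = [
    "web data mining helps build data warehouses",
    "mining companies publish production data on the web",
    "data mining on the web is challenging",
]


def matches_all_keywords(text, query):
    """True if every query word occurs somewhere in the text, in any order."""
    words = set(text.lower().split())
    return all(word in words for word in query.lower().split())


def matches_phrase(text, phrase):
    """True if the query words occur consecutively and in the same order."""
    tokens = text.lower().split()
    target = phrase.lower().split()
    if not target:
        return False
    n = len(target)
    return any(tokens[i:i + n] == target for i in range(len(tokens) - n + 1))


query = "web data mining"
keyword_hits = [doc for doc in DOCS if matches_all_keywords(doc, query)]
phrase_hits = [doc for doc in DOCS if matches_phrase(doc, query)]
```

Here all three documents contain the keywords, but only the first contains them as an exact phrase, so the phrase option returns fewer results and can drop documents that are still relevant.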
Web data mining is an important tool for any business, and embracing this technology can help solve data crisis problems. Several limitations and many challenges have arisen in the quest to rediscover the use of web resources effectively and efficiently. However, irrespective of these challenges, web data mining is an effective tool that can be employed in many technological and scientific fields. It is therefore paramount to embrace this technology and use it fully in order to realize your corporate goals.


Source: http://goarticles.com/article/Challenges-in-Effective-Web-Data-Mining/6771744/
