Data Mining Evaluating Data Mining Thesis
- Length: 10 pages
- Sources: 8
- Subject: Education - Computers
- Type: Thesis
- Paper: #55885792
Excerpt from Thesis :
Assessing Data Mining as a Technology Trend
The catalyst of data mining's growth continues to be the unmet information needs within organizations that are seeking to gain a competitive advantage from the vast data they have accumulated. The convergence of hardware advances in virtualization of server technologies and their use for accelerating complex processing tasks (Luo, Lu, Huang, He, Shi, 2006) in conjunction with the development of text mining, clustering and relational analytics engines (Berry, 2004) is drastically re-ordering the data mining landscape. In addition the acceptance of AJAX as a programming language of choice for data-intensive applications has also served to accelerate the adoption of data mining throughout geographically dispersed organizations (Nayak, 2008). Software-as-a-Service (SaaS) platforms are also being created as a result of these trends including virtualization and AJAX or then client computing (Nayak, 2008). These technologies are making it possible to more quickly and thoroughly define the associations in data and also progress through the five process areas mentioned in the previous section of this analysis.
The more fundamental catalysts of this technological trend of data mining however are found in the unmet needs of organizations, both for-profit and non-profit, to gain greater insights and intelligence into their customers, operating and processes. The role of data mining has been one of creating greater analytical tools through the use of AJAX programming, .NET, Java (J2EE) and the development of Web Services (Nayak, 2008). There is a cycle of continuous innovation occurring today as a result. The technologies are continually fuelling greater flexibility and depth of analysis, while at the same time creating more efficient approaches to creating reports and online scorecards. The net result of these improvements in usability is a continual improvement in how the reports and analysis can be tailored to the needs of information users. For the first time this convergence of technologies and needs is leading to roles-based access of vast amounts of data analyzed through data mining engines and constraint-based modeling techniques (Sun, 2006). This is also fueling the use of data mining for more predictive analytics models in small and medium businesses as the applications are being delivered over the Internet (Nayak, 2008). Organizations are using data mining to also drive their strategies for Business Intelligence (BI) and advanced data warehousing (DW) platforms and programs that are making strategies more accomplishable through greater intelligence and more real-time feedback. In conclusion the needs of users are growing more complex and demanding in terms of analytics while data mining, business intelligence, and data warehouses are also evolving, further expanding the expectations. This cycle of innovation will continue to accelerate as technology gains are made while users of these systems devise creative new ways to use the data and capitalize on the insights they deliver.
Use of Data Mining at Google
Google's uses of data mining are both for the search services it delivers in addition to the extensive CRM platforms and systems used for targeting new corporate accounts, defining customer and audience segments, and devising new approaches to serving advertisers. Of all these customer groups, advertisers are the larger single source of revenue the company has due to their AdWords program. Google uses data mining to determine how effective their advertisers are with specific programs, to track trends of specific queries, determine how to improve the performance of their servers and virtualization routines, and also how to determine which are the best new potential products to launch. The Google latent semantic indexing technology is used for pattern matching (Buddhakulsomsiri, Zakarian, 2009) in addition to linguistics modeling and analysis. Google uses these technologies to create predictive linguistic models that assist the company in managing the search process more effectively. The use of latent semantic indexing actually creates more effective uses of computing time the company has on its servers, in addition to making the search models themselves more effective and streamlined in terms of linguistic associations made (Berry, 2004). Google has the goal of creating a data mining technology that is intelligent and self-learns patterns in data over time so that queries of their search engine and its associated products can be more efficient.
In addition to using data mining for their core search engine performance improvement strategies, and for determining how best to serve their advertising customers, Google also uses data mining to enable greater levels of process improvement (Osei-bryson, Rayward-smith, 2009). Google uses data mining for analyzing the performance of their applications, hosting centers, and business processes to determine how best to improve them over time. In this way its managers can gain more effective insight into how to best streamline, re-vamp and improve processes over time. Business process re-engineering is a main focus of the company using data mining, in addition to creating programs for continuous process improvement globally throughout its divisions and subsidiaries.
Future of Data Mining
The future of data mining is going to be defined by the technologies making the development of streamlined interfaces based on AJAX, J2EE possible. In conjunction with this development will be the continual improvement of XML networking technologies and speeds, making it possible for data mining to eventually become a true Web Service (Nayak, 2008). The continual improvements in usability and user interfaces will also lead to significant advances in how data mining is used for aligning with business roles and responsibilities. No longer constrained by the taxonomies used to create the databases, data mining applications will be able to create personalized, highly flexible taxonomies on the fly given a given user's requirements as well. In short, the data mining applications in the future will be transparent to the business processes and goals being achieved by companies over time. There will be more demarcation line between the data mining application and its supporting systems, databases and systems of record as well. Data mining will be integrated directly into knowledge flow as a result (Lai, Liu, 2009).
Data mining's initial development began…