INTERNATIONAL JOURNAL OF ENGINEERING, SCIENCE AND - Volume 4, Issue 1 , March 2015
Pages: 96-104
Date of Publication: 03-Jan-2015
Print Article
Download XML Download PDF
DISTRIBUTED AND PARALLEL DATA MINING WITH GRID COMPUTING ENVIRONMENT
Author: AMIT KAPOOR
Category: Engineering, Science and Mathematics
Abstract:
Advance in computing and communication over wired and wireless network have resulted in many distributive computing environments. Many of these environments have different distributed sources of voluminous data and multiple compute nodes. The Various science applications are typically at the forefront of large scale computing problems.
Fundamental scientific problems currently being explored generate gradually more complex data, require more realistic simulations of the processes under study and demand greater and more difficult visualizations of the results. These problems often require numerous complex calculations and collaboration among people with multiple disciplines and geographic locations. Examples of scientific grand challenge problems include multi-scale environmental modeling and ecosystem simulations, biomedical imaging and biomechanics, nuclear power and weapons simulations, fluid dynamics and fundamental computational science.
Grid computing has been proposed as a novel computational model, distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high-performance orientation. Today grids can be used as effective infrastructures for distributed high-performance computing and data processing. The grid can play signi?cant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called Knowledge Grid.
Data mining techniques can be efficiently deployed in a grid environment and operational grids can be mined for patterns that may help to optimize the effectiveness and efficiency of the grid computing infrastructure. This paper discusses how Grid computing can be used to support distributed data mining. Grid-based data mining uses Grids as decentralized high-performance platforms where to execute data mining tasks and knowledge discovery algorithms and applications.
Keywords: Grid Computing, Distributed Data Mining, Data mining algorithms, Knowledge Grid
|