Towards Augmentation of Present Technologies and Environment of Grid Computation: A Methodical Study, and Analysis

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 16

International Journal of Application or Innovation in Engineering & Management (IJAIEM)

Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847

Towards Augmentation of Present Technologies and Environment of Grid Computation: A Methodical Study, and Analysis
Shah Murtaza Rashid Al Masud
Najran University, P.O. Box 1988 Najran, Saudi Arabia

ABSTRACT
In recent years the computer systems and networks performance increase due to the faster, sophisticated and powerful software, hardware and supercomputers. But still there are many problems in the fields of engineering, technology, science and business, and these are mainly relying on the complexity and size of computing systems. To solve these problems we have to collect, unify and use geologically disseminated resources as a single powerful computer system having the properties including high throughput distributed supercomputing. Grid computation technology provides services, tools and infrastructure necessary for developing, managing and using widely distributed, large scale and virtual computing systems including scientific instruments, data centers and data caches. Grid provides a marvelous derivation for collecting, combining, storing computational resources from diverse to integrated environment for proper computing, resources sharing, distribution, communication, security, and management. Grid is important framework for local and global societies in education, business, government, and research. To cope up with the modern technologies in computing like heterogeneity, scalability, portability, and interoperability among various resources and sites its an utmost and recent need and demand to build a standard architecture and platform of Grid infrastructure. In this research paper the authors briefly studied, analyzed, and present the state of-the-art of Grid computing as a whole including its importance, architecture, and scheduling techniques. This research paper will be a great source and guideline for researchers and academicians to further study and develop a unified and emerging Grid environment and advance research in this field.

Keywords: Grid computing, Grid infrastructure, Grid environment, Supercomputer, Architecture, Scheduling technique

1. Introduction
Researchers in the early 1990s developed a new computing platform for computational and storage management, information aggregation, trigger services and sharing resources, accurate data movement, security specification, and proper monitoring that added additional services in entire computing systems providing trustworthy, reliable, pervasive, and economical access to computational potentials. Its a hardware and software infrastructure namely computational Grid. Grid computing helps to solve any scientific problem by applying resources of many computers in a network in order to come up to a single problem without accessing to huge amount of data and processing cycles [1]. Its a platform for high performance and distributed computing permits users to access heterogeneous resources. Its an enhanced virtualized form of distributed high performance computing systems utilizing the available computational resources and spreading across several machines where each of which accomplishing some tiny proportion of computation. Grid can be classified into three main classes as presented in Table 1. Table 1: Grid Types
Grid Types Computational grid Scavenging grids Data grids Specification Allocating resources specifically for computing power Finding and using unused resources like machine cycles for resourceintensive tasks. Provides a unified model through which data can be queried, managed and secured.

From Internet to powerful, high-speed and low-cost computers and network technologies are changing the way of using computers today that leads and guides facilitating distributed computers known as Grid computing [1]. It is also known as Meta computing, scalable computing, global computing, Internet computing and in recent time peer-to-peer computing [2].

Volume 2, Issue 5, May 2013

Page 285

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Grids technology is geologically disseminated and distributed and possesses by diverse organizations for solving largescale computational and data concentrated problems in science, engineering, technology and commerce hence creating virtual organizations [3] and enterprises [4] as envisioned in [5] to share resources, skills, and core competencies in order to better respond to business opportunities or large-scale application processing requirements. The whole functionalities of Grid are supported by computer networks which will be helpful for many applications including supercomputing and high throughput computation. The most frequently used terms or components in Grid computing [6] are presented in Table 2. Table 2: Frequently used Grid computing terms
Grid Terms Grid Information Service (GIS) Services management Task Properties of Task Job/Application/ Metatask Job Management Block (JMB) Data management Data Management (DMB) Resource User Task Scheduling Scheduler Resource Broker Resource management Site/ Node Grid Portal Grid Security Infrastructure (GSI) Security management Global Grid Local Grid Grid Protocol Block Description Collecting and passing the Information having the details of the available Grid resources to the resource broker. Query the grid in an effective and efficient way by its users and applications. A task is element of Grid scheduler that should be scheduled. Task is assigned to a resource. CPU and memory requirement, deadline, priority, etc are the parameters of the properties of a task. A job or metatask, or application (jobs are composed of sub-jobs and/or tasks, and sub-jobs.) is a set of elements namely atomic tasks that will be carried out on a set of resources. JMB is also known as grid resource allocation Manager (GRAM provides the services to launch a job on a particular resource. Data must be transported, cleansed, and processed. DMB is accountable for moving files and data to different nodes within the grid. Grid Resources (for example Cluster, PC, Supercomputer, database, instruments, etc.) are required to carry out an operation or are required to execute the user jobs in global Grid. In order to speed up the execution of the application in the Grid system the users transmit computation or data exhaustive application It is the mapping of tasks to a preferred and selected group of resources which may be distributed in multiple administrative domains. Goal: Rules and priorities are to be set and selected for routing jobs on a grid-based infrastructure. It monitors the progress of active jobs. Broker: It provides information about available resources on the grid and distributes the jobs in an application to the Grid resources and also gives details working status of these resources for further execution. The grid must be aware of the availability of resources for different tasks. A site (or node) is an autonomous entity composed of one or multiple resources. It is an interface to launch applications for users. GSI helps to establish the identity of the processor as well as the users of the grid. It consists of 2 components -user security and-node security. Access to and use of the available resources should be managed by authorized users in Grid system. It is a universal computing environment, offering a different range of heterogeneous services. Its a global sharing of resources through the internet. It is situated within an organization. It is a protocol that reconciles or mediates between user and resource during access to any resource in Grid environment.

A typical view of Grid environment along with its entities is shown in Figure 1. Using this interdisciplinary system the scientists and researchers will be able to interact with colleagues, access instrumentation, share data and computational resources, and access information in digital libraries that will be available on-line. [7]. It has a direct impact on the design and development of next generation of computer applications taking into consideration of complexity of future applications that will grow rapidly, time to market pressure, cost reasons, it is foreseeable that no single company or organization would be able to, for example, create by itself complex and diverse software, or hire and train all the necessary expertise necessary to build an application. In general, such applications tend to be multidisciplinary multi modular, written by several development teams using several programming languages, using multisource heterogeneous data, which can be mobile and interactive. Their execution will take very little time [8]. In this research

Volume 2, Issue 5, May 2013

Page 286

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
paper the authors have studied and given a total procedure and analyzed the components towards developing a heterogeneous Grid architecture.

Figure 1: Grid components and their relationship This paper aims to present a brief description Grid computation, surrounding environment, and its architecture. Necessary components used for computational Grid, reasons of Grid computing, users' classes, Grid services are presented in Section 1 and Section 2. Grid applications, potential grid programming models, design features required by Grid environment, benefits of using Grid computation are given in Section 3. Grid topology, Grid Layers, Protocols, and their Activities towards designing Grid Architecture Model, and its requirements are illustrated in Section 4. Areas of Grid evolution, categories of major and existing Grid Projects, available Grid projects and their objectives, Available Grid tools are studied and presented in Section 5. Grid scheduling process, its parameter and components are discussed in Section 6. Research challenges and future work in analyzed in Section 7. Finally the research paper is concluded in Section 8.

2. Reasons Of Computational Grid


Main reasons for computational Grids are as follows: To maintain and support synchronized and coordinated use of resources at multiple sites for computation. Endeavor disseminated or distributed computing technologies and enterprise Java enable resource sharing within a single organization. Managing and coordinating problem solving and resource sharing in multi-disciplinary virtual organizations (VO). Direct access to various applications programs, computers, software, data and other resources other than primarily file exchange. Ensuring highly secured and controlled sharing and monitoring. To ensure adequate computing environment. To increase computational power. Other reasons for computational Grids are as follows: Technology improvement Increase in demand-driven access to computational power Increased utilization of idle capacity Greater sharing of computational results New problem-solving techniques and tools Services provide by Grids technology is illustrated in Table 3. Classes of Grid users and their activities are mentioned in Table 4.

Volume 2, Issue 5, May 2013

Page 287

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Table 3: Grid Services
Service Name Computational services Description Computational services provided by Grid is called computational Grid that ensures secure services for executing application jobs individually or collectively managed by resource brokers on distributed computational resources. Some examples of computational Grids are: NASA IPG [9], the World Wide Grid [10], and the NSF TeraGrid [11]. To ensure secure access to distributed datasets and their management providing a scalable storage and access to the data sets. Sample applications that need such services for management, sharing, and processing of large datasets are high-energy physics [12] and accessing distributed chemical databases for drug design [13]. It provides secure access to remote software and libraries transparently along with application management. Examples of this emerging technologies are such as Web services [14], NetSolve [15]. It concerns with the extraction, storing, accessing, sharing, maintaining and presentation of data by using various services of computation. The main concern is how the knowledge is acquired, used, retrieved, published, and maintained to assist users in achieving their particular goals and objectives.

Data services

Application services

Information services Knowledge services

Table 4: Classes of Grid Users


Classes of Grid Users End Users Application Developers Tool Developers Grid Developers System administrators Activities and Major Concerns Solving problems by using applications. Its major concerns: transparency, performance Developing applications by using programming models, tools. It major concerns: ease of use, performance Develops tools, and programming models where the major concerns are adaptively, exposure of performance Provides basic and local grid services including local simplicity, connectivity, and security. Manages Grid resources where the major concerns are: balancing local and global concerns.

3. Grid Applications, Design Features, and Benefits


Other than distributed supercomputing a large number of other applications can benefit from the Grid computation science, engineering, commerce, and educational fields [16], [17]. The Internet computing projects, such as SETI@Home [18] and Distributed.Net [19] are some example of Grid applications. It also helps to solve challenging problems such as biophysics, chemistry, biology, scientific instrumentation [20], drug design [21], [22], tomography [23], high energy physics [24], data mining, financial analysis, nuclear simulations, material science, chemical engineering, environmental studies, climate modeling [25], weather prediction, molecular biology, neuroscience/brain activity analysis [26], structural analysis, mechanical CAD/CAM, and astrophysics. Component composition architecture [27], different disciplinary modules, fluids and structures in an aeronautics code [28], [29], linear, Euler, and Navier Stokes aerodynamics [30], [31]; in parallel and distributed simulation technologies [32], [33] along with some applications belonging to areas such as the design of complex systems, education and training, entertainment, military, social and business collaborations, telecommunications, transportation. Most of the key Grid applications [34] are presented in Table 5. Table 5: Grid Applications
Major Grid Applications Distributed super computing High Throughput On demand Data intensive Collaborative Characteristics Very large problems need a lot of CPUs, memory, and other tools. Examples: DIS Join many idle resources to increase collective throughput. Examples: Chip design, cryptographic problems Remote resources integrated with local computation. Examples: Medical instrumentation, Cloud detection, Network-enabled solvers Amalgamation and synthesis of new information from many or large data sources. Examples: sky survey, physics data Maintain communication or collaborative work between multiple participants. Examples: education, data exploration, collaborative design

Some new emerging class of application that can benefit from the Grid is service-oriented computing (e.g. application service provides and the users QoS requirements driven access to remote software and hardware resources [35]. Potential grid programming models are explicit in Table 6.

Volume 2, Issue 5, May 2013

Page 288

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Table 6: Potential grid programming models
Grid programming models Datagram and stream communication model Shared memory, multithreading model Data parallelism model Message passing model Object oriented model Remote procedure call model High throughput model Group ordered based model Agents based model Characteristics Low overhead, low level model. Examples: TCP, UDP, Multicast High level, low scalability model. Examples: DSM-distributed shared memory Automatic parallelization, restricted applicability. Example: HPC++ High performance, low level model. Example: PVM, MPI Support for large system design, low performance. Example: Java RMI Simplicity is high, but restricted applicability based model. Example: DCE, ONC Ease of use but restricted applicability based model. Example: Condor, Nimrod Robustness, Low performance, scalability. Example: Isis, Totem High flexibility, low performance, robustness. Example: Aglets, Telescript

3.1 Design Features of Grid Computation The main design features [6] [36] required by a Grid environment are presented in Table 7. Table 7: Design features required by Grid environment Importance
The administrative hierarchy defines how administrative information runs through the Grid. It supports protocols that are used for bulk-data transport, streaming data, group communications, and those used by distributed objects. Also used to support QoS parameters such as latency, bandwidth, reliability, fault-tolerance, and jitter control. One of the major aims of Grid system is to make all resources accessible. The components of GIS provide the mechanisms for registering and obtaining information about the structure, resources, services, and status of any Grid computation system. The naming service (names are used to denote to a wide variety of objects such as computers, services, or data objects) affords a constant and uniform name space across the complete Grid environment. Distributed file system offers an identical global namespace, support a range of file I/O protocols, require little or no program modification, and provide means that enable performance optimizations to be implemented, such as the usage of caches. A security and authorization infrastructure are the significant part to the success or failure of a Grid environment. Any distributed computing system like Grid must concentrate about these four security aspects: privacy, reliability or integrity, verification and validation that is authentication, and accountability or responsibility Proper monitoring resources and applications need to provide for a reliable and robust environment in Grid system. The management of processor time, memory, network, storage, and other components in this Grid environment in very important. It's very significant to efficiently and effectively schedule the applications that need to utilize the available resources in the Grid computing environment. Both resource management and scheduling should be transparent. It is essential to support mechanisms and policies that help in regulate resource supply and demand [37], [38]. An economic approach is one means of managing resources in a complex and decentralized manner. This approach provides incentives for resource owners, and users to be part of the Grid and develop and using strategies that help maximize their objectives. It should comprise interfaces, application programming interfaces (APIs), utilities, and tools to deliver a rich development environment including general scientific languages such as C, C++, Java, and Fortran must be available, application-level interfaces such as message passing interface (MPI) and virtual machine (PVM) which supports interoperability - by this they mean flexibility and heterogeneity; programming paradigms that must support message passing or distributed shared memory; a suite of numerical and other commonly used libraries should also be available. Tools, resources and services Grid environment should be intuitive and easy to use. They must be platforms and operating systems independent. The system should be available on-line or web-centric that will help to easily access distributed and supercomputing and scientific resources from anywhere at any time by any user over any platform.

List of Design Features


Administrative hierarchy Communication services

Information services

Naming services

Distributed file systems and caching Security and authorization

System status and fault tolerance Resource management and scheduling

Computational economy and resource trading

Programming paradigms

tools

and

User and administrative graphic user interface (GUI)

3.2 Advantages or major benefits of Grid computation Some of the advantages of Grid Computation are listed and presented in Table 8 below [39][40]: Seamless and secure access to large number of geographically distributed resources.

Volume 2, Issue 5, May 2013

Page 289

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Reduction in average job response time may occur but an overhead of limited network bandwidth and latency exists. Provides users around the world with dynamic and adaptive access to unparalleled levels of computing. With the infrastructure provided by the Grid, scientists are able to perform complex tasks, integrate their work and collaborate remotely. Grids can lead to savings in processing time. Efficient, effective, and economic utilization of available resources. Increased availability and reliability of resources. Shared access (by multiple users) to large amounts of data. Improved methods for collaborative work. Unprecedented Price-to-Performance ratio. Table 8: Benefits of using Grid computation Description
Grid resources should be available to multiple administrative domains those can be access from anywhere any time. Resources, tools and services of Grid are diverse and heterogeneous in nature and involve a enormous range of technologies. The Grid system is propagating from a few integrated and geographically located resources to millions that must manage and tolerate latency and bandwidth. It is important to dynamically manage and reduce the failure rate of resources sharing and distribution in Grid. Resources managers and applications must use the available resources and services efficiently and effectively. The aim is to assign users, resources and organizations from different domains to VO. Grid affords to large amount of resources. Possible to specify equipment, software, and other services. Scalability technique can be supported I terms of additional quantity of such resources. Access People around the world must utilize the computer resources in effective and efficient way. Grid computing resources should not be turning off in any circumstances, even if processors fail in some places. A Grid application is a combination of many micro jobs which are executing on different processors. This can be achieved by scheduling Grid jobs on processors with low utilization IT Grid middleware provides a uniform method to heterogeneous the whole system

Scopes of Benefits
Multiple administrative domains Heterogeneity Scalability Dynamicity or adaptability

Virtual Organization (VO) Access to excess of plethora of IT resources Better utilization of underused resources. Increase reliability of computing Parallelization of processing Resource balancing Simplified resources management of

4. Grid Topology, Protocols and Activities towards Designing Grid Architecture

Model
4.1 Grid Topology Grid topology is divided into three types and are illustrated in Table 9. Table 9: Grid Topology
Topology Intragrid Extragrid Intergrid Description Local Grid within an organization. Trust based on personal contacts. Resources of a group of organizations. Connected by virtual private networks. Trust based on Business-to-business contacts. Global sharing of resources through the Internet. Trust based on certification.

4.2 Grid layers For the organization, establishment and management of resources among Grid environment a novel model and technology has been developed which is called as Grid architecture. Grid architecture consists of five important layers namely: fabric, connectivity, resource, collective, and application layers. It defines the purpose and functions of its components and common mechanisms, interfaces, schema, and protocols at each layer. In this environment users and resources can settle and negotiate, find and establish, manage and control and share resources [41]-[46]. Each layer shares the behavior of the underlying component layers. Table 10 describes the core features of each of these

Volume 2, Issue 5, May 2013

Page 290

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
component layers, starting from the bottom of the stack and moving upward. Figure 2 shows the component layers of the grid architecture and the interface between the layers. 1. Fabric Layer. This is the lowest layer and includes the physical devices or resources (e.g.). 2. Connectivity Layer. This layer above the fabric layer includes the communication and authentication protocols required for Grid network transactions (e.g. exchange of data between resources and verification of the identity of users and resources). 3. Resource Layer. This layer contains connectivity protocols to enable the secure initiation, resource monitoring, and control of resource-sharing operations. 4. Collective Layer. This layer above the Resource Layer contains protocols, services, and APIs (Application Programming Interface) to implement transactions among resources (e.g. resource discovery, and job scheduling). 5. User Application layer. This is the highest layer and calls on all other layers for resources. Table 10: Five fundamental layers of Grid architecture
Grid Layers Fabric Layer Generality This is the lowest and basic layer. It defines the interface to local resources like computers, storage systems, networks, sensors and instruments, software modules and systems resources, which may be shared. This layer situated above the fabric layer. It determines the basic communication and authentication protocols required for grid-specific networking-service transactions like exchange of data between resources and verification of the identity of users and resources. Protocols defined by connective layer are using in this layer. Communication and security protocols are used to control secure negotiation, initiation, monitoring, accounting, and payment for the sharing of functions of individual resources. Using the protocols of resource layer and connectivity layer, the collective layer implements a variety of sharing behaviors and also responsible for global resource management and interaction. Functionality Extended Level Storage systems, clusters, networks, network caches, code repositories, catalog, scheduler

Connectivity Layer

Communication, service discovery (DNS) authentication, authorization, delegation

Core Middleware

Resource Layer

Access to data, access to computers and computations, access to network performance data, access to information about system structure, and state.

Core Middleware

Collective Layer

Application Specific Coherency control, replica selection, task management, virtual data catalog, virtual data code catalog, dynamic check point, job management, failover, staging, distributed data archive, solver coupler. Generic Replica catalog, replica management, co-allocation, certificate authorities, metadata catalogs, resource discovery, brokering, certificate authorities and revocation, community authorization, system monitoring.

User Level Middleware

Application layer

Using various collaboration and resources access protocols this layer enables the use of resources in a Grid environment.

High Throughput Computing System, Discipline-Specific Data Grid Application

Volume 2, Issue 5, May 2013

Page 291

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847

Figure 2: The fundamental layers of Grid architecture with its relationship to Internet protocol and its major components 4.3 Grid requirements Major requirements of Grid architectural are as illustrated bellow: Identity & authentication Authorization & policy Resource discovery Resource characterization Resource allocation (Co-) reservation, workflow Distributed algorithms Remote data access High-speed data transfer Performance guarantees Monitoring Adaptation Intrusion detection Resource management Accounting & payment Fault management System evolution 4.4 Grid protocols The objective of Grid or sometime as global Grid is supporting various heterogeneous resources and delivering worldwide computing with presenting different types of services. It requires generic interfaces for resources and services, respectively. Global Grid must supports its basic services like Grid information service (GIS), Grid monitoring service (GMS), Grid scheduling (GS) and brokering service (GBS), Grid security services (GSS), and Grid data transfer service (GDTS) [47] [48] [49]. In this paper, the author discussed the basic and important protocols related to these basic/generic services. Grid protocols are classified into five main classes, they are Grid network communication and Grid data transfer protocols, Grid information security protocols, Grid resource information protocols, Grid management protocols, and Grid interface protocols, The full classification of these protocols is shown in Table 11. Table 11: This is the classification of core Grid protocols for Global Grid Computing. The classification is based on kind of activities handled by the protocols.
Grid Protocol Classification Grid management protocols Example Grid monitoring protocol Job submission protocol Job management protocol Advance reservation and co-allocation protocol SLA negotiation protocol Resource registration protocol Resource discovery protocol Resource inquiry protocol

Grid resource information protocols

Volume 2, Issue 5, May 2013

Page 292

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Grid interface protocols Basic protocol for web service based interaction Protocols for web service security Protocols for Stateful web service includes: Web services resource framework protocol Web services addressing protocol Web services notification protocol Secure communication protocol User proxy creation protocol Global subject to local subject mapping Registration protocol Authentication protocol Single sign-on protocol Delegation protocol Grid network communication protocol Grid Data transfer protocol

Grid information security protocols

Grid network communication protocols and Data transfer protocols

The name of protocols related to the Grid layers [47] [48] [49] is shown in Table 12. The dependency is like this: the application layer protocols depend on the collective layer protocols, the collective layer protocols depend on resource layer protocols, resource layer protocols depend on connectivity layer protocols, and finally the connectivity layer protocols depend on the fabric layer protocols.

Table 12: List of supporting protocols related to Grid layers.


Grid Layers Fabric Layer Connectivity Layer Supporting Protocols NFS storage access protocol Cluster resource management systems process management protocol Secure communication protocol User proxy creation protocol Global subject to local subject mapping registration protocol Authentication protocol Single Sign-On (SSO) protocol Delegation protocol Grid communication protocol Resource registration protocol Resource inquiry protocol Grid monitoring protocol Grid Data transfer protocol Advance Reservation protocol Job submission protocol Job and resources management protocol Information protocol Resource discovery protocol Co-allocation protocol SLA negotiation protocol Grid monitoring protocol Application layer based transport protocol TCP based Data Transfer Protocol UDP based Data Transfer Protocol (UDT) Reliable transfer protocol-SABUL Group Transport Protocol (GTP) Common data transfer protocol (GridFTP) User interface protocol (GridCopy)

Resource Layer

Collective Layer

Application Layer

5. Areas of Grid Evolution, Grid Projects and Tools


5.1 Areas of Grid evolution To explore Grids application worldwide to everybody Grid has evolved two dimensions related to semantic web and these are architectural evolution and scope evolution as presented in Table 13.

Volume 2, Issue 5, May 2013

Page 293

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Table 13: Areas of Grid evolution
Areas Evolution Architectural evolution of Types of Grid The service based Grid: Data and Computational Grid Description In this evolutionary area of Grid, the fabric layer supplies unprocessed or raw computing power, high speed bandwidth and associated data storage in a protected, secure and auditable way that will help to allocate, schedule, execute of computational resources and the way that data will be shipped between processing resources will also be handled here. Metacomputer or virtual computer technique is used to present various and diverse resources. To represent, store, access, share and maintain information, the information grid provides homogeneous access to heterogeneous distributed information. Knowledge based methodologies and technologies are used in knowledge grid that will help responding to advanced questions and finding the appropriate processes to deliver answers in the required form.

Scope evolution

The information and knowledge based Grid

5.2 Grid projects and Tools Grid project are highly devoted to the following conditions [50] and mainly classified into two categories: international Grid projects and national Grid projects as presented in Table 14. Some of the available Grid projects and their main objectives are presented in Table 15 [51] [52] [53] [54]. Available services and application oriented Grid tools are presented in Table 16. Secure and close partnership and collaboration with real Grid projects in engineering, technology, academic, science and industry. Building, enlargement and endorsement of typical Grid protocols to facilitate heterogeneity, interoperability and shared infrastructure Building, enlargement and endorsement of typical Grid software APIs and SDKs to facilitate portability and code sharing Table 14: Categories of Grid Projects and Examples Categories of Grid Projects Examples
International Grid Projects Open middleware infrastructure instate Europe (OMII-Europe), May 2006-May 2008 Enabling Grids for e-Business (RGEE) , March 2004-March 2006 Enabling Grids for E-sciencE II(EGEE II), April 2006-April 2008 BREIN, September 2007-August 2009 DataTAG, January 2001- January 2003 BalticGrid, November 2005- April 2008 European DataGrid (EDG), March 2001-March 2004 D-Grid (German) GARUDA (Indian) Grid computing project at VECC (Calcutta, India) China Grid project INFN Grid (Italy) Knowledge Grid Malaysia NAREGI project Singapore National Grid Project Thai National Grid Project BELNET Grid, Belgium

National Grid Projects

Table 15: Available Grid projects and their Objectives


Available Grid Projects Access Grid [55] IBM IntraGrid [56] BlueGrid [57] Objectives Construct and organize collective collaboration systems using service oriented technologies Grid testbed linking IBM laboratories Computer and technical needs including Installations, Repairs, Computer Virus problems, Upgrades, Training and Explaining, Web site development solutions, convert your Video and Audio Memories, web site hosting solution

Volume 2, Issue 5, May 2013

Page 294

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
DISCOM [58] DOE Science Grid [59] Earth System Grid (ESG) [60] It developed an operational Grid that provides access to resources at three U.S. DOE weapons laboratories It developed an operational Grid that provides access to resources and applications at U.S. DOE science laboratories and partner universities Delivery and analysis of large climate model datasets for the climate research community Developed for applications in high energy physics, environmental science, bioinformatics Developed technology for remote access to supercomputer resources and simulation codes Create a national computational collaborator for fusion research Research on Grid technologies; development and support of Globus Toolkit; application and deployment Developed and supported for Grid technologies and applications Developed and applied for operational Grid. Applied for particle physics research within UK. Integration, deployment, support of the NSF middleware infrastructure for research and education Research into program development technologies for Grid applications Created technology R&D for data analysis in physics experiments: ATLAS, CMS, LIGO, SDSS Developed production Grid for application in aero-sciences and other NASA missions Build international data Grid enabling large-scale experimentation on Grid technologies and applications Developed production Grid for application in earthquake engineering Built production Grids for application in data analysis in high energy and nuclear physics experiments U.S. science infrastructure linking four major resource sites at 40 Gb/s Support center for Grid projects within the U.K. Technologies developed and applied for remote access to supercomputers Observatory Developed production Grids for application in data analysis in astronomy

European Union (EU) DataGrid [61] EuroGrid, Grid Interoperability (GRIP) [62] Fusion Collaboratory [63] Globus Project [64] GridLab [65] GridPP [66] Grid Research Integration Dev. & Support Center [67] Grid Application Development Software [68] Grid Physics Network [69] Information Power Grid [70] International Virtual Data Grid Laboratory [71] Network for Earthquake Eng. Simulation Grid [72] Particle Physics Data Grid [73] TeraGrid [74] UK Grid Support Center [75] Unicore [76] National Virtual (NVO) [77] Reality Grid [78]

Enable the realistic modeling of complex condensed matter systems at the molecular and mesocale levels Aircraft Environment Created a Grid-based distributed diagnostics system for application in aircraft engines

Distributed Maintenance (DAME) [79]

Grid Resources for Industrial Applications (GRIA) [80] CrossGrid [81]

Focused on outsourcing of computational services for structural analysis and digital film post production Applied for interactive computer and data intensive applications.

Volume 2, Issue 5, May 2013

Page 295

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Table 16: Available Grid Tools
Grid Tools Simulation tools: Examples GridSim job scheduling SimGrid single client multiserver scheduling Bricks scheduling GangSim- Ganglia VO OptoSim Data Grid Simulations G3S Grid Security services Simulator security services Advanced Resource Connector (NorduGrid's ARC) Berkeley Open Infrastructure for Network Computing (BOINC) Globus Toolkit Sun Grid Engine ProActive UNICORE SDSC Storage resource broker (data grid) gLite (EGEE) NInf GridRPC IceGrid Java CoG Kit Alchemi GridGain gridGISTICS Gridbus Middleware Vishwa UGP GRIA Open Grid Forum (Formerly Global Grid Forum) Object Management Group Enabling Grids for E-sciencE NorduGrid Open Science Grid OurGrid Sun Grid Xgrid Distributed European Infrastructure for Supercomputing Applications DEISA FusionGrid

Software implementations and middleware:

Alliances and organizations Grid Production grids

6. Grid Scheduling Process


Grid scheduling is the procedure of building scheduling resolution involving resources over multiple organizational domains. A job in Grid scheduling is anything that needs a resource and a resource is something that can be scheduled like disk space. A job may use multiple resources at multiple sites [82]. A common Grid scheduling architecture is proposed by Y. Zhu in [83]. User's and applications aims and objectives including cost, response-time and additional objectives represented by the resource providers including throughput, profit must be optimized by scheduling system. Resources must be managed efficiently, but no control over local resources, distributed and information about the systems are limited or dated. Scheduling system must consider the parameters [82] depicted in Table 17. Table 17: Scheduling Parameters
Parameter Name Issues Resource Capability (clock speed, memory size) Resource State (such as CPU load, memory available, disk storage free) Resource Requirements of an Application Access Speed (such as disk access speed) Free or Available Nodes Priority (that the user has) Queue Type and Length Network Bandwidth, Load, and Latency (if jobs need to communicate) Reliability of Resource and Connection Application Deadline User Capacity/Willingness to Pay for Resource Usage

Parameter 1

Parameter 2

Volume 2, Issue 5, May 2013

Page 296

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Resource Cost (in terms of dollars that the user need to pay to the resource owner) Resource Cost Variation in terms of Time-scale (like high @ daytime and low @ night) Historical Information, including Job Consumption Rate Computational economy: Resource Cost (set by its owner) Price (that the user is willing to pay) Deadline (the period by which an application execution need to completed)

Parameter 3

6.1 Phases of Grid scheduling process A generalized scheduling process in the Grid system having three stages or phases namely resource discovering and filtering, resource selecting and scheduling according to certain objectives, and job submission or execution are presented in Table 18 [82] [84]. Table 18: Phases of Grid Scheduling Process
Phases Resource Discovery Elements Authorization filtering: list of accessible resources Application requirement definition: RAM, connectivity, space, etc... Minimal requirement filtering: remove unmatched Dynamic information gathering: more detailed dynamic resource information System selection: Common: Condor Matchmaker/Class Ad contains an expression Advance reservation: (no) expiration or cost Job submission: application submitted to resources Preparation tasks: Setup, staging, etc. Monitoring progress: repetitive query, reschedule Job completion: user notification (email) Cleanup tasks: retrieve data, remove settings/data

Resource selecting

job submission or execution

6.2 Abilities of Grid Scheduling Architecture Interact with local resource managers; It interacts between interact with external services like information, forecasting, submission, security or execution services that cant be defined by Grid scheduling architecture. It calculates a schedule and gives a scheduling decision upon receiving scheduling problems. It divides a scheduling problem into sub-problems, receive scheduling decisions and again combine them into a new one To take decision and solve the problem it forwards problems to other scheduling instances.

7. Research Challenges and Future Work


In order to provide at least a plausible view of the future Grid it is very important to overcome challenges before establishing it. The future Grid should follow and contain all the properties that help the Grid to be used easily, efficiently, effectively, and flexibly, we must be concrete at the point of expectation from Grid [86]. In Table 19 authors summarized the nature of the challenges that should followed and overcome towards developing an admirable architecture of Grid environment. Table 19: Major research challenges towards developing Grid computing systems
Research Domain The Nature of Applications Programming Tools Models and Challenges Research is obligatory in meta computing, technology, scientific and engineering spheres including business, art, and entertainment. Analyzing and developing advanced algorithms for complex grid architectures, technology for translating user performance requirements into system resource requirements, technology to create grid-enabled software components, providing tools that allow programmers to understand and explain program behavior and performance. The systems must be easy, simple and place minimal loads on local sites. In order to provide a range of potentially sophisticated services it needs to achieve a wide variety of complex, performance-sensitive applications including scalability and evolution to future systems and services, it demands for new software architecture which is not appear in existing distributed architecture. Research is required for latency-tolerant and fault-tolerant solution strategies, along with highly concurrent and speculative execution techniques. It needs to provide improved methods for specifying application-level requirements, for translating these requirements into computational resources and network-level quality-ofservice parameters (heterogeneous, shared networks), and for arbitrating between

System Architecture

Algorithms and ProblemSolving Methods Resource Management

Volume 2, Issue 5, May 2013

Page 297

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
Security Instrumentation Performance Analysis End Systems and convicting demands. Novel network security infrastructure required for resource sharing. For users it may require assurance and licensing mechanisms that can provide guarantees. Noteworthy progresses in instrumentation, measurement, and analysis are required in order to reduce performance problems in the complex environments of future grids to appropriate application and system characteristics. Significant research are needed for end systems that are relatively small and are connected to networks by interfaces and with operating system mechanisms originally developed for reading and writing slow disks. Significant researches and implications are needed for both low and high bandwidths, future network protocols, and stringent performance assurances. Meeting these requirements will require major advances in the technologies used to transport, switch, route, and manage network flows.

Network Protocols Infrastructure

and

8. Conclusion
Grid computing has serious social consequences on users and societies. However it is very difficult to predict the future in a field such as information technology and computer science, the technological advances are moving forward very hastily. There are presently a hefty number of projects and a diverse range of new and emerging Grid developmental approaches being pursued. These systems vary from Grid frameworks to application test beds, and from collaborative environments to batch submission mechanisms. But, it is not straightforward and easy tasks to anticipate what approach will dominant the future Grid environment. This research paper is the great foundation and resource of Grid environment and architecture towards developing a worthy future Grid. Researchers in this research paper have provided extensive ideas which will be encouraging towards developing an adaptable, malleable, extensible, dependable, consistent, pervasive, secured, and inexpensive Grid technology.

References
[1.] Foster I, Kesselman C (eds.). The Grid: Blueprint for a Future Computing Infrastructure. Morgan Kaufmann: San Francisco, CA, 1999. [2.] Oram A (ed.). Peer-to-Peer: Harnessing the Power of Disruptive Technologies. OReilly Press: U.S.A., 2001. [3.] Foster I, Kesselman C, Tuecke S. The anatomy of the Grid: Enabling scalable virtual organizations. International Journal of Supercomputer Applications 2001. To appear. [4.] Buyya R, Stockinger H, Giddy J, Abramson D. Economic models for management of resources in peer-to-peer and Grid computing. SPIE International Conference on Commercial Applications for High-Performance Computing, Denver, CO, August 2024, 2001. [5.] Camarinha-Matos L, Afsarmanesh H (eds.). Infrastructure For Virtual Enterprises: Networking Industrial Enterprises. Kluwer Academic Press: Norwell, MA, 1999. [6.] M. Baker, R. Buyya and D. Laforenza, Grids and Grid Technologies for Wide-area Distributed Computing, in J. of Software-Practice & Experience, Vol. 32, No.15, pp: 1437-1466, December 2002. [7.] PITAC. Information technology: Transforming our society. Information Technology Advisory Committee Interim Report to the U.S. President, August 1998. http://www.ccic.gov/ac/report/section 1.html. [8.] Darema F. Next generation software research directions. http://www.cise.nsf.gov/eia/NGS-slides/sld001.htm. [9.] Johnston W, Gannon D, Nitzberg B. Grids as production computing environments: The engineering aspects of NASAs information power grid. Eighth IEEE International Symposium on High Performance Distributed Computing, Redondo Beach, CA, August 1999. IEEE Computer Society Press: Los Alamitos, CA, 1999. [10.] Buyya R. The World-Wide Grid. http://www.buyya.com/ecogrid/wwg/. [11.] NSF Tera-Grid. http://www.teraGrid.org/. [12.] Hoschek W, Jaen-Martinez J, Samar A, Stockinger H, Stockinger K. Data management in an international data Grid project. Proceedings of the 1st IEEE/ACM International Workshop on Grid Computing (Grid2000), Bangalore, India, 1720 December 2000. Springer: Berlin, 2000. [13.] Buyya R. The Virtual Laboratory Project: Molecular modeling for drug design on Grid. IEEE Distributed Systems Online 2001; 2(5). http://www.buyya.com/vlab/. [14.] W3C. Web services activity. http://www.w3.org/2002/ws/. [15.] Casanova H, Dongarra J. NetSolve: A network server for solving computational science problems. International Journal of Supercomputing Applications and High Performance Computing 1997; 11(3). [16.] Leinberger W, Kumar V. Information Power Grid: The new frontier in parallel computing? IEEE Concurrency 1999; 7(4). [17.] Brown MD et al. The International Grid (iGrid): Empowering global research community networking using high performance international Internet services. http://www.globus.org/research/papers.html [April 1999].

Volume 2, Issue 5, May 2013

Page 298

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
[18.] SETI@Home. http://setiathome.ssl.berkeley.edu/. [19.] Distributed.Net. http://www.distributed.net/. [20.] Abramson D, Giddy J, Kotler L. High performance parametric modeling with Nimrod/G: Killer application for the global Grid? International Parallel and Distributed Processing Symposium (IPDPS). IEEE Computer Society Press: Los Alamitos, CA, 2000. [21.] Buyya R. The Virtual Laboratory Project: Molecular modeling for drug design on Grid. IEEE Distributed Systems Online 2001; 2(5). http://www.buyya.com/vlab/. [22.] Buyya R, Branson K, Giddy J, Abramson D. The Virtual Laboratory: A toolset to enable distributed molecular modelling for drug design on the World-Wide Grid. Concurrency and Computation: Practice and Experience 2002. [23.] Smallen S et al. Combining workstations and supercomputers to support Grid applications: The parallel tomography experience. The 9th Heterogenous Computing Workshop (HCW 2000, IPDPS), Cancun, Mexico, April 2000. [24.] Holtman K. CMS DataGrid system overview and requirements. The Compact Muon Solenoid (CMS) Experiment Note 2001/037, CERN, Switzerland, 2001. [25.] Allcock B, Foster I, Nefedova V, Chervenak A, Deelman E, Kesselman C, Lee J, Sim A, Shoshani A, Drach B,Williams D. High-performance remote access to climate simulation data: A challenge problem for data Grid technologies. Proceedings of SC2001 Conference, Denver, CO, November 2001. [26.] Date S, Buyya R. Economic and on demand brain activity analysis on the Grid. The 2nd Pacific Rim Application and Grid Middleware Assembly Workshop, Seoul, Korea, July 2002. [27.] Gannon D. Component architectures for high performance, distributed meta computing. http://www.objs.com/workshops/ws9801/papers/paper086.html. [28.] NASAs Numerical Propulsion System Simulation (NPSS). http://cict.grc.nasa.gov/npssintro.shtml. [29.] Perez C, Priol T. JACO3: A Grid environment that supports the execution of coupled numerical simulation. http://www.ercim.org/publication/Ercim/ News/enw45/priol.html. [30.] Fox GC, Williams RD, Messina PC. Parallel Computing Works. Morgan Kaufmann, 1994. [31.] Foster I, Zang T. Building multidisciplinary applications. http://www-fp.mcs.anl.gov/hpcc/section2.6.5.html. [32.] Fujimoto RM. Parallel and Distributed Simulation Systems. Wiley, 2000; 300. [33.] Laforenza D. Programming high performance applications in Grid environments. Invited Talk at EuroPVM/MPI Conference, Greece, April 2002. http://www.gridforum.org/7 APM/aps.htm. [34.] Foster I, Kesselman C (eds.). The Grid: Blueprint for a Future Computing Infrastructure. Morgan Kaufmann: San Francisco, CA, 1999. [35.] Buyya R, Giddy J, Abramson D. A case for economy grid architecture for service-oriented Grid computing. 10th IEEE International Heterogeneous Computing Workshop (HCW 2001), in Conjunction with IPDPS 2001, San Francisco, CA, April 2001. IEEE Computer Society Press: Los Alamitos, CA, 2001. [36.] The Grid: International Efforts in Global Computing, Mark Baker, Rajkumar Buyya and Domenico Laforenza [37.] Buyya R, Abramson D, Giddy J. Economy driven resource management architecture for computational power grids. International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA2000), Las Vegas, NV, 2000. CSREA Press: Athens, GA, 2000. [38.] Buyya R. Economic-based distributed resource management and scheduling for Grid computing. PhD Thesis, Monash University, Melbourne, Australia, April 2002. [39.] Reinefed Alexander , Position Paper, www.egrid.org/ec_initiatives/reinefeld.html [40.] www.entropia.com/pdf/DCGridBrouchre0402.pdf [41.] Realizing the Information Future: The Internet and Beyond. National Academy Press, 1994. http://www.nap.edu/readingroom/books/rtif/. [42.] Beiriger, J., Johnson, W., Bivens, H., Humphreys, S. and Rhea, R., Constructing the ASCI Grid. In Proc. 9th IEEE Symposium on High Performance Distributed Computing, 2000, IEEE Press. [43.] Foster, I. and Kesselman, C. The Globus Project: A Status Report. In Proc. Heterogeneous Computing Workshop, IEEE Press, 1998, 4-18. [44.] Foster, I., Kesselman, C. and Tuecke, S. The Globus Toolkit and Grid Architecture, In preparation, 2001. [45.] Johnston, W.E., Gannon, D. and Nitzberg, B., Grids as Production Computing Environments: The Engineering Aspects of NASA's Information Power Grid. In Proc. 8th IEEE Symposium on High Performance Distributed Computing, 1999, IEEE Press. [46.] Stevens, R., Woodward, P., DeFanti, T. and Catlett, C. From the I-WAY to the National Technology Grid. Communications of the ACM, 40(11):50-61. 1997. [47.] Harshad B. Prajapati and Vipul K. Dabhi, Classification and Characterization of Core Grid Protocols for Global Grid Computing [48.] K. Nadiminti and R. Buyya, Global Grid computing: Where are we today? Enterprise Open Source Journal, vol. 2, pp. 2629, November/December 2006 [49.] B. E. White, Layered Communications Architecture for the Global Grid,

Volume 2, Issue 5, May 2013

Page 299

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 5, May 2013 ISSN 2319 - 4847
[50.] http://www.cs.umd.edu/class/fall2002/cmsc818s/Lectures/globus-short.pdf [51.] www.gridforum.org [52.] Wolfgang Gentzsch, Major Grid Projects Around the World, http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1639651 [53.] www.globus.org [54.] http://www.mcs.anl.gov/~itf/grid-projects/ [55.] www.mcs.anl.gov/FL/accessgrid; DOE, NSF [56.] http://intragrid.webahead.ibm.com/ [57.] http://www.thebluegrid.com/ [58.] www.cs.sandia.gov/discom DOE Defense Programs [59.] www.sciencegrid.org DOE Office of Science [60.] www.earthsystemgrid.org DOE Office of Science [61.] www.eu-datagrid.org European Union [62.] www.eurogrid.org European Union [63.] www.fusiongrid.org DOE Off. Science [64.] www.globus.org DARPA, DOE, NSF, NASA, Msoft [65.] www.gridlab.org European Union [66.] www.gridpp.ac.uk U.K. eScience [67.] www.grids-center.org NSF [68.] www.hipersoft.rice.edu/grads; NSF [69.] www.griphyn.org NSF [70.] www.ipg.nasa.gov NASA [71.] www.ivdgl.org NSF [72.] www.neesgrid.org NSF [73.] www.ppdg.net DOE Science [74.] www.teragrid.org NSF [75.] www.grid-support.ac.uk U.K. eScience [76.] http://www.unicore.eu BMBFT [77.] www.srl.caltech.edu/nvo [78.] http://www.realitygrid.org U.K. eScience Program [79.] http://www.cs.york.ac.uk/dame [80.] http://www.gria.org EU [81.] www.crossgrid.org EU [82.] Xiwen Cheng, Seminar Grid Computing: Grid Scheduling, http://gc.liacs.nl/2010/GridScheduling_Xiwen.pdf [83.] Y. Zhu, A Survey on Grid Scheduling Systems, Department of Computer Science, Hong Kong University of science and Technology, 2003. [84.] J. Schopf, Ten Actions When SuperScheduling, document of Scheduling Working Group, Global Grid Forum, http://www.ggf.org/documents/GFD.4.pdf, July 2001. [85.] Joshua Paul Winraj. A, S. Sathyalakshmi, Support Vector Machine Based Fault Tolerant Grid, International Journal of Engineering Research & Technology (IJERT), Vol. 2 Issue 4, April 2013, ISSN: 2278-0181 [86.] Ian Foster, Carl Kesselman, Computational Grids, http://www.globus.org/alliance/publications/papers/chapter2.pdf

AUTHOR
Shah Murtaza Rashid Al Masud is a lecturer at the faculty of Computer Science and Information Systems, Najran University, Najran, KSA. He received his M.Sc and B.Sc in Computer Engineering in the specialization of Computer intellect systems and networks in 2000, and 2001 respectively from Khrakov State University of Radio Electronics, Kharkov, Ukraine. His current research interest include parallel and distributed systems, GIS, GRID, Cloud computing, wireless BAN, expert systems, fuzzy logic, physical computation and thermodynamics, reversible logic, , and quantum computation. He has also published papers in accredited national and international journals and conference proceedings. Besides that, he also serves as a reviewer for various conferences and journals. Currently he is the member in various academic and scientific organizations.

Volume 2, Issue 5, May 2013

Page 300

You might also like