International Journal of Applied Information Systems (IJAIS) – ISSN : 2249-0868 Foundation of Computer Science FCS, New York, USA Volume 4– No.9, December 2012 – www.ijais.org 21 Workflow Management in Cloud Computing Monika Bharti M.E. student Computer Science and Engineering Department Thapar University, Patiala Anju Bala Assistant Professor Computer Science and Engineering Department Thapar University, Patiala ABSTRACT Cloud computing is a paradigm that provides demand service resources like software, hardware, platform, and infrastructure. Under cloud environment, workflow is an emerging technique for future scalable applications. This paper discusses the various tools for generating workflow and these tools have been compared on the basis of operating system, databases, architecture and so on. The application on workflow is generated with Pegasus tool which can be further deployed on its compatible cloud platforms like Eucalyptus, Amazon EC2, Open Stack etc. General Terms Cloud Computing, Workflows. Keywords Cloud Computing,Pegasus,Workflows. 1. INTRODUCTION Cloud computing has recently emerged as a new paradigm for hosting and delivering services over the Internet. Cloud computing is attractive to business owners as it eliminates the requirement for users to plan ahead for provisioning, and allows enterprises to start from the small and increase resources only when there is a rise in service demand [1]. It provides web- based software, middleware and computing resources on demand. The research issues for cloud computing are security, load balancing, resource provisioning, energy efficiency, workflows and so on. This paper focus on workflows, which works behind cloud to manage resources, various clients, cost constraints. The concept of workflow is proposed by fixed work procedures with conformist activities. The tasks are divided into subtasks, roles, rules and processes to execute and observe the workflow, workflow system boost the level of production of organization and work efficiency. Various types of workflows are business workflow, abstract workflow, concrete workflow, scientific workflow and so on. Business workflow allows controlled flow of execution and simplifies workflow management. It provides support for security, reliability, transactions, and performance. Its performance can be increased by use of faster server. Its workflow lifecycle is design, deployment, execution, monitoring and finally refinement. Scientific workflow supports for large data flows and need to do parameterized execution of large number of jobs. It is also to monitor and control workflow execution including ad-hoc changes. The input given to workflow is written in languages like Java, Perl, Python and the output generated is the workflow. These workflows are managed and coordinated by workflow management system, which provides the end user with the required data and the appropriate application program for their tasks. It allocate tasks to end-user based only on the performance of constraints like control flow, data flow, transition conditions or pre- and post-conditions. The issues that arise with workflow and its management are workflow scheduling, fault tolerance, energy efficiency and so on. Workflow scheduling maps and manages the execution of inter-dependent tasks on the distributed resources.Fault tolerant is when a system’s service failure can be avoided when faults are present in the system. In this paper, workflow tools and implementation of workflows using Pegasus tool is discussed. Section 2 represents challenges for workflows in cloud computing. Section 3 shows detailed comparison between various workflow tools whereas Section 4 represents experimental results of the generated workflow using Pegasus tool. 2. RESEARCH CHALLENGES FOR WORKFLOWS IN CLOUD COMPUTING 2.1Security While running software and keeping data on virtual machine appears daunting to many. Well-known security issues such as data loss, phishing pose serious threats to organization's data and software [2]. 2.2DataLock-IN The customer cannot easily extort their data and programs from one site to run on other site. The solution is to standardize the API’s, so that the SaaS developer could deploy their services and data across multiple providers [3]. 2.3Reliability and Performance Performance and availability of the applications are important criteria defining the success of an enterprise’s business. However, the fact that organizations lose control over IT environment and important success metrics like performance and reliability. This are dependent on factors outside the control of the IT organizations makes it dangerous for some mission critical applications [4].