ISSN 2394-3777 (Print)
ISSN 2394-3785 (Online)
Available online at www.ijartet.com
International Journal of Advanced Research Trends in Engineering and Technology (IJARTET)
Vol. 4, Issue 9, September 2017
All Rights Reserved © 2017 IJARTET 29
Ranking Algorithm Based on File’s Accessing
Frequency for Cloud Storage System
Mrs. S.Annal Ezhil Selvi
1
, Dr. R. Anbuselvi
2
Assistant Professor in Computer Science, Bishop Heber College, Trichy, TamilNadu, India.
1
Associate Professor in Computer Science, Bishop Heber College, Trichy, TamilNadu, India.
2
Abstract: The number of cloud storage users has improved abundantly at recent times. The reason behind is, the Cloud Storage
system minimizes the burden of maintenance and it has less storage cost compare with other storage methods. It provides high
availability, reliability and it is most suitable for high volume of data storage. In order to provide high availability and
reliability, the systems introduce redundancy. In replicated system, the cloud storage services are hosted by multiple
geographically distributed data centres. But the file Replication is rendering little bit threat about the Cloud Storage System for
the users and for the providers it is a big challenge to offer efficient Data Storage. Since the increasingly expanded utility of
Cloud storage, the improvement of resources management in the shortest time to respond to the user’s requests and
the geographical constraints are of prime importance to both the Cloud service providers and the users. The data replication
helps in attractive the data availability which reduces the overall access time of the files, but at the same time it occupies more
storage space and storage cost. In order to rectify the above mentioned problems, need to identify the popularity of the file. So
this paper proposed new ranking algorithm which lists the most often accessed files and less frequently accessed files. In future
the least accessed a file’s replications going to be reduced likewise most accessed file’s replications going to be increased based
on their SLA.
Keywords: Cloud Computing, Data Replication, Popularity Degree, Distribution Networks Cloud Storage, Storage Cost.
I. INTRODUCTION
Cloud computing is a system which works “on
demand” or “Pay per Use” concept. In cloud computing all
the computational resources (like storage, data) are shared
among the users [1, 2, 3 and 4].Service Level Agreement
(SLA) is connecting the user and the service provider. This
agreement defines QoS parameters (like availability,
Reliability, Scalability and cost etc.).
Cloud storage is a representation of data storage in
which the digital records are stored in logical collection. The
physical storage data stored on multiple servers (and frequent
locations), manage by a hosting company [5]. These cloud
storage sources are responsible for assuring the records
available and accessible. Peoples and organizations buy or let
storage capacity beginning the providers to store user. Today,
popular Internet companies, such as Google, Yahoo, and
Microsoft offers more services for millions of users every day.
These services are hosted in datacentres that contain
thousands of servers, as well as power delivery (and backup)
and networking infrastructures. Because users demand high
availability and low response times, each service is mirrored
by multiple datacentres that are geographically distributed [6].
Each datacentre is supposed to serve the requests of the users
that are closest (in terms of network latency) to them. If this
datacentre becomes unavailable or unreachable, these requests
are forwarded to a mirror datacentre.
A Cloud storage data replication service is a
managed service in which stored or archived records is
duplicated in real time over a storage area network. Further
terms for this type of service consist of file replication, data
replication, and remote storage replication. The appearance
can also refer to a program or grouping that facilitates such
duplication. Cloud Storage replication services provide an
extra determine of redundancy that can be invaluable if the
main storage backup system fails. The instant of that the cloud
user can access to the replicated data to minimize downtime
and its associated costs [7]. The services, if accurately
implement, can clustering based make more efficient disaster
recovery process by generating a replica copy of the entire
backed-up files on a continuous basis [8].
In cloud inconvenience are normal slightly
incomparable, so high availability; high performance and
high fault tolerance are important factors to be considered.
Concept of replication is used in order to get high
availability, high performance and high fault tolerance [10].
Replication is the method to store multiple copies of a data
files at data centers for performance and availability reasons.
Seeing that cloud is on demand model, therefore the user will
pay for using the cloud storage. User will prefer that service
provider who will guarantee their maximum demand about
the data storage. As the result, replication is used to reach
highest availability [1 to 4, 9 and 10]. But at the same time it
is not needed that the benefits accrued from the replication