Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2016, Vol. 10 Issue (5) : 925-935    https://doi.org/10.1007/s11704-016-5182-6
RESEARCH ARTICLE
Adaptive data replication strategy in cloud computing for performance improvement
Najme MANSOURI()
Department of Computer Science, Shahid Bahonar University of Kerman, Kerman 7616914111, Iran
 Download: PDF(588 KB)  
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Cloud computing is becoming a very popular word in industry and is receiving a large amount of attention from the research community. Replica management is one of the most important issues in the cloud, which can offer fast data access time, high data availability and reliability. By keeping all replicas active, the replicas may enhance system task successful execution rate if the replicas and requests are reasonably distributed. However, appropriate replica placement in a large-scale, dynamically scalable and totally virtualized data centers is much more complicated. To provide cost-effective availability, minimize the response time of applications and make load balancing for cloud storage, a new replica placement is proposed. The replica placement is based on five important parameters: mean service time, failure probability, load variance, latency and storage usage. However, replication should be used wisely because the storage size of each site is limited. Thus, the site must keep only the important replicas.We also present a new replica replacement strategy based on the availability of the file, the last time the replica was requested, number of access, and size of replica. We evaluate our algorithm using the CloudSim simulator and find that it offers better performance in comparison with other algorithms in terms of mean response time, effective network usage, load balancing, replication frequency, and storage usage.

Keywords cloud computing      CloudSim      replica placement      replica replacement     
Corresponding Author(s): Najme MANSOURI   
Just Accepted Date: 24 November 2015   Online First Date: 18 July 2016    Issue Date: 07 September 2016
 Cite this article:   
Najme MANSOURI. Adaptive data replication strategy in cloud computing for performance improvement[J]. Front. Comput. Sci., 2016, 10(5): 925-935.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-016-5182-6
https://academic.hep.com.cn/fcs/EN/Y2016/V10/I5/925
1 Mi H B, Wang H M, Zhou Y F, Rung-Tsong Lyu M, Cai H, Yin G. An online service-oriented performance profiling tool for cloud computing systems. Frontiers of Computer Science, 2013, 7(3): 431–445
https://doi.org/10.1007/s11704-013-2193-4
2 Fu X, Zhou C. Virtual machine selection and placement for dynamic consolidation in Cloud computing environment. Frontiers of Computer Science, 2015, 9(2): 322–330
https://doi.org/10.1007/s11704-015-4286-8
3 Chen T, Bahsoon R, Tawil A R. Scalable service-oriented replication with flexible consistency guarantee in the cloud. Information Sciences, 2014, 264: 349–370
https://doi.org/10.1016/j.ins.2013.11.024
4 Wu H, Zhang W B, Zhang J H, Wei J, Huang T. A benefit-aware on-demand provisioning approach for multi-tier applications in cloud computing. Frontiers of Computer Science, 2013, 7(4): 459–474
https://doi.org/10.1007/s11704-013-2201-8
5 Al-Fares M, Loukissas A, Vahdat A. A scalable, commodity data center network architecture. Computer Communication Review, 2008, 38: 63–74
https://doi.org/10.1145/1402946.1402967
6 Amazon-S3.Amazon simple storage service (Amazon s3). , 2009
7 Ghemawat S, Gobioff H, Leung S. The Google file system. In: Proceedings of the 19th ACM Symposium on Operating Systems Principles. 2003
https://doi.org/10.1145/945445.945450
8 Calheiros R N, Ranjan R, Beloglazov A, Rose C, Buyya R. CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software: Practice and Experience, 2011, 41(1): 23–50
https://doi.org/10.1002/spe.995
9 Qiu L L, Padmanabhan V N, Voelker G M. On the placement of Web server replicas. In: Proceedings of the 20th Annual Joint Conference of the IEEE Computer and Communications Societies. 2001, 1587–1596
10 Aazami A, Ghandeharizadeh S, Helmi T. Near optimal number of replicas for continuous media in ad-hoc networks of wireless devices. In: Proceedings of the 10th International Workshop on Multimedia Information Systems. 2004
11 Intanagonwiwat C, Govindan R, Estrin D. Directed diffusion: a scalable and robust communication paradigm for sensor networks. In: Proceedings of the 6th Annual International Conference on Mobile Computing and Networking. 2000
https://doi.org/10.1145/345910.345920
12 Tang B, Das S R, Gupta H. Benefit-based data caching in ad hoc networks. IEEE Transactions on Mobile Computing, 2008, 7(3): 289–304
https://doi.org/10.1109/TMC.2007.70770
13 Jin S D, Wang L M. Content and service replication strategies in multihop wireless mesh networks. In: Proceedings of ACM International Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems. 2005
14 Dabrowski C. Reliability in grid computing systems. Concurrency Practice and Experience, 2009, 21(8): 927–959
https://doi.org/10.1002/cpe.1410
15 Bonvin N, Papaioannou T G, Aberer K. Dynamic cost-efficient replication in data clouds. In: Proceedings of the 1stWorkshop on Automated Control for Datacenters and Clouds. 2009, 49–56
https://doi.org/10.1145/1555271.1555283
16 Milani B A, Navimipour N J. A comprehensive review of the data replication techniques in the cloud environments: major trends and future directions. Journal of Network and Computer Applications, 2016, 64: 229–238
https://doi.org/10.1016/j.jnca.2016.02.005
17 Bonvin N, Papaioannou T G, Aberer K. A self-organized, fault tolerant and scalable replication scheme for cloud storage. In: Proceedings of the 1st ACM Symposium on Cloud Computing. 2010, 205–216
https://doi.org/10.1145/1807128.1807162
18 Nguyen T, Cutway A, Shi W. Differentiated replication strategy in data centers. In: Proceedings of the IFIP International Conference on Network and Parallel Computing. 2010, 277–288
https://doi.org/10.1007/978-3-642-15672-4_24
19 Ahmad N, Fauzi A C, Sidek R M, Zin N M, Beg A H. Lowest data replication storage of binary vote assignment data grid. In: Proceedings of the 2nd International Conference on Networked Digital Technologies. 2010, 466–473
https://doi.org/10.1007/978-3-642-14306-9_46
20 Bin L, Jiong Y, Hua S, Mei N. A QoS-aware dynamic data replica deletion strategy for distributed storage systems under cloud computing environments. In: Proceedings of the 2nd International Conference on Cloud and Green Computing. 2012, 219–225
21 Shvachko K, Hairong K, Radia S, Chansler R. The Hadoop distributed file system. In: Proceedings of the 26th Symposium on Mass Storage Systems and Technologies. 2010, 1–10
https://doi.org/10.1109/msst.2010.5496972
22 Rahman R M, Barker K, Alhajj R. Replica placement design with static optimality and dynamic maintainability. In: Proceedings of the 6th IEEE International Symposium on Cluster Computing and the Grid. 2006, 434–437
https://doi.org/10.1109/ccgrid.2006.85
23 Mansouri N, Dastghaibyfard G H. A dynamic replica management strategy in data grid. Journal of Network and Computer Applications, 2012, 35(4): 1297–1303
https://doi.org/10.1016/j.jnca.2012.01.014
24 Mansouri N, Dastghaibyfard G H. Enhanced dynamic hierarchical replication and weighted scheduling strategy in data grid. Journal of Parallel and Distributed Computing, 2013, 73(4): 534–543
https://doi.org/10.1016/j.jpdc.2013.01.002
25 Mansouri N. Network and data location aware approach for simultaneous job scheduling and data replication in large-scale data grid environments. Frontiers of Computer Science, 2014, 8(30): 391–408
https://doi.org/10.1007/s11704-014-3146-2
26 Dogan A. A study on performance of dynamic file replication algorithms for real-time file access in data grids. Future Generation Computer Systems, 2009, 25(8): 829–839
https://doi.org/10.1016/j.future.2009.02.002
27 Hussein M, Mousa M H. A light-weight data replication for cloud data centers environment. International Journal of Engineering and Innovative Technology, 2012, 1(6): 169–175
28 Rajalakshmi A, Vijayakumar D, Srinivasagan K G. An improved dynamic data replica selection and placement in cloud. In: Proceedings of the 2014 International Conference on Recent Trends in Information Technology. 2014, 1–6
https://doi.org/10.1109/ICRTIT.2014.6996180
29 Li B, Song S, Bezakova I, Cameron W. Energy-aware replica selection for data-intensive services in Cloud. In: Proceedings of the 20th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems. 2012, 504–506
https://doi.org/10.1109/mascots.2012.66
30 Barroso L, Holzle U. The case for energy-proportional computing. Computer, 2007, 40(12): 33–37
https://doi.org/10.1109/MC.2007.443
31 Li W H, Yang Y, Yuan D. A novel cost-effective dynamic data replication strategy for reliability in Cloud data centres. In: Proceedings of the 9th IEEE International Conference on Dependable, Autonomic and Secure Computing. 2011, 496–502
https://doi.org/10.1109/dasc.2011.95
32 Wei Q, Veeravalli B, Gong B, Zeng L, Feng D. CDRM: A cost-effective dynamic replication management scheme for cloud storage cluster. In: Proceedings of the IEEE International Conference on Cluster Computing. 2010, 188–196
https://doi.org/10.1109/cluster.2010.24
33 Yuan D, Yang Y, Liu X, Chen J J. A data placement strategy in scientific cloud workflows. Future Generation Computer Systems, 2010, 26(8): 1200–1214
https://doi.org/10.1016/j.future.2010.02.004
34 McCormick W T, Sehweitzer P J, White T W. Problem decomposition and data reorganization by a clustering technique. Operations Research, 1972, 20(5): 993–1009
https://doi.org/10.1287/opre.20.5.993
35 Jeffrey D, Sanjay G. MapReduce: simplifed data processing on large clusters. In: Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI). 2004, 137–150
36 Kwan T, Mcgrath R, Reed D. NCSAs World Wide Web server design and performance. Computer, 1995, 28(11): 67–74
https://doi.org/10.1109/2.471181
37 Xie T. SEA: a striping-based energy-aware strategy for data placement in RAID-structured storage systems. IEEE Transactions on Computers, 2008, 57(6): 748–761
https://doi.org/10.1109/TC.2008.27
38 Howell F, Mcnab R. SimJava: a discrete event simulation library for Java. In: Proceedings of the 1st International Conference onWeb-based Modeling and Simulation. 1998
39 Cameron D G, Carvajal-schiaffino R, Millar A P, Nicholson C, Stockinger K, Zini F. UK Grid Simulation with OptorSim. In: Proceedings of UK e-Science All Hands Meeting. 2003
[1]  Supplementary Material Download
[1] Wei ZHENG, Ying WU, Xiaoxue WU, Chen FENG, Yulei SUI, Xiapu LUO, Yajin ZHOU. A survey of Intel SGX and its applications[J]. Front. Comput. Sci., 2021, 15(3): 153808-.
[2] Najme MANSOURI, Mohammad Masoud JAVIDI, Behnam Mohammad Hasani ZADE. Hierarchical data replication strategy to improve performance in cloud computing[J]. Front. Comput. Sci., 2021, 15(2): 152501-.
[3] Jiayang LIU, Jingguo BI, Mu LI. Secure outsourcing of large matrix determinant computation[J]. Front. Comput. Sci., 2020, 14(6): 146807-.
[4] Meysam VAKILI, Neda JAHANGIRI, Mohsen SHARIFI. Cloud service selection using cloud service brokers: approaches and challenges[J]. Front. Comput. Sci., 2019, 13(3): 599-617.
[5] Qiang LIU, Xiaoshe DONG, Heng CHEN, Yinfeng WANG. IncPregel: an incremental graph parallel computation model[J]. Front. Comput. Sci., 2018, 12(6): 1076-1089.
[6] Fei TIAN, Tao QIN, Tie-Yan LIU. Computational pricing in Internet era[J]. Front. Comput. Sci., 2018, 12(1): 40-54.
[7] Xiong FU, Juzhou CHEN, Song DENG, Junchang WANG, Lin ZHANG. Layered virtual machine migration algorithm for network resource balancing in cloud computing[J]. Front. Comput. Sci., 2018, 12(1): 75-85.
[8] Haibao CHEN,Song WU,Hai JIN,Wenguang CHEN,Jidong ZHAI,Yingwei LUO,Xiaolin WANG. A survey of cloud resource management for complex engineering applications[J]. Front. Comput. Sci., 2016, 10(3): 447-461.
[9] Zhaoning ZHANG,Dongsheng LI,Kui WU. Large-scale virtual machines provisioning in clouds:challenges and approaches[J]. Front. Comput. Sci., 2016, 10(1): 2-18.
[10] Bing YU,Yanni HAN,Hanning YUAN,Xu ZHOU,Zhen XU. A cost-effective scheme supporting adaptive service migration in cloud data center[J]. Front. Comput. Sci., 2015, 9(6): 875-886.
[11] Xiong FU,Chen ZHOU. Virtual machine selection and placement for dynamic consolidation in Cloud computing environment[J]. Front. Comput. Sci., 2015, 9(2): 322-330.
[12] Solomon Guadie WORKU,Chunxiang XU,Jining ZHAO. Cloud data auditing with designated verifier[J]. Front. Comput. Sci., 2014, 8(3): 503-512.
[13] Heng WU, Wenbo ZHANG, Jianhua ZHANG, Jun WEI, Tao HUANG. A benefit-aware on-demand provisioning approach for multi-tier applications in cloud computing[J]. Front Comput Sci, 2013, 7(4): 459-474.
[14] Haibo MI, Huaimin WANG, Yangfan ZHOU, Michael Rung-Tsong LYU, Hua CAI, Gang YIN. An online service-oriented performance profiling tool for cloud computing systems[J]. Front Comput Sci, 2013, 7(3): 431-445.
[15] Ling LIU. Computing infrastructure for big data processing[J]. Front Comput Sci, 2013, 7(2): 165-170.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed