Nature Inspired Data Placement Strategy in Distributed Cloud Environment using Improved Firefly Algorithm

International Journal of Electronics and Communication Engineering
© 2023 by SSRG - IJECE Journal
Volume 10 Issue 8
Year of Publication : 2023
Authors : B. Prabhu Shankar, H. Najmusher, N. Rajkumar, R. Jayavadivel, C. Viji, S. M. Nandha Gopal
pdf
How to Cite?

B. Prabhu Shankar, H. Najmusher, N. Rajkumar, R. Jayavadivel, C. Viji, S. M. Nandha Gopal, "Nature Inspired Data Placement Strategy in Distributed Cloud Environment using Improved Firefly Algorithm," SSRG International Journal of Electronics and Communication Engineering, vol. 10,  no. 8, pp. 59-67, 2023. Crossref, https://doi.org/10.14445/23488549/IJECE-V10I8P106

Abstract:

The execution of scientific applications needs high-processing computers and requires massive storage. This resulted in deploying applications in a distributed environment with high performance and extensive storage. Applications processed in cloud platforms face intolerable delays due to data movement across the centres. Optimized distribution of datasets among the global data centres has become an essential issue in the distributed cloud environment. This work proposes an improved data placement called IFA Data Placement (IFA-DP) method for a heterogeneous cloud environment. An effective and efficient optimal data placement strategy is proposed using a metaheuristic global optimisation firefly algorithm. The metaheuristic behavior of fireflies finds a better optimal solution. The primary aim of this work is to reduce the response time and execution cost, which is then proved by the simulation results. The access time of the Proposed IFA-DP is less by at least 2s compared to the existing methods.

Keywords:

Data placement, Firefly algorithm, Metaheuristic optimization, Cloud computing.

References:

[1] Tevfik Kosar, and Miron Livny, “A Framework for Reliable and Efficient Data Placement in Distributed Computing Systems,” Journal of Parallel and Distributed Computing, vol. 65, no. 10, pp. 1146-1157, 2005.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Huan Liu, and Dan Orban, “GridBatch: Cloud Computing for Largescale Data-Intensive Batch Applications,” 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID), Lyon, France, pp. 295-305, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Ewa Deelman et al., “The Cost of Doing Science on the Cloud: The Montage Example,” SC ’08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, Austin, Tx, USA, pp. 1-12, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[4] T. Kosar, and M. Livny, “Stork: Making Data Placement a First-Class Citizen in the Grid,” 24th International Conference on Distributed Computing Systems, Tokyo, Japan, pp. 342-349, 2004.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Ewa Deelman, and Ann Chervenak, “Data Management Challenges of Data-Intensive Scientific Workflows,” 2008 8th IEEE International Symposium on Cluster Computing and the Grid (CCGRID), Lyon, France, pp. 687-692, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[6] S. Veerapandi, R. Surendiran, and K. Alagarsamy, “Enhanced Fault Tolerant Cloud Architecture to Cloud-Based Computing using Both Proactive and Reactive Mechanisms,” DS Journal of Digital Science and Technology, vol. 1, no. 1, pp. 32-40, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Lili Qiu, V. N. Padmanabhan, and G. M. Voelker, “On the Placement of Web Server Replicas,” Proceedings IEEE INFOCOM 2001,Conference on Computer Communications, Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213), Anchorage, AK, USA, vol. 3, pp. 1587-1596, 2001.
[CrossRef] [Google Scholar] [Publisher Link]
[8] K. R. Pattipati, and J. L. Wolf, “A File Assignment Problem Model for Extended Local Area Network Environments,” International Conference on Distributed Computing Systems, Paris, France, pp. 554-561, 1990.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Shyamala Doraimani, and Adriana Iamnitchi, “File Grouping for Scientific Data Management: Lessons from Experimenting with Real Traces,” HPDC ’08: Proceedings of the 17th International Symposium on High Performance Distributed Computing, Boston, pp. 153- 164, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Gilles Fedak, Haiwu He, and Franck Cappello, “BitDew: A Programmable Environment for Large-Scale Data Management and Distribution,” SC ’08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, Austin, TX, USA, pp. 1-12, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Marcel Chibuzor Amaechi, Matthias Daniel, and Bennett E O, “Data Storage Management in Cloud Computing using Deduplication Technique,” SSRG International Journal of Computer Science and Engineering, vol. 7, no. 7, pp. 1-7, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Zheng Pai et al., “A Data Placement Strategy for Data-Intensive Applications in Cloud,” Chinese Journal of Computers, vol. 33, no. 8, pp. 1472-1480, 2010.
[Google Scholar] [Publisher Link]
[13] Dharma Nukarapu et al., “Data Replication in Data Intensive Scientific Applications with Performance Guarantee,” IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 8, pp. 1299-1306, 2011.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Zhao Er-Dun et al., “A Data Placement Strategy Based on Genetic Algorithm for Scientific Workflows,” 2012 8th International Conference on Computational Intelligence and Security, Guangzhou, China, pp. 146-149, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Nishu Rana, and Pardeep Kumar, “Random Walk-Based ACO Load Balancing Algorithm for Cloud Computing Environment,” International Journal of P2P Network Trends and Technology, vol. 8, no. 6, pp. 8-15, 2018.
[Publisher Link]
[16] Wei Guo, and Xinjun Wang, “A Data Placement Strategy Based on Genetic Algorithm in the Cloud Computing Platform,” 2013 10th Web Information System and Application Conference, Yangzhou, China, pp. 369-372, 2013.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Thomas Back, Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms, Oxford University Press, 1996.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Melanie Mitchell, An Introduction to Genetic Algorithms, MIT Press, 1998.
[Google Scholar] [Publisher Link]
[19] Kalyanmoy Deb et al., “A Fast and Elitist Multiobjective Genetic Algorithm: NSGA-II,” IEEE Transactions on Evolutionary Computation, vol. 6, no. 2, pp. 182-197, 2002.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Zhuo Tang et al., “A Data Skew Oriented Reduce Placement Algorithm Based on Sampling,” IEEE Transactions on Cloud Computing, vol. 8, no. 4, pp. 1149-1161, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Mohammad Javad Abbasi, and Mehrdad Mohri, “Scheduling Tasks in the Cloud Computing Environment with the Effect of Cuckoo Optimization Algorithm,” SSRG International Journal of Computer Science and Engineering, vol. 3, no. 8, pp. 1-9, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Neha Thakkar, and Rajender Nath, “Discrete Artificial Bee Colony Algorithm for Load Balancing in Cloud Computing Environment,” International Journal of P2P Network Trends and Technology, vol. 8, no. 6, pp. 1-7, 2018.
[Publisher Link]
[23] Qing Zhao, Congcong Xiong, and Peng Wang, “Heuristic Data Placement for Data-Intensive Applications in Heterogeneous Cloud,” Journal of Electrical and Computer Engineering, vol. 2016, pp. 1-8, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Lizheng Guo et al., “A Particle Swarm Optimization for Data Placement Strategy in Cloud Computing,” Information Engineering and Applications, pp. 946-953, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Manmohan Chaubey, and Erik Saule, “Replicated Data Placement for Uncertain Scheduling,” 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, Hyderabad, India, pp. 464-472, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Divyakant Agrawal, Sudipto Das, and Amr El Abbadi, “Big Data and Cloud Computing: Current State and Future Opportunities,” EDBT/ICDT ’11: Proceedings of the 14th International Conference on Extending Database Technology, Uppsala, pp. 530-533, 2011.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Xin-She Yang, “Firefly Algorithms for Multimodal Optimization,” Stochastic Algorithms: Foundations and Applications, pp. 169-178, 2009.
[CrossRef] [Google Scholar] [Publisher Link]
[28] S. Veerapandi, R. Surendiran, and K. Alagarsamy, “Live Virtual Machine Pre-copy Migration Algorithm for Fault Isolation in Cloud Based Computing Systems,” DS Journal of Digital Science and Technology, vol. 1, no. 1, pp. 23-31, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Michael Armbrust et al., “A View of Cloud Computing,” Communications of the ACM, vol. 53, no. 4, pp. 50-58, 2010.
[CrossRef] [Google Scholar] [Publisher Link]
[30] Peter Mell, and Timothy Grance, “The NIST Definition of Cloud Computing,” Thesis, National Institute of Standards and Technology, 2011.
[Google Scholar] [Publisher Link]
[31] Eduardo Pinheiro, and Ricardo Bianchini, “Energy Conservation Techniques for Disk Array-Based Servers,” ICS ’04: Proceedings of the 18th Annual International Conference on Supercomputing, pp. 68-78, 2004.
[CrossRef] [Google Scholar] [Publisher Link]
[32] B. Prabhu Shankar, and S. Chitra, “Optimal Data Placement and Replication Approach for SIoT with Edge,” Computer Systems Science and Engineering, vol. 41, no. 2, pp. 661-676, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[33] B. Prabhu Shankar et al., “Energy-Efficient Data Offloading using Data Access Strategy-Based Data Grouping Scheme,” SSRG International Journal of Electronics and Communication Engineering, vol. 10, no. 5, pp. 28-37, 2023.
[CrossRef] [Publisher Link]
[34] T. V. V. Satyanarayana et al., “A Secured IoT-Based Model for Human Health through Sensor Data,” Measurement: Sensors, vol. 24, p. 100516, 2022.
[CrossRef] [Google Scholar] [Publisher Link]