Implementation of Effective Data Emplacement Algorithm in Heterogeneous Cloud Environment
DOI:
https://doi.org/10.51983/ajcst-2019.8.S1.1944Keywords:
Big Data, Cloud computing, Hadoop, Data Emplacement, ClusterAbstract
This paper is concerned with the study and implementation of effective Data Emplacement Algorithm in large set of databases called Big Data and proposes a model for improving the efficiency of data processing and storage utilization for dynamic load imbalance among nodes in a heterogeneous cloud environment. With the era of explosive information and data receiving, more and more fields need to deal with massive, large scale of data. A method has been proposed with an improved Data Placement algorithm called Effective Data Emplacement Algorithm with computing capacity of each node as a predominant factor that promotes and improves the efficiency in data processing in a short duration time from large set of data. The adaptability of the proposed model can be obtained by minimizing the time with processing efficiency through the computing capacity of each node in the cluster. The proposed solution improves the performance of the heterogeneous cluster environment by effectively distributing data based on the performance oriented sampling as the experimental results made with word count applications.
References
J. Xie, S. Yin, X. Ruan, Z. Ding, and Y. Tian, "Improving MapReduce Performance through Data Placement in Heterogeneous Hadoop Clusters," in 19th International Heterogeneity in Computing Workshop, Atlanta, Georgia, April 2010.
Y. Fan, W. Wu, H. Cao, H. Zhu, X. Zhao, and W. Wei, "A heterogeneity-aware data distribution and rebalance method in Hadoop cluster," in Seventh ChinaGrid Annual Conference, 2012.
M. Maurya and S. Mahajan, "Performance analysis of MapReduce Programs on Hadoop Cluster," in IEEE World Congress on Information and Communication Technologies, 2012.
W. Zhao, L. Meng, J. Sun, and Y. Ding, "An Improved Data Placement Strategy in a Heterogeneous Hadoop Cluster," The Open Cybernetics & Systemics Journal, 2014.
C.-W. Lee, K.-Y. Hsieh, S.-Y. Hsieh, and H.-C. Hsiao, "A Dynamic Data Placement Strategy for Hadoop in Heterogeneous Environments," Big Data Research, 2014.
S. V. Ambade and P. R. Deshpande, "Heterogeneity-based files placement in Big Data Cluster," in International Conference on Computational Intelligence and Communication Networks, 2015.
V. Ubarhande, "Novel Data-Distribution Technique for Hadoop in Heterogeneous Cloud Environments," IEEE Transactions, 2015.
Ch. B. VishnuVardhan and P. K. Baruah, "Improving the Performance of Heterogeneous Hadoop Cluster," in Fourth International Conference on Parallel, Distributed and Grid Computing (PDGC), 2016.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2019 The Research Publication
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.