An Empirical Review on Data Feature Selection and Big Data Clustering

Authors

  • Venkata Rao Maddumala Department of Computer Science and Engineering, Annamalai University, Tamil Nadu, India
  • R. Arunkumar Department of Information Technology, Vignan’s Nirula Institute of Technology of Science for Women, Andhra Pradesh, India
  • S. Arivalagan Department of Information Technology, Vignan’s Nirula Institute of Technology of Science for Women, Andhra Pradesh, India

DOI:

https://doi.org/10.51983/ajcst-2018.7.S1.1796

Keywords:

Big Data, Clustering, Feature Selection

Abstract

With the fast advancement of the Big Data, Big Data innovations have risen as a key data investigation apparatus, in which, feature extraction and data bunching calculations are considered as a basic part for data examination. Nonetheless, there has been constrained research that tends to the difficulties crosswise over Big Data and along these lines proposing an exploration motivation is vital to illuminate the examination challenges for bunching Big Data. By handling this particular viewpoint – grouping calculation in Big Data, this paper looks at on Big Data advancements, identified with feature determination and data bunching calculations and conceivable uses. In view of our survey, this paper distinguishes an arrangement of research difficulties that can be utilized as an exploration plan for the Big Data bunching research. This exploration plan goes for distinguishing and crossing over the examination holes between Big Data feature choice and grouping calculations.

References

N. Al-Madi, I. Aljarah, and S. A. Ludwig, "Parallel glowworm swarm optimization clustering algorithm based on MapReduce," in IEEE Symposium on Swarm Intelligence, 2014.

A. Amini, T. Y. Wah, and H. Saboohi, "On density-based data streams clustering algorithms: a survey," in Journal of Computer Science and Technology, vol. 29, no. 1, pp. 116-141, 2014.

A. Akbar et al., "Context-aware stream processing for distributed IoT applications In Internet of Things (WF-IoT)," in 2015 IEEE 2nd World Forum, pp. 663-668, Dec. 2015.

E. Ahmed and M. H. Rehmani, "Mobile edge computing: opportunities, solutions, and challenges," pp. 59-63, 2017.

P. Berkhin, "A survey of clustering data mining techniques," in Grouping multidimensional data, Springer Berlin Heidelberg, pp. 25-71, 2006.

Y. Chen et al., "Big Data analytics and Big Data science: a survey," in Journal of Management Analytics, vol. 3, no. 1, pp. 1-42, 2016.

D. Xu et al., "Internet of things in industries: A survey," in IEEE Transactions on Industrial Informatics, vol. 10, no. 4, pp. 2233-2243, 2014.

P. D'Urso et al., "Exponential distance-based fuzzy clustering for interval-valued data," in Fuzzy Optimization and Decision Making, vol. 16, no. 1, pp. 51-70, 2017.

S. K. Dash et al., "Privacy preserving K-Medoids clustering: an approach towards securing data in Mobile cloud architecture," in 2nd International Conference on Computational Science, Engineering and Information Technology, pp. 439-443, ACM, 2012.

A. K. Dey, "Understanding and using context," in Personal and Ubiquitous Computing, vol. 5, no. 1, pp. 4-7, 2001.

I. El Naqa and M. J. Murphy, "What Is Machine Learning?" in Machine Learning in Radiation Oncology, Springer International Publishing, pp. 3-11, 2015.

A. Fahad et al., "A survey of clustering algorithms for Big Data: Taxonomy and empirical analysis," in IEEE Transactions on Emerging Topics in Computing, vol. 2, no. 3, pp. 267-279, 2014.

S. B. Fredj et al., "A scalable IoT service search based on clustering and aggregation," in Green Computing and Communications (GreenCom), 2013 IEEE and Internet of Things, pp. 403-410, 2013.

K. Guo et al., "Community discovery by propagating local and global information based on the MapReduce model," in Information Sciences, vol. 323, pp. 73-93, 2015.

P. Goyal et al., "A Fast, Scalable SLINK Algorithm for Commodity Cluster Computing Exploiting Spatial Locality," in High Performance Computing and Communications; IEEE 14th International Conference on Smart City, 2016.

T. C. Havens et al., "Scalable single linkage hierarchical clustering for Big Data," in Intelligent Sensors, Sensor Networks and Information Processing, 2013 IEEE Eighth International Conference on. IEEE, 2013.

M. S. Hossain et al., "Impact of Next-Generation Mobile Technologies on IoT-Cloud Convergence," in IEEE Communications Magazine, vol. 55, no. 1, pp. 18-19, 2017.

D. Jiang and G. Liu, "An Overview of 5G Requirements," in 5G Mobile Communications, Springer International Publishing, pp. 3-26, 2017.

R. Kitchin, "Big Data—Hype or revolution," in The SAGE Handbook of Social Media Research Methods, 2017.

Y. Liu et al., "A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression," in BMC Bioinformatics, vol. 15, no. 1, pp. 37, 2014.

C. Lin et al., "A parallel Cop-K means clustering algorithm based on MapReduce framework," in Knowledge Engineering and Management, pp. 93-102, 2011.

Y. Li et al., "A grouping method based on grid density and relationship for crowd evacuation simulation," in Physical A: Statistical Mechanics and its Applications, 2017.

G. Manogaran et al., "Big Data Knowledge System in Healthcare," in Internet of Things and Big Data Technologies for Next Generation Healthcare, pp. 133-157, Springer International Publishing, 2017.

R. Van Kranenburg, "The Internet of Things: A critique of ambient technology and the all-seeing network of RFID," Institute of Network Cultures, 2008.

R. Rafailidis et al., "Landmark selection for spectral clustering based on Weighted Page Rank," in Future Generation Computer Systems, vol. 68, pp. 465–472, 2017.

A. S. Shirkhorshidi et al., "Big Data clustering: a review," in International Conference on Computational Science and Its Applications, pp. 707-720, 2014.

S. Srirama et al., "Adapting scientific computing problems to clouds using MapReduce," in Future Generation Computer Systems, vol. 28, no. 1, pp. 184-192, 2012.

A. Sreenivasulu et al., "Review of Clustering Techniques," in International Conference on Data Engineering and Communication Technology. Springer Singapore, 2017.

R. Van Kranenburg, "The Internet of Things: A critique of ambient technology and the all-seeing network of RFID," Institute of Network Cultures, 2008.

R. Xu et al., "Internet of things in industries: A survey," in IEEE Transactions on Industrial Informatics, vol. 10, no. 4, pp. 2233-2243, 2014.

L. Xu et al., "Iterative Big Data clustering algorithms: a review," in Software: Practice and Experience, vol. 46, no. 1, pp. 107-129, 2016.

A. Mohebi et al., "Parallel two-phase K-means," in International Conference on Computational Science and Its Applications. Springer Berlin Heidelberg, 2013.

R. Ng and J. Han, "CLARANS: A method for clustering objects for spatial data mining," in IEEE Transactions on Knowledge and Data Engineering, vol. 14, no. 5, pp. 1003-1016, 2002.

D. Pandove and S. Goel, "A comprehensive study on clustering approaches for Big Data mining," in Electronics and Communication Systems (ICECS), 2015 2nd International Conference on IEEE, 2015.

Downloads

Published

08-11-2018

How to Cite

Maddumala, V. R., Arunkumar, R., & Arivalagan, S. (2018). An Empirical Review on Data Feature Selection and Big Data Clustering. Asian Journal of Computer Science and Technology, 7(S1), 96–100. https://doi.org/10.51983/ajcst-2018.7.S1.1796