Comparative Study of Rainfall Prediction Modeling Techniques (A Case Study on Srinagar, J&K, India)

Authors

  • Razeef Mohd Student, University of Kashmir, Jammu and Kashmir, India
  • Muheet Ahmed Butt Scientist-D, Department of Computer Science, University of Kashmir, Jammu and Kashmir, India
  • Majid Zaman Baba Scientist-D, Directorate of IT&SS, University of Kashmir, Jammu and Kashmir, India

DOI:

https://doi.org/10.51983/ajcst-2018.7.3.1901

Keywords:

Rainfall Prediction, Data Mining, J48, Random Forest, IBk, Naive Bayesian, Bagging

Abstract

Prediction of rainfall is one of the most essential and demanding tasks for the weather forecasters since ages. Rainfall prediction plays an important role in the field of farming and industries. Precise rainfall prediction is vital for detecting the heavy rainfall and to provide the information of warnings regarding the natural calamities. Rainfall prediction involves recording the various parameters of weather like wind direction, wind speed, humidity, rainfall, temperature etc. From last few decades, it has been seen that data mining techniques have achieved good performance and accuracy in weather prediction than traditional statistical methods. This research work aims to compare the performance of few data mining algorithms for predicting rainfall using historical weather data of Srinagar, India, which is collected from http://www.wundergrounds.com website. From the collected weather data which comprises of 9 attributes, only 5 attributes which are most relevant to rainfall prediction are considered. Data mining process model is followed to obtain accurate and correct prediction results. In this paper, various data mining algorithms were explored which include decision tree based J48, Random forest, Naive Bayes, Bayes Net, Logistic Regression, IBk, PART and bagging. The experimental results show that J48 algorithm has good level of accuracy than other algorithms.

References

F. Olaiya and B. A. Adeyemo, "Application of data mining techniques in weather prediction and climate change studies," International Journal of Information Engineering and Electronic Business, vol. 4, no. 1, pp. 51, 2012.

D. Sanjay, K. P. Wagh, and P. N. Chatur, "Classification and prediction of future weather by using back propagation algorithm-an approach," International Journal of Emerging Technology and Advanced Engineering, vol. 2, no. 1, pp. 110-113, 2012.

M. Kannan, S. Prabhakaran, and P. Ramachandran, "Rainfall forecasting using data mining technique," 2010.

Wikipedia contributors, "Srinagar," Wikipedia, The Free Encyclopedia, [Online]. Available: https://en.wikipedia.org/wiki/Srinagar. [Accessed: Date].

B. Nikam, V. Valmik, and B. B. Meshram, "Modeling rainfall prediction using data mining method: A Bayesian approach," Computational Intelligence, Modelling and Simulation (CIMSim), 2013 Fifth International Conference on. IEEE, 2013.

F. Nhita, "A rainfall forecasting using fuzzy system based on genetic algorithm," Information and Communication Technology (ICoICT), 2013 International Conference of IEEE, 2013.

S. Mahajan and H. Mazumdar, "Rainfall Prediction using Neural Net based Frequency Analysis Approach," International Journal of Computer Applications, vol. 84, no. 9, 2013.

A. Geetha and G. M. Nasira, "Data mining for meteorological applications: Decision trees for modeling rainfall prediction," Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on. IEEE, 2014.

P. Dutta, P. S. Saikia, and H. Tahbilder, "Prediction of rainfall using data mining technique over Assam," IJCSE, vol. 5, no. 2, 2014, pp. 85-90.

A. Sharma and M. K. Goyal, "Bayesian network model for monthly rainfall forecast," Research in Computational Intelligence and Communication Networks (ICRCICN), 2015 IEEE International Conference on IEEE, 2015.

A. Dubey and D. Akash, "Artificial neural network models for rainfall prediction in Pondicherry," International Journal of Computer Applications, vol. 120, no. 3, 2015.

Weather Underground, "Weather History for Srinagar," [Online]. Available: https://www.wunderground.com/history/airport/VISR/2015/11/6/CustomHistory.html?dayend=6&monthend=11&yearend=2016&req_city=&req_state=&req_statename=&reqdb.zip=&reqdb.magic=&reqdb.wmo=. [Accessed: Date].

B. Ahmed, "Predictive capacity of meteorological data: Will it rain tomorrow?," Science and Information Conference (SAI), 2015, IEEE, 2015.

D. Hand, H. Mannila, and P. Smyth, "Principles of data mining," MIT, 2001.

V. Kotu and B. Deshpande, Predictive analytics and data mining: concepts and practice with rapidminer, Morgan Kaufmann, 2014.

T. Wang, W. Li, H. Shi, and Z. Liu, "Software defect prediction based on classifiers ensemble," Journal of Information & Computational Science, vol. 8, no. 16, pp. 4241-4254, 2011.

I. H. Witten and E. Frank, Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann, Los Altos, US, 2005.

R. O. Duda and P. E. Hart, Pattern classification and scene analysis, John Wiley and Sons, 1973.

P. Langley, W. Iba, and K. Thompson, "An analysis of Bayesian Classifiers," in Proceedings of the Tenth National Conference on Artificial Intelligence, San Jose, CA, 1992.

A. Mccallum and K. Nigam, "A Comparison of Event Models for Naive Bayes Text Classification," Proceedings of the 15th National Conference on Artificial Intelligence (AAAI-98)-Workshop on Learning for Text Categorization, pp. 41-48, 1998.

T. Menzies, J. Greenwald, and A. Frank, "Data Mining Static Code Attributes to Learn Defect Predictors," IEEE Transactions on Software Engineering, vol. 33, no. 1, 2-13, 2007.

L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.

L. Guo, Y. Ma, B. Cukic, and H. Singh, "Robust prediction of fault-proneness by random forests," in Proc. of the 15th International Symposium on Software Reliability Engineering ISSRE’04, pp. 417-428, 2004.

Y. Jiang, B. Cukic, T. Menzies, and N. Bartlow, "Comparing design and code metrics for software quality prediction," Proc. Fourth Int. Workshop on Predictor Models in Software Engineering, PROMISE’08, New York, USA, 2008, pp. 11-18.

Wikipedia contributors, "Logistic regression," Wikipedia, The Free Encyclopedia, [Online]. Available: https://en.wikipedia.org/wiki/Logistic_regression. [Accessed: Date].

D. Aha, "Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms," Int. J. Man-Machine Studies, vol. 36, 267-287, 1992.

R. Mohammad, M. B. Ahmed, and M. B. Zaman, "Tools for Predictive Analytics: An Overview," International Journal of Scientific Research Engineering & Technology (IJSRET), vol. 6, no. 7, pp. 748-750, 2017.

R. M. Shah, M. A. Butt, and M. Z. Baba, "Predictive Analytics Modeling: A Walkthrough," Int. J. Adv. Res. Comput. Sci. Softw. Eng., vol. 7, no. 6, pp. 421-426, June 2017.

R. M. Shah, M. A. Butt, and M. Z. Baba, "Review of Predictive Analytic Modeling techniques," International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), vol. 6, no. 4, pp. 58-62, 2017.

R. Mohammad, M. B. Ahmed, and M. B. Zaman, "Predictive Analytics: An Application Perspective," International Journal of Computer Engineering and Applications, vol. 9, no. 8, Aug. 2017.

Downloads

Published

04-10-2018

How to Cite

Mohd, R., Ahmed Butt, M., & Baba, M. Z. . (2018). Comparative Study of Rainfall Prediction Modeling Techniques (A Case Study on Srinagar, J&K, India). Asian Journal of Computer Science and Technology, 7(3), 13–19. https://doi.org/10.51983/ajcst-2018.7.3.1901