Fashions in Data Mining and Hidden Knowledge Innovation from Clinical Database
DOI:
https://doi.org/10.51983/ajcst-2012.1.2.1704Keywords:
Data Mining, Hidden Knowledge Innovation, Clinical DatabaseAbstract
Data Mining and Hidden Knowledge Innovation (DMHKI) is one of the fast growing computer science fields. Its reputation is caused by an increased demand for tools that help with the analysis and understanding of huge amounts of data in Clinical Database. Such data are generated on a daily basis by Hospitals. This explosion came into being through the ever increasing use of computers, scanners, digital cameras, bar codes, etc. We are in a situation when rich sources of data, stored in databases, warehouses, and other data repositories, are readily available. This in turn causes big interest of medicalsocieties in the field of DMHKI. What is needed is a clear and simple methodology for extracting the knowledge that is hidden in the database. In this chapter, an integrated DMHKI process model based on the emerging technologies like XML, PMML, SOAP, UDDI, and OLE BD-DM is introduced. These technologies help designing flexible, semi-automated, and easy to use DMHKI model. They enable the building of knowledge repositories. They allow for communication between several data mining tools, databases and knowledge repositories. They also enable integration and automation of DMHKI tasks. The Journal describes a Seven-step DMHKI process model, the above mentioned technologies, and their implementation details.
References
S. Banerjee, V. Krishnamurthy, M. Krishnaprasad and R. Murthy, “Oracle8i – The XML Enabled Data Management System”, Proceedings of the Sixteenth International Conference on Data Engineering, San Diego, California, 2007, pp. 561-568.
R. Bourret, C. Bornhvd, and A.P. Buchmann, “A Generic Load/ Extract Utility for Data Transfer Between XML Documents and Relational Databases”, Proceeding of the 2nd International Workshop on Advanced Issues of E-Commerce and Web-based Information Systems, San Jose, California, June, 2008, pp.134-143.
L. Kurgan, K.J. Cios, M. Sontag, and F.J. Accurso, “Mining a Cystic Fibrosis Database”, In: Zurada, J., and Kantardzic, M. (Eds.), Novel Applications in Data Mining, submitted, 2006.
L.Kurgan and K.J. Cios, “DataSqueezer Algorithm that Generates Small Number of Short Rules”, Submitted to the IEE Proceedings: Vision, Image and Signal Processing, 2008.
K.K. Hirji, “Exploring Data Mining Implementation”, Communications of the ACM, Vol. 44, No. 7, July 2008, pp. 87-93.
DMG, The Data Mining Group, http://www.dmg.org/, 2009
T. Bray, J. Paoli and E. Maler, “Extensible Markup Language (XML) 1.0 (Second Edition)”, W3C Recommendation, http://www.w3.org/ TR/2000/REC-xml-20001006, October 2008.
P. Buneman, M.F. Fernandez, D.Suciu, “UnQL: A Query Language and Algebra for Semistructured Data Based on Structural Recursion”,Very Large Data Bases Journal, Vol. 9, No. 1, 2008, pp.76-110.
S. Cluet and J.Simeon, YATL: A Functional and Declarative Language for XML, draft manuscript, 2007.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2012 The Research Publication
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.