Heterogeneous Record Linkage Using CAA

Authors

  • K. Kumaresan PG Scholar, Department Of Information Technology, Anna University of Technology, Coimbatore - 641 047, Tamil Nadu, India

DOI:

https://doi.org/10.51983/ajcst-2013.2.1.1709

Keywords:

Record linkage, Data Linkage, Data Matching, Record Blocking, Datamining

Abstract

Record linkage is a scheme to retrieve the related data’s from more than one table which are not in the same structure and not reside in the same places. Matching techniques facing following problems, (1) no common attribute to match Records between the data tables. (2) Record linkage in online is not an efficient and which provide traffic and may some connectivity failures will occur. (3) Previous techniques will not concentrate on unduplicated error Record (spelling mistakes). Using CAA (Concurrent Attribute Acquisition) and UGK (User Generated Key) approach not all the attributes of the entire remote attribute Records are taken into local site [LS]. Rather only the related attribute Records are taken into LS. So the communication traffic is reduced. Then Local Entity [LE] will be compared with each other Downloaded Remote table Records. Traditional Blocking (Group the record which have relationship from the Data set) to identify the required Records. Misspelled original Record also identified. After this process related Record identified with their identifier and table information. Insert this information on the new table [NT]. Publish NT as a global access Databases.

References

Debabrata Dey, Member, IEEE, Vijay S. Mookerjee, and Dengpan Liu (2011), ‘Efficient Techniques For Online Record Linkage’, IEEE Transactions On Knowledge And Data Engineering, Vol. 23, No. 3, March 2011

Steven N. Minton and Claude Nanjo,Craig A. Knoblock, Martin Michalowski, and Matthew Michelson ‘A Heterogeneous Field Matching Method For Record Linkage’ in part by the Air Force Office of Scientific Research under grant number FA9550-04-1-0105

Peter Christen, The Australian National University, “A Survey Of Indexing Techniques For Scalable Record Linkage And Deduplication”, IEEE Transactions On Knowledge And Data Engineering, Vol. Z, No. Y, Zzzz 2011 2

William E. Winkler, U.S. Bureau of the Census “Matching And Record Linkage”.

Peter Christen,”Probabilistic Data Generation For Deduplication And Data Linkage” Australian Research Council (ARC) Linkage Grant LP0453463 and partially funded by the NSW Department of Health

Peter Christen, Tim Churches “Secure Health Data Linkage And Geocoding: Current Approaches And Research Directions” Australian Research Council (ARC) Linkage Grant LP0453463.

Matthew Michelson, Craig A. Knoblock “Mining Heterogeneous Transformations For Record Linkage” Air Force Office of Scientific Research under grant number FA9550-04-1-0105

Liang Jin, Chen Li, Sharad Mehrotra , University of California, Irvine, CA 92697, USA “Efficient Record Linkage In Large Data Sets”

Peter Christen August 2007 TR-CS-07-03 “Towards Parameter-Free Blocking For Scalable Record Linkage”

Soufiane Boufous, Caroline Finch, Andrew Hayen, Ann Williamson “Data Linkage Of Hospital And Police Crash Datasets In Nsw” NSW Injury Risk Management Research Centre University of New South Wales, Sydney NSW 2052, Australia.

Peter Christen, Tim Churches “Febrl – Freely extensible biomedical record linkage” Australian National University.

Lifang Gu,Rohan Baxter,Deanne Vickers, Chris Rainsford “ Record Linkage: Current Practice and Future Directions ”CSIRO Mathematical and Information Sciences GPO Box 664, Canberra, ACT 2601, Australia,CMIS Technical Report No. 03/83.

Downloads

Published

05-05-2013

How to Cite

Kumaresan, K. (2013). Heterogeneous Record Linkage Using CAA. Asian Journal of Computer Science and Technology, 2(1), 39–44. https://doi.org/10.51983/ajcst-2013.2.1.1709